Gene Information |
Locus tag | Ndas_2365 |
Symbol | |
ID | 9246215 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2813315 |
End bp | 2814565 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003680293 |
Protein GI | 297561319 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
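The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends with a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int, has_stop_codon: bool = True) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa).

    Assumes 1-based inclusive coordinates on an unspliced CDS.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError(f"CDS length {gene_len} is not a multiple of 3")
    codons = gene_len // 3
    # Assumption: the final codon is a stop codon and is not translated.
    protein_len = codons - 1 if has_stop_codon else codons
    return gene_len, protein_len


# Values taken from the record for Ndas_2365.
gene_len, protein_len = cds_lengths(2813315, 2814565)
print(gene_len, protein_len)  # 1251 416
```

Under these assumptions the result agrees with the listed Gene Length (1251 bp) and Protein Length (416 aa).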
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.618021 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAGAC AGAGACCGCG GGCCACCGCC GTGGCCGCCG CGCTGTCGGC GGCCGCGCTG CTCGCCGCGG GGTGCGGTGG GGACGGGGGC GGGGACCCGA ACACCATCGA GCTGGTCGTG GCGCAGTACA CCGAGGGGAC CCAGCCCTAC TGGACCGACC TGATCCAGGA CTTCGAGGCC GACCACCCCG GCACGAGCGT CCGGCTGCGG GTGATCGGCT GGGACGACCT CCAGAACCAG GTCAACACGA TGGTGCAGAC CCGGCAGTTC CCCGACATCC TCAACACGAA CCTCTTCGCC GACTACGCCG AGGCGGGCCT GCTGCACCCG GCGCGGGACG TGCTGCCCGA GGACAAGTTC ACCGACTTCG TCCCGGTCCT GGCCGAGAAC GCCTCGCTGG AGGGTGAGCA GTACGCCCTG CCGTTCGTCG CGACGGTGAA CGCGATGTAC TACAACCGGA CCATCTTCGC CGAGGCCGGG ATCAGCGAGC CCCCGCGGAC CTGGGACGAG TTCCTGGAGG CGGCGGAGCG CGTCAAGGCG CTCCCCGGCG ACCACGTCCC CTACGCGCTG GCGCTGGGCT TCGACGGCGG CGACTACGAG TTCGGCACGT GGGCGCGCTC CAACGGCGGC GGCTGGAAGC AGGACGGCGA GTGGACGGTC AACAGCGACC GCAACGTCGC CACGCTGGAG TTCCTCCGGG ACCTGGTGGT GGAGCACGAA GCCACCCAGC CCAACCCCGG GCAGACCAAC CGCCCCGACG GCACGTGGCC GCTCTTCGCC CAGGGCAGGG CCGCCATGGT GTACGCGCCG CTGGGCGGCA GCGCGTTCCT GGACCCGGTG CACGAGGCGG GCGTGGACTA CGGCACGACG ACCCACCCGA CCAACGGCGG CGCCGAGCCC TCCACCCACG GCATCCAGGA CTACCTGGTG GCCTTCGACA ACCCCGGCAA CCAGGAGCTG GTCACCGAGT TCCTGGACTA CTTCTACGAA CCGGAGAACT ACACCGCCTA CCTGGAGGTC GAGGGGCTGC TGCCGACCAC CGAGTCCGGC GTCGAGGAGT TCCGCGACGA CCCCGACGTG GGGCAGTACG TCGAGCAGAT CCCCGAGGCA CGGCTGGACC CCACCTACGA ACCGGTCTGG GCCCAGCTGC GCGGCACGAT GGCCGGGGAG CTGGGCACGG CCGTGGCCCC GGACGGGGAC CCGCGCGCCG TCCTGGACAG GGGCCAGGAG ATCGCCGCCT CCGGCCCGTG A
|
Protein sequence | MRRQRPRATA VAAALSAAAL LAAGCGGDGG GDPNTIELVV AQYTEGTQPY WTDLIQDFEA DHPGTSVRLR VIGWDDLQNQ VNTMVQTRQF PDILNTNLFA DYAEAGLLHP ARDVLPEDKF TDFVPVLAEN ASLEGEQYAL PFVATVNAMY YNRTIFAEAG ISEPPRTWDE FLEAAERVKA LPGDHVPYAL ALGFDGGDYE FGTWARSNGG GWKQDGEWTV NSDRNVATLE FLRDLVVEHE ATQPNPGQTN RPDGTWPLFA QGRAAMVYAP LGGSAFLDPV HEAGVDYGTT THPTNGGAEP STHGIQDYLV AFDNPGNQEL VTEFLDYFYE PENYTAYLEV EGLLPTTESG VEEFRDDPDV GQYVEQIPEA RLDPTYEPVW AQLRGTMAGE LGTAVAPDGD PRAVLDRGQE IAASGP
|
| |
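The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing, compute a GC fraction, and translate the gene sequence. It is illustrative only: the helper names are hypothetical, and the codon table is the standard set of codon assignments, which translation table 11 shares for elongation (table 11 differs in which codons may act as start codons). The example runs on the first three blocks of the gene sequence only, so its GC value applies to that excerpt, not to the whole gene.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order (shared by translation table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
excerpt = clean_sequence("ATGAGGAGAC AGAGACCGCG GGCCACCGCC")
print(translate(excerpt))  # MRRQRPRATA
print(gc_fraction(excerpt))
```

The translated excerpt matches the first block of the listed protein sequence (MRRQRPRATA).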