Gene Information |
Locus tag | Ndas_4058 |
Symbol | |
ID | 9247930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4854227 |
End bp | 4855426 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | monosaccharide-binding protein |
Protein accession | YP_003681960 |
Protein GI | 297562986 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGCGTCC CCCGCGTGCT GCTCGCCGGC GCCGCCGGCC TGACCCTGAC GCTCACGGCG TGTACGACGG ACGCCCCCAC CGACGCCCCC GAGGAGGCGG GCTCCGAACT CACCCCCGAC GGGGAGTGGT TCGACGAGGC CGAGTTCGAG GCGCAGCTGG CCCAGCGCGA GATCACCCCG GAGGGCCCCG AGGACCAGCC CTGGCTCCAG GCGATCGAGC CCGAGTGGAT CGACACCTCG GAGTTCACGC ACGACGCGCC AGAGGACGCG ACGCTGTGCT TCTCCAACGC CTCGGTGTCC AACCCCTGGC GCGTCACCGG CTTCATCACC ATGGAGCAGC AGGTGGAGGC GCTCCAGGAG GAGGGGCGCA TCGGCGAGTT CCGCGTGTCG GACGCCGCCG ACGACGACAA CCAGCAGATC TCCGACATCC AGGCCTTCGT GGACTCCGGG GACTGCGACG TCATCATCAT CTCCCCCTCC ACTACCGCGA CCCTGACCCC GGCGGTGGAG ACCGCCTGCG AGAGCGGCGT CCCGGTCGTG GTCTTCGACC GCGGCGTGAA CAGCGACTGC ATGGTCACGT TCATCCACCC GATCGGCGGC TACGCCTACG GCGCGGACGC GGCCGAGTTC CTGGTCGATG AGCTGGAGCC CGGCTCGACC GTGCTGGCGC TGCGCATCCT GCCCGGCGTG GACGTGCTCG AACACCGCTG GGCGGCGGCC CAGGAGGTCT TCGCCGACAG CGAGCTGGAG GTGCTCGGCC ACGAGTTCAC CGAGGGCGAC GGCGCCATGA TCAAGGACCT GGTCTCCCAG CACCTCCAGC GCGGCGAGGT CGACGGCATC TGGATGGACG CCGGGGACGG CGCCGTGGCC GCCCTGGAGG CCTTCGAGGA CGCGGGCCAG CCCTACCCGG TGATCTCCGG TGAGGACGAG CTGAGCTTCA TGCGCAAGTG GCAGGAGGAG GACCTCACCG CGATCGCGCC CGTCTACTCC AACTTCCAGT GGCGGACCCC GGTCCTGGCC GCCGGCATGA TCCTCGCCGG CGAGGAGGTG CCCTCGGAGT GGATCCTGCC GCAGGAGCCG ATCCGTCAGG ACGAGCTGGA CGAGTACCTG GAGCGCAACG CGGAGATGCC GTCCCTGCAC TACGCGAAGT TCGGCGGCGA GGACCTGCCG GGCTTCCCCG AGGCCTGGAC GGACCGGTAG
Protein sequence | MRVPRVLLAG AAGLTLTLTA CTTDAPTDAP EEAGSELTPD GEWFDEAEFE AQLAQREITP EGPEDQPWLQ AIEPEWIDTS EFTHDAPEDA TLCFSNASVS NPWRVTGFIT MEQQVEALQE EGRIGEFRVS DAADDDNQQI SDIQAFVDSG DCDVIIISPS TTATLTPAVE TACESGVPVV VFDRGVNSDC MVTFIHPIGG YAYGADAAEF LVDELEPGST VLALRILPGV DVLEHRWAAA QEVFADSELE VLGHEFTEGD GAMIKDLVSQ HLQRGEVDGI WMDAGDGAVA ALEAFEDAGQ PYPVISGEDE LSFMRKWQEE DLTAIAPVYS NFQWRTPVLA AGMILAGEEV PSEWILPQEP IRQDELDEYL ERNAEMPSLH YAKFGGEDLP GFPEAWTDR
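
The length fields in this record are mutually consistent, and the relationship can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive (the record does not state this, but the reported gene length only matches under that convention) and that the final codon of the unspliced CDS is a stop codon that contributes no residue. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the record above.
START_BP = 4854227
END_BP = 4855426
GENE_LENGTH_BP = 1200
PROTEIN_LENGTH_AA = 399


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(sequence):
    """Fraction of G and C bases, ignoring the spaces between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


print(gene_length(START_BP, END_BP))   # 1200
print(protein_length(GENE_LENGTH_BP))  # 399
print(gc_fraction("ATGCGCGTCC"))       # 0.7 (first ten bases only)
```

Passing the full gene sequence string to `gc_fraction` would give the value to compare against the reported GC content; the call above uses only the first block of ten bases to keep the example short.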