Gene Ndas_4058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4058 
Symbol 
ID9247930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4854227 
End bp4855426 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content71% 
IMG OID 
Productmonosaccharide-binding protein 
Protein accessionYP_003681960 
Protein GI297562986 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGTCC CCCGCGTGCT GCTCGCCGGC GCCGCCGGCC TGACCCTGAC GCTCACGGCG 
TGTACGACGG ACGCCCCCAC CGACGCCCCC GAGGAGGCGG GCTCCGAACT CACCCCCGAC
GGGGAGTGGT TCGACGAGGC CGAGTTCGAG GCGCAGCTGG CCCAGCGCGA GATCACCCCG
GAGGGCCCCG AGGACCAGCC CTGGCTCCAG GCGATCGAGC CCGAGTGGAT CGACACCTCG
GAGTTCACGC ACGACGCGCC AGAGGACGCG ACGCTGTGCT TCTCCAACGC CTCGGTGTCC
AACCCCTGGC GCGTCACCGG CTTCATCACC ATGGAGCAGC AGGTGGAGGC GCTCCAGGAG
GAGGGGCGCA TCGGCGAGTT CCGCGTGTCG GACGCCGCCG ACGACGACAA CCAGCAGATC
TCCGACATCC AGGCCTTCGT GGACTCCGGG GACTGCGACG TCATCATCAT CTCCCCCTCC
ACTACCGCGA CCCTGACCCC GGCGGTGGAG ACCGCCTGCG AGAGCGGCGT CCCGGTCGTG
GTCTTCGACC GCGGCGTGAA CAGCGACTGC ATGGTCACGT TCATCCACCC GATCGGCGGC
TACGCCTACG GCGCGGACGC GGCCGAGTTC CTGGTCGATG AGCTGGAGCC CGGCTCGACC
GTGCTGGCGC TGCGCATCCT GCCCGGCGTG GACGTGCTCG AACACCGCTG GGCGGCGGCC
CAGGAGGTCT TCGCCGACAG CGAGCTGGAG GTGCTCGGCC ACGAGTTCAC CGAGGGCGAC
GGCGCCATGA TCAAGGACCT GGTCTCCCAG CACCTCCAGC GCGGCGAGGT CGACGGCATC
TGGATGGACG CCGGGGACGG CGCCGTGGCC GCCCTGGAGG CCTTCGAGGA CGCGGGCCAG
CCCTACCCGG TGATCTCCGG TGAGGACGAG CTGAGCTTCA TGCGCAAGTG GCAGGAGGAG
GACCTCACCG CGATCGCGCC CGTCTACTCC AACTTCCAGT GGCGGACCCC GGTCCTGGCC
GCCGGCATGA TCCTCGCCGG CGAGGAGGTG CCCTCGGAGT GGATCCTGCC GCAGGAGCCG
ATCCGTCAGG ACGAGCTGGA CGAGTACCTG GAGCGCAACG CGGAGATGCC GTCCCTGCAC
TACGCGAAGT TCGGCGGCGA GGACCTGCCG GGCTTCCCCG AGGCCTGGAC GGACCGGTAG
 
Protein sequence
MRVPRVLLAG AAGLTLTLTA CTTDAPTDAP EEAGSELTPD GEWFDEAEFE AQLAQREITP 
EGPEDQPWLQ AIEPEWIDTS EFTHDAPEDA TLCFSNASVS NPWRVTGFIT MEQQVEALQE
EGRIGEFRVS DAADDDNQQI SDIQAFVDSG DCDVIIISPS TTATLTPAVE TACESGVPVV
VFDRGVNSDC MVTFIHPIGG YAYGADAAEF LVDELEPGST VLALRILPGV DVLEHRWAAA
QEVFADSELE VLGHEFTEGD GAMIKDLVSQ HLQRGEVDGI WMDAGDGAVA ALEAFEDAGQ
PYPVISGEDE LSFMRKWQEE DLTAIAPVYS NFQWRTPVLA AGMILAGEEV PSEWILPQEP
IRQDELDEYL ERNAEMPSLH YAKFGGEDLP GFPEAWTDR