Gene Amir_1785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_1785 
Symbol 
ID8325970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp1961227 
End bp1962501 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content67% 
IMG OID644942334 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003099579 
Protein GI256375919 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.155279 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCAACC GTCGAGCAGG CCTGGGTAGC CTCCTGGCCT GCACCGCAGT GCTCGCAGCG 
GTCACCACCG CCTGCGGTTC GGACAGTGGG ACCAGTGCCG ATGGCAAGAT CGAACTCACC
ATCGCCACGT TCAACCAGTT CGGGTACGAG GAGCTCCTCA AGGAGTACGA GGCGGCTCAC
CCCAACATCA AGGTGAGCGA GCGCAAGACC GGTCAGGCCG CTGACCACCA CAAGAACCTC
TTCACCAAGA TGGCGGCGGG CTCCGGCCTC TCCGACGTCG AGGGCGTCGA GGAGGGCTAC
CTCAGCCAGG TCATGACCCG CGCGGGCCAG TTCAACAACC TCAAGGAGAT CGGCCCGAAC
GTCGACGGCC GCTGGCTGGA CTGGAAGACC AAGTCGGTCA CGGCCAAGGG CGGCGAGCTG
ATCGGCTACG GCACCGACAT CGGCCCGCTG GCCATGTGCT ACCGCAAGGA CCTCCTGGAG
GCCGGTGGCA TCCCCACCGA CGAGGCAGGC ATCGCCGCCA CGTTCGCCAC CTGGGACTCG
TACTTCGCCG CGGGCAAGCA GTACGCGGAG AAGACCGGCA AGGCCTGGTT CGACTCGGCC
GCGCAGATCT TCAACCCGAT GCACAACCAG GCCGAGCTCG GCTACTTCGA CAAGGACGAC
AAGCTCGTCA TCGACTCCAA CGGCGACAAG GCCATCTGGG GCAAGGTGAC CGCCGCCGTC
GCGCAGGGCC AGTCCGCGAA GCTCAAGGCG TGGACCCCCG AGTGGGAGAC CGGCTTCCGC
GAGTCGGCCT TCGCCACCAA GACCTGCCCG GCGTGGCTGC TCGGCAACAT CGAGAAGAAC
TCCGGCCCCG AGCACAAGGG CAAGTGGGTC GTCACCGGCT CCTTCCCCGA CGGCGGCGGC
AACTGGGGCG GCTCGTTCCT GACCGTGCCC AAGCAGAGCA AGCACCCGAA GGAAGCCGCT
GAGCTGGCTG CCTGGCTGAC CGCGCCCGAG CAGCAGATCA AGGCGTTCAA GGCCAAGAAC
ACCTTCCCGA GCCAGCTCGA GGCGCTCGAC AACCCCGAGC TGACCGAGAG CACCAACGAG
TACTTCGGCC AGCAGGTCGG CAAGCTCTAC GCGGAGCAGG CCAAGAAGGT CGACGTGGCC
CAGTACAAGG GCCCGAAGGA CGGCCAGATC CAGGACGACA TCGTCGGCCC GGCGCTGCTG
TCGGTCGAGC AGGGCACCAA CGCGGACGAG GCGTGGAAGA AGGTCGTCGA GGACGCCCAG
AAGGCGACCA AGTGA
 
Protein sequence
MRNRRAGLGS LLACTAVLAA VTTACGSDSG TSADGKIELT IATFNQFGYE ELLKEYEAAH 
PNIKVSERKT GQAADHHKNL FTKMAAGSGL SDVEGVEEGY LSQVMTRAGQ FNNLKEIGPN
VDGRWLDWKT KSVTAKGGEL IGYGTDIGPL AMCYRKDLLE AGGIPTDEAG IAATFATWDS
YFAAGKQYAE KTGKAWFDSA AQIFNPMHNQ AELGYFDKDD KLVIDSNGDK AIWGKVTAAV
AQGQSAKLKA WTPEWETGFR ESAFATKTCP AWLLGNIEKN SGPEHKGKWV VTGSFPDGGG
NWGGSFLTVP KQSKHPKEAA ELAAWLTAPE QQIKAFKAKN TFPSQLEALD NPELTESTNE
YFGQQVGKLY AEQAKKVDVA QYKGPKDGQI QDDIVGPALL SVEQGTNADE AWKKVVEDAQ
KATK