Gene Amir_5309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_5309 
Symbol 
ID8329511 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp6310806 
End bp6312149 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content68% 
IMG OID644945748 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003102976 
Protein GI256379316 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.574903 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTCAC TCCGATACGG CGGACTCGCC GCAGCCGTGC TCATGGCCAC CGCTGCCTGT 
GCAGGCGCGG GCGGCGGCGG TGGCGGGAAC GAGAACTCGA TCAACGTTCT CATGGTCAAC
AACCCGCAGA TGGAAGACCT CCAGAAGCTC ACCGCGGACA ACTTCACCAA GGACACGGGC
ATCACGGTGA ACTTCACCGT CCTGCCCGAG AACGACGTCC GCGACAAGAT CAGCCAGGAC
TTCTCCAGCC AGGCCGGGCA GTACGACGTC GCCACGATCT CCAACTACGA GACGCCGATC
TACGCCAAGA ACAACTGGCT GACCCCGCTC GACGAGTACG TCGCCAAGGA CTCCGGCTTC
GCCCAGGACG ACGTCCTGGA GTCGGTGCGC CAGTCCCTGA CCGCCGCCGA CGGCAAGGTC
TACGCCCAGC CGTTCTACGG CGAGTCCTCG TTCCTGATGT ACCGCAAGGA CATCATGGAC
GCCAAGGGCA TCACCATGCC GGAGAAGCCC ACCTGGCAGC AGGTCGCCGA CATCGCCGCC
CAGGTCGACG GCGCCGAGCC CGGCATGAAG GGCATCTGCC TGCGCGGCCA GCCCGGCTGG
GGCCAGCTGA TGGCGCCGCT CACCACGGTC GTCAACACCT TCGGCGGCAC CTGGTTCACG
AAGGACTGGC AGGCCCAGGT GAACTCCCCG GAGTTCAAGG AGGCCACCGA CTTCTACGTC
AACCTGGTCC GCGACCACGG TGAGAACGGC GCCCCGCAGG CCGGCTTCGC CGAGTGCCTG
AACAACATGA CCCAGGGCAA GGTCGCCATG TGGTACGACG CGACCTCCGC CGCCGGCCTC
CTCGAGGGCG CCGACTCGCC GGTGAAGGGC AAGCTCGGCT TCGCCCAGGC CCCCGTGGTC
AAGACCGACA GCTCGGGCTG GCTCTACACC TGGGCGTTCG GCATCCAGAA GGCCAGCAAG
AAGGCCGACA ACGCCTGGAA GTTCATCTCC TGGGCCTCCG GCAAGGGCTA CGAGGAGCTG
GCGGGCAAGT CCCTCGGCTG GTCGCGCGTC CCGGACGGCA AGCGCTCCTC CACCTACGCG
CGCCCCGAGT ACCTCGAGGC CAGCGGCACG TTCGCCAAGC AGGTCGAGGC CGCCATCTCC
GGCACCAAGC CGACCGACCC CGGCGTGCAG CCCCGCCCGG CCCCCGGCAT CCAGTTCGTC
GGCATCCCGG AGTTCACCGA CCTGGGCACC CAGGTCTCCC AGAAGATCAG CGCCGCGATC
GCCGGCTCCA CCACCGTCGA GCAGGCCCTG ACCGAGAGCC AGGCCCTGGC CGAGACCGTG
GCCGAGAAGA ACCGGGGCAA GTGA
 
Protein sequence
MKSLRYGGLA AAVLMATAAC AGAGGGGGGN ENSINVLMVN NPQMEDLQKL TADNFTKDTG 
ITVNFTVLPE NDVRDKISQD FSSQAGQYDV ATISNYETPI YAKNNWLTPL DEYVAKDSGF
AQDDVLESVR QSLTAADGKV YAQPFYGESS FLMYRKDIMD AKGITMPEKP TWQQVADIAA
QVDGAEPGMK GICLRGQPGW GQLMAPLTTV VNTFGGTWFT KDWQAQVNSP EFKEATDFYV
NLVRDHGENG APQAGFAECL NNMTQGKVAM WYDATSAAGL LEGADSPVKG KLGFAQAPVV
KTDSSGWLYT WAFGIQKASK KADNAWKFIS WASGKGYEEL AGKSLGWSRV PDGKRSSTYA
RPEYLEASGT FAKQVEAAIS GTKPTDPGVQ PRPAPGIQFV GIPEFTDLGT QVSQKISAAI
AGSTTVEQAL TESQALAETV AEKNRGK