Gene Amir_3336 details


Gene Information

Locus tag: Amir_3336
Symbol:
ID: 8327526
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 3913209
End bp: 3914822
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 74%
IMG OID: 644943847
Product: extracellular solute-binding protein family 1
Protein accession: YP_003101087
Protein GI: 256377427
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
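
The length fields above follow from the coordinates. The short Python sketch below reproduces them, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 3913209
end_bp = 3914822

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon, which encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1614 bp
print(protein_length, "aa")  # 537 aa
```

Both values agree with the Gene Length and Protein Length fields in the record.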


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGTGG ACCGCAGGGC TTTTTTAGGT CTTCTCGCCG CAGCGGGCGG AACGGCGCTG 
GTCGGGTGCG AGGCGCCTGC CCCGCGCGGC GGTTCCGCGC TCGGCGGCGA CGCGCTGGCG
GCGCTGCTGC CCGCGCACCG GCCGGTGGAG TTCGCCGAGC CCGACCTGCC CGGCGTCAAC
GGCTCGGTGC CCGGCTACCT GACCTACCCG GCCAACCCGG TGCGCGGCGT GCGGGGGCCG
GTGCTCAGCG GCGAGGTCAC CGCGATGACC CCGTCGTTCT GGCCGCCGGC GCCGGGGCCG
GGGCGGAACT CCTACTACGA CGCGGTCAAC GAGCGCCTCG GCGGCGCGGT GCGGTTCGAG
ACCGTGTCCG GGGCCGACTA CCAGGCCAAG CTCTCCGCCC TGATGGCGGC GCGGCAGGTG
CCGGAGCTGA CCGTGGTGCC CACCTTCACC ATGCCGCCCC GGTTCAGCGA GGGCGTGGGC
GAGGTGTTCC GCGACCTGAC CGACTTCCTG TCCGGCGAGC GCGTCGCCGA CTACCCGATG
CTGGCCAACA TCCCCACCGA CTCGTGGCAC GCGTGCGTGC ACAACGGGCG CCTGCACGGC
GTTCCCTACC CCGGCCAGCT GTTCCCCGAG GTGCTGTTCT ACCGGGACGA CGTGTTCGAG
CAGCTCGGGG TGGAGCCGCC GCGCAGCGCC GAGGAGTTCG CGGCGATGGC CAAGCGGCTC
AACGACCCCG CGAACGACCG GTGGGCGCTC GGCGACGTGT TCCGCTCGCT CGTCCGCGCC
TTCGGCGGCC GGGGCGACTG GGTGCGCGAC GACTCGGGCA AGCTGCTGAA CCAGCTCGAA
ACCCCCTGGT ACGCCGAGGC GGTGCGGTTC ACCCGGTCCC TGTACGACGC CGGGTGCGTG
CACCCGGACA TCGTGGCGGG CAACTGGAAC CGCGGCAACG AGCTGTTCGC GGCCAAGCGG
ATGATCGTCA ACCAGGGCGG GATGGGCGCG TGGGCCGAGC AGGTCGCGCA GCAGCGGCCC
GCCGACCCCG GTTTCCGGAT GACCGCGCTG CCGCTGTTCG CGCACGACGG CGGCGAGCCC
GCCTACCCGG TGGCCGCGCC GACGGTGATG GTCGCGTTCG TCCGCAAGGA CGTGACCGAC
GAGCGGGTGC GGGAACTGCT GCGGCTGTGC GACTTCGCCG CCGCGCCGAT CGGCACCGAG
GAGCACCGGC TGCTGCGGTA CGGGGTGGAG GGCGTGCACA GCGAGCGCGA CGCGCGGGGC
AACCCGGCGC TGACCCCGCT GGGGCAGAAG GAGATCACGC TGACCTACGG GTTCGCGGCG
GGCCCGCCGG AGGCGATCAC CACCACCGAC CACCCGGACC TCGTGCGGGC CCAGCACGCC
TGGTACGCGC GGGAGTGGGG ACACCAGACC AAGCCGCTGG CGTTCGGGCT GCGCCTGGAG
GAGCCGCCGG AGTTCGCGAC CCTGGCGAAG GAGTTCGCGG ACCGGACCAC GGACGTCCTG
CGCGGGCGGG CCGAGCTGTC CGAGGTGGAC GGACTGGGCG AGCGGTGGCG CAAGGCGGGC
GGGGATCGGC TGCGGGAGTT CTACGACAAG GCGCTGCGGG ACGCCGGGCG CTGA
 
Protein sequence
MPVDRRAFLG LLAAAGGTAL VGCEAPAPRG GSALGGDALA ALLPAHRPVE FAEPDLPGVN 
GSVPGYLTYP ANPVRGVRGP VLSGEVTAMT PSFWPPAPGP GRNSYYDAVN ERLGGAVRFE
TVSGADYQAK LSALMAARQV PELTVVPTFT MPPRFSEGVG EVFRDLTDFL SGERVADYPM
LANIPTDSWH ACVHNGRLHG VPYPGQLFPE VLFYRDDVFE QLGVEPPRSA EEFAAMAKRL
NDPANDRWAL GDVFRSLVRA FGGRGDWVRD DSGKLLNQLE TPWYAEAVRF TRSLYDAGCV
HPDIVAGNWN RGNELFAAKR MIVNQGGMGA WAEQVAQQRP ADPGFRMTAL PLFAHDGGEP
AYPVAAPTVM VAFVRKDVTD ERVRELLRLC DFAAAPIGTE EHRLLRYGVE GVHSERDARG
NPALTPLGQK EITLTYGFAA GPPEAITTTD HPDLVRAQHA WYAREWGHQT KPLAFGLRLE
EPPEFATLAK EFADRTTDVL RGRAELSEVD GLGERWRKAG GDRLREFYDK ALRDAGR
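
The gene and protein sequences above are related by translation table 11. The Python sketch below shows one way to check this, and to compute a GC fraction, directly from the blocked sequence text. It is a minimal illustration and not the tool that produced this record: the function names are hypothetical, the codon assignments are those of the standard genetic code (table 11 shares them and differs only in its permitted start codons), and alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments; only its start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Drop the spaces and line breaks of a blocked sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in the sequence (0.0 if empty)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above, pasted with its original spacing.
first_block = "ATGCCTGTGG ACCGCAGGGC TTTTTTAGGT CTTCTCGCCG CAGCGGGCGG AACGGCGCTG"
print(translate(first_block))  # MPVDRRAFLGLLAAAGGTAL
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.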