Gene Amir_2049 details

Gene Information

Locus tag: Amir_2049
Symbol:
ID: 8326238
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 2273320
End bp: 2274330
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 68%
IMG OID: 644942600
Product: Substrate-binding region of ABC-type glycine betaine transport system
Protein accession: YP_003099841
Protein GI: 256376181
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein)
TIGRFAM ID:
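
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming 1-based inclusive coordinates and a single terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2273320, 2274330)
print(length_bp, protein_length(length_bp))  # 1011 336
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.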


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGGCA GGAGCAGGAG GGGCCTGGTC GCGATCAGCG CGGCGCTGTG CCTCGCGGTG 
TCCGGCTGCG GGCTGGAGAG CTCGTTCGCG CTGCCGTTCG AGGTGGCGCC CGGCTCGATC
AGGCCGGAGC CCGCGCTGGA GGGCGTGAAG GTCTCGGTGG GGTCCAAGGA CTTCACCGAG
AACATCGTGC TCGGCTACAT CACCGAGGTC GCGCTGGCGG CGGCGGGCGC CGAGGTCAAC
GACATGACCA ACATCCAGGG CTCGAACAGC TCGCGGCAGG CGCTGCTGAC CGGTGACGCC
GACCTGTCCT GGGACTACAG CGGGACGGGC TGGATCAGCT ACCTGGGCAA CACCGAGCCG
ATTCAGGGCG AGCGCGAGCA GTACGAGGCG GTGCGCGAGG CCGACCTGGA GCGCAACGGC
CTGGTGTGGC TGGACTACAC GAAGGTCAAC AACACCTACG CCTTTGCCGT CACGCGGGAG
TTCGCCGAGG CCAACAACCT CAGCACCACG AGCCAGATGG CCGAGCTGGT GAAGAACGAC
CCCGGCAAGG GCGTGTTCTG CCTGGAGACC GAGTTCATCA GCCGCAACGA CGGGTTCCCC
GGCGTGGCGC AGACCTACGG CTTCGACGCG GGCGCGGCGC AGGTGAAGAC CTTCGGCAGC
GGCACGATCT ACACCGCGAC CTCCGACGGG CTGTGCAACT TCGGCGAGGT GTTCACCACC
GACGGCCGCA TCCTGGCGCT GGACCTGGTG GTGCTGGAGG ACGACAAGAA GTTCTTCCCC
CAGTACAACG CCTCCCTGGT GCTGCGCGAG GAGTTCCACG AGCAGCACCC GGAGATCGAG
CGGATCATGA CGCCGGTGTT CGAGAAGCTG GACAACGACA CGATCATCCG GCTGAACGCC
GAGGTGGACG TGGACGGCCG CGACCCGGCG GCGGTGGCGC GCGACTGGAT GGTGGGCGAG
GGGTTCGTGT CGATCCCCGA CGACACCATG GCGGCGGGCG CGCGGCGCTA G
 
Protein sequence
MSGRSRRGLV AISAALCLAV SGCGLESSFA LPFEVAPGSI RPEPALEGVK VSVGSKDFTE 
NIVLGYITEV ALAAAGAEVN DMTNIQGSNS SRQALLTGDA DLSWDYSGTG WISYLGNTEP
IQGEREQYEA VREADLERNG LVWLDYTKVN NTYAFAVTRE FAEANNLSTT SQMAELVKND
PGKGVFCLET EFISRNDGFP GVAQTYGFDA GAAQVKTFGS GTIYTATSDG LCNFGEVFTT
DGRILALDLV VLEDDKKFFP QYNASLVLRE EFHEQHPEIE RIMTPVFEKL DNDTIIRLNA
EVDVDGRDPA AVARDWMVGE GFVSIPDDTM AAGARR
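
The protein sequence can be compared against a translation of the gene sequence under translation table 11. As a minimal sketch (not the method used to produce this record), the Python below translates the first 60 bases of the gene sequence with a hand-built codon table. The amino-acid assignments of NCBI table 11 are the same as those of the standard code; the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order; the third base varies fastest.
# The amino-acid assignments of table 11 are identical to the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()  # drop the 10-base block spacing
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as listed in this record.
first_line = "ATGAGCGGCA GGAGCAGGAG GGGCCTGGTC GCGATCAGCG CGGCGCTGTG CCTCGCGGTG"
print(translate(first_line))  # MSGRSRRGLVAISAALCLAV
```

The output matches the first 20 residues of the protein sequence above. Applying `translate` to the full gene sequence would allow the same comparison over all 336 residues.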