Gene ECH74115_3921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3921 
SymbolproV 
ID6972082 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3633203 
End bp3634405 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content49% 
IMG OID643387695 
Productglycine betaine transporter ATP-binding subunit 
Protein accessionYP_002272143 
Protein GI209398437 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4175] ABC-type proline/glycine betaine transport system, ATPase component 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.133868 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATTA AATTAGAAAT TAAAAATCTT TATAAAATAT TTGGTGAGCA TCCACAGCGA 
GCGTTCAAAT ATATCGAACA AGGACTTTCA AAAGAACAAA TTCTGGAAAA AACCGGGCTA
TCGCTTGGCG TAAAAGACGC CAGTCTGGCC ATTGAAGAAG GCGAGATATT TGTCATCATG
GGATTATCCG GCTCGGGTAA ATCCACAATG GTACGCCTTC TCAATCGCCT GATTGAACCC
ACCCGCGGGC AAGTACTGAT TGATGGTGTG GATATTGCCA AAATATCCGA CGCCGAACTC
CGTGAGGTGC GCAGAAAAAA GATTGCGATG GTCTTCCAGT CCTTTGCCTT AATGCCGCAT
ATGACCGTGC TGGACAATAC TGCGTTTGGT ATGGAATTGG CCGGAATTAA TGCCGAAGAA
CGCCGGGAAA AAGCCCTTGA TGCACTGCGT CAGGTCGGGC TGGAAAATTA TGCCCACAGC
TACCCGAATG AACTCTCTGG CGGGATGCGT CAACGTGTGG GATTAGCCCG CGCGTTAGCG
ATTAATCCGG ATATTTTATT AATGGATGAA GCCTTCTCGG CGCTCGATCC ATTAATTCGC
ACCGAGATGC AGGATGAGCT GGTAAAATTA CAGGCGAAAC ATCAGCGCAC CATTGTCTTT
ATTTCCCACG ATCTCGATGA AGCCATGCGT ATTGGCGACC GAATTGCCAT TATGCAAAAT
GGTGAAGTGG TACAGGTCGG CACACCGGAT GAAATTATCA ATAATCCGGC GAATGATTAT
GTCCGTACCT TCTTCCGTGG CGTTGATATT AGTCAGGTAT TCAGTGCGAA AGATATTGCC
CGCCGGACAC CGAATGGCTT AATTCGTAAA ACCCCTGGCT TCGGCCCACG TTCGGCACTG
AAATTATTGC AGGATGAAGA TCGTGAATAT GGCTACGTTA TCGAACGCGG TAATAAGTTT
GTCGGCGCAG TCTCCATCGA TTCGCTTAAA ACCGCGTTAA CGCAGCAGCA AGGTCTTGAT
GCGGCGCTGA TTGATGCGCC GTTAGCAGTC GATGCACAAA CGCCTCTTAG CGAGTTGCTC
TCTCATGTCG GACAGGCTCC CTGTGCGGTG CCCGTGGTCG ACGAGGACCA ACAGTATGTC
GGCATCATTT CGAAAGGAAT GCTGCTGCGC GCTTTAGATC GTGAGGGGGT AAATAATGGC
TGA
 
Protein sequence
MAIKLEIKNL YKIFGEHPQR AFKYIEQGLS KEQILEKTGL SLGVKDASLA IEEGEIFVIM 
GLSGSGKSTM VRLLNRLIEP TRGQVLIDGV DIAKISDAEL REVRRKKIAM VFQSFALMPH
MTVLDNTAFG MELAGINAEE RREKALDALR QVGLENYAHS YPNELSGGMR QRVGLARALA
INPDILLMDE AFSALDPLIR TEMQDELVKL QAKHQRTIVF ISHDLDEAMR IGDRIAIMQN
GEVVQVGTPD EIINNPANDY VRTFFRGVDI SQVFSAKDIA RRTPNGLIRK TPGFGPRSAL
KLLQDEDREY GYVIERGNKF VGAVSIDSLK TALTQQQGLD AALIDAPLAV DAQTPLSELL
SHVGQAPCAV PVVDEDQQYV GIISKGMLLR ALDREGVNNG