Gene ECH74115_3254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3254 
Symbol 
ID6967225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2986716 
End bp2987642 
Gene Length927 bp 
Protein Length308 aa 
Translation table11 
GC content55% 
IMG OID643387067 
ProductABC transporter, quaternary amine uptake (QAT) family, ATP-binding protein 
Protein accessionYP_002271531 
Protein GI209398032 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1125] ABC-type proline/glycine betaine transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.000625793 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATTGAAT TTAGCCATGT CAGCAAACTG TTCGGCGCAC AAAAAGCCGT TAACGATCTC 
AATCTCAATT TTCAGGAAGG GAGTTTTTCG GTGCTGATTG GCACATCTGG CTCCGGCAAA
TCCACCACCC TGAAAATGAT TAACCGCCTG GTGGAACATG ACAGCGGAGA GCTCCGCTTT
GCTGGAGAAG AAATTCGCTC GCTGCCAGTA CTGGAGTTGC GCCGCCGTAT GGGCTATGCC
ATTCAATCTA TTGGCCTGTT TCCCCACTGG AGCGTGGCAC AAAACATTGC CACCGTGCCG
CAATTACAAA AATGGTCGCG GGCGAGGATT GACGATCGTA TCGACGAATT AATGGCGCTA
CTGGGGCTGG AGTCAAATTT ACGTGAGCGT TATCCACATC AGCTTTCCGG TGGTCAGCAG
CAACGTGTGG GAGTAGCGCG CGCACTGGCT GCCGATCCGC AAGTCTTACT GATGGATGAA
CCTTTTGGCG CACTGGACCC GGTAACGCGC GGCGCGTTGC AACAAGAGAT GACGCGCATT
CACCGTTTGC TGGGGCGTAC CATTGTGCTG GTCACGCATG ATATTGATGA GGCGCTACGG
CTGGCAGAAC ATCTGGTATT GATGGATCAC GGTGAAGTGG TGCAGCAGGG GAATCCGCTG
ACGATGCTGA CTCGTCCGGC GAATGATTTT GTCCGCCAGT TTTTTGGACG TAGTGAACTG
GGTGTGCGCC TGCTTTCGTT ACGTAGTGTG GCGGATTACG TGCGTCGCGA AGAACGGGCA
GAAGGTGAGG CACTGGCAGA AGAGATGACG CTGCGCGATG CGCTCTCCCT GTTTGTCGCG
CGGGGATGCG AGGTGCTGCC GGTGGTGAAC ACGCAGGGCC AGCCTAGCGG CACGCTGCAT
TTTCAGGATC TGCTGGTGGA GGCGTAA
 
Protein sequence
MIEFSHVSKL FGAQKAVNDL NLNFQEGSFS VLIGTSGSGK STTLKMINRL VEHDSGELRF 
AGEEIRSLPV LELRRRMGYA IQSIGLFPHW SVAQNIATVP QLQKWSRARI DDRIDELMAL
LGLESNLRER YPHQLSGGQQ QRVGVARALA ADPQVLLMDE PFGALDPVTR GALQQEMTRI
HRLLGRTIVL VTHDIDEALR LAEHLVLMDH GEVVQQGNPL TMLTRPANDF VRQFFGRSEL
GVRLLSLRSV ADYVRREERA EGEALAEEMT LRDALSLFVA RGCEVLPVVN TQGQPSGTLH
FQDLLVEA