Gene ECH74115_3923 details

Gene Information

Locus tag: ECH74115_3923
Symbol: proX
ID: 6967264
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3635519
End bp: 3636511
Gene Length: 993 bp
Protein Length: 330 aa
Translation table: 11
GC content: 53%
IMG OID: 643387697
Product: glycine betaine transporter periplasmic subunit
Protein accession: YP_002272145
Protein GI: 209399990
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components
TIGRFAM ID:
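
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3635519
end_bp = 3636511
gene_length_bp = 993
protein_length_aa = 330

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
codons = span // 3
residues = codons - 1

print(span, span % 3, residues)  # prints: 993 0 330
```

Under these assumptions the span equals the reported gene length, is a whole number of codons, and gives the reported protein length.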


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.334646
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACATA GCGTACTTTT TGCGACAGCG TTTGCCACGC TTATCTCTAC ACAAACTTTT 
GCTGCCGACC TGCCGGGCAA AGGCATTACT GTTAATCCGG TTCAGAGCAC CATCACCGAA
GAAACTTTCC AGACGCTGCT GGTCAGCCGC GCGCTGGAGA AATTAGGTTA TACCGTCAAT
AAACCCAGCG AAGTGGATTA CAACGTTGGC TACACCTCGC TTGCTTCCGG CGATGCAACC
TTCACCGCCG TGAACTGGAC GCCACTGCAT GACAACATGT ACGAAGCTGC CGGTGGCGAT
AAGAAATTTT ATCGTGAAGG GGTATTTGTT AACGGCGCGG CACAGGGTTA CCTGATCGAT
AAGAAAACCG CCGACCAGTA CAAAATCACC AACATCGCAC AACTGAAAGA TCCGAAGATC
GCCAAACTGT TCGATACCAA CGGCGACGGA AAAGCGGATT TAACCGGTTG TAACCCTGGC
TGGGGCTGCG AAGGTGCGAT CAACCACCAG CTTGCCGCGT ATGAACTGAC CAATACCGTG
ACGCATAATC AGGGGAACTA CGCGGCGATG ATGGCCGACA CCATCAGTCG TTACAAAGAG
GGCAAACCGG TGTTTTACTA CACCTGGACG CCGTACTGGG TGAGTAATGA GCTGAAGCCA
GGCAAAGATG TGGTCTGGTT GCAGGTGCCG TTCTCTGCAC TGCCGGGCGA TAAAAATGCC
GATACCAAAC TGCCGAATGG CGCGAATTAC GGCTTCCCGG TCAGCACCAT GCATATCGTT
GCCAATAAAG CCTGGGCCGA GAAAAACCCG GCAGCAGCGA AACTGTTTGC CATTATGCAG
TTGCCAGTGG CAGATATTAA CGCCCAGAAC GCCATTATGC ATGACGGCAA AGCCTCAGAA
GGCGATATTC AGGGCCACGT TGATGGCTGG ATCAAAGCCC ACCAGCAACA GTTCGATGGC
TGGGTGAATG AGGCGTTGGC AGCGCAGAAG TAA
 
Protein sequence
MRHSVLFATA FATLISTQTF AADLPGKGIT VNPVQSTITE ETFQTLLVSR ALEKLGYTVN 
KPSEVDYNVG YTSLASGDAT FTAVNWTPLH DNMYEAAGGD KKFYREGVFV NGAAQGYLID
KKTADQYKIT NIAQLKDPKI AKLFDTNGDG KADLTGCNPG WGCEGAINHQ LAAYELTNTV
THNQGNYAAM MADTISRYKE GKPVFYYTWT PYWVSNELKP GKDVVWLQVP FSALPGDKNA
DTKLPNGANY GFPVSTMHIV ANKAWAEKNP AAAKLFAIMQ LPVADINAQN AIMHDGKASE
GDIQGHVDGW IKAHQQQFDG WVNEALAAQK
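
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. This is an illustration under stated assumptions, not the tool used to produce the record: it assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two are generally described as differing only in permitted start codons), and the helper names `translate` and `gc_percent` are hypothetical. Only the first and last lines of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# Assumption: table 11 uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces/newlines used to group bases in blocks of ten."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases and last line of the gene sequence, copied from the record.
head = "ATGCGACATA GCGTACTTTT TGCGACAGCG"
tail = "TGGGTGAATG AGGCGTTGGC AGCGCAGAAG TAA"

print(translate(head))  # MRHSVLFATA
print(translate(tail))  # WVNEALAAQK
```

Both fragments reproduce the corresponding start and end of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a figure comparable to the GC content field, although the rounding rule used by the source database is not stated.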