Gene ECH74115_5624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5624 
SymbolproP 
ID6969606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5260151 
End bp5261653 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content50% 
IMG OID643389258 
Productproline/glycine betaine transporter 
Protein accessionYP_002273655 
Protein GI209400808 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00883] metabolite-proton symporter 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAAAA GGAAAAAAGT AAAACCGATT ACCCTTCGTG ATGTCACCAT TATTGATGAC 
GGTAAACTGC GTAAAGCCAT TACCGCAGCA TCACTGGGTA ATGCAATGGA ATGGTTTGAT
TTTGGTGTTT ATGGTTTTGT TGCTTACGCA TTAGGTAAAG TTTTTTTCCC GGGGGCTGAC
CCCAGCGTGC AGATGGTTGC TGCACTTGCC ACTTTCTCCG TTCCCTTTCT GATTCGACCG
CTTGGCGGAC TCTTCTTTGG TATGTTGGGC GATAAATATG GTCGCCAGAA GATCCTCGCT
ATCACTATTG TGATTATGTC GATCAGTACG TTCTGTATTG GCTTAATACC GTCCTACGAC
ACGATTGGTA TTTGGGCACC GATTCTGCTG TTGATCTGTA AGATGGCACA AGGTTTCTCG
GTCGGCGGTG AATATACCGG GGCGTCGATA TTTGTTGCGG AATACTCCCC TGACCGTAAA
CGTGGCTTTA TGGGCAGCTG GCTGGACTTC GGTTCTATTG CCGGGTTTGT GCTGGGTGCG
GGCGTGGTGG TGTTAATTTC GACCATTGTC GGCGAAGCGA ACTTCCTCGA CTGGGGCTGG
CGTATTCCGT TCTTTATTGC TCTGCCGTTA GGGATTATCG GGCTTTACCT GCGCCATGCG
CTGGAAGAAA CTCCGGCGTT CCAGCAGCAT GTTGATAAAC TGGAACAGGG CGACCGCGAA
GGTTTGCAGG ATGGCCCGAA AGTCTCGTTT AAAGAGATTG CCACTAAATA CTGGCGCAGC
CTGTTGACAT GTATTGGTCT GGTAATTGCC ACCAACGTGA CTTACTACAT GTTGCTGACC
TATATGCCGA GTTATTTGTC GCATAACCTG CATTACTCCG AAGACCACGG GGTGCTGATT
ATTATCGCCA TTATGATCGG TATGCTGTTT GTCCAGCCGG TGATGGGCTT GCTGAGTGAC
CGTTTTGGCC GTCGTCCGTT TGTGCTACTT GGTAGTGTTG CACTGTTTGT GTTGGCGATC
CCGGCGTTTA TTCTGATTAA CAGTAACGTC ATCGGCCTGA TTTTTGCCGG GTTACTGATG
CTGGCGGTGA TCCTTAACTG CTTTACGGGC GTTATGGCTT CTACCTTGCC AGCGATGTTC
CCGACGCATA TCCGTTACAG CGCGCTGGCG GCGGCATTTA ATATTTCGGT GCTGGTTGCC
GGTCTGACGC CAACACTGGC GGCCTGGCTG GTCGAAAGCT CGCAGAATCT GATGATGCCA
GCCTATTACC TGATGGTAGT GGCGGTGGTT GGTTTAATCA CCGGCGTAAC CATGAAAGAG
ACGGCAAATC GTCCGTTGAA AGGTGCAACA CCGGCGGCGT CAGATATACA GGAAGCGAAG
GAAATTCTCG TCGAGCATTA CGATAATATC GAGCAGAAAA TCGATGATAT TGACCACGAG
ATTGCCGATT TGCAGGCGAA ACGTACCCGC CTGGTGCAGC AACATCCGCG AATTGATGAA
TAA
 
Protein sequence
MLKRKKVKPI TLRDVTIIDD GKLRKAITAA SLGNAMEWFD FGVYGFVAYA LGKVFFPGAD 
PSVQMVAALA TFSVPFLIRP LGGLFFGMLG DKYGRQKILA ITIVIMSIST FCIGLIPSYD
TIGIWAPILL LICKMAQGFS VGGEYTGASI FVAEYSPDRK RGFMGSWLDF GSIAGFVLGA
GVVVLISTIV GEANFLDWGW RIPFFIALPL GIIGLYLRHA LEETPAFQQH VDKLEQGDRE
GLQDGPKVSF KEIATKYWRS LLTCIGLVIA TNVTYYMLLT YMPSYLSHNL HYSEDHGVLI
IIAIMIGMLF VQPVMGLLSD RFGRRPFVLL GSVALFVLAI PAFILINSNV IGLIFAGLLM
LAVILNCFTG VMASTLPAMF PTHIRYSALA AAFNISVLVA GLTPTLAAWL VESSQNLMMP
AYYLMVVAVV GLITGVTMKE TANRPLKGAT PAASDIQEAK EILVEHYDNI EQKIDDIDHE
IADLQAKRTR LVQQHPRIDE