Gene ECH74115_0874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0874 
Symbol 
ID6972120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp889121 
End bp890554 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content50% 
IMG OID643384899 
Productanion transporter 
Protein accessionYP_002269399 
Protein GI209397894 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0471] Di- and tricarboxylate transporters 
TIGRFAM ID[TIGR00785] anion transporter 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAGA AATCGTTATG GAAGCTAATT CTGATATTAG CGATCCCATG TATTATTGGC 
TTTATGCCAG CTCCGGCAGG ATTAAGCGAA CTGGCGTGGG TGCTTTTTGG TATTTACCTG
GCGGCCATTG TGGGGCTGGT TATCAAGCCT TTCCCGGAAC CTGTCGTACT GTTAATTGCC
GTTGCTGCCT CCATGGTGGT GGTCGGTAAC TTATCCGACG GTGCGTTTAA GACCACCGCC
GTATTAAGCG GTTACTCTTC AGGTACTACC TGGCTGGTGT TCTCGGCGTT TACCTTAAGC
GCCGCATTTG TGACCACCGG TTTAGGTAAA CGTATTGCCT ATCTGCTGAT TGGTAAAATC
GGTAACACCA CGCTGGGTCT GGGTTACGTT ACGGTATTCC TCGATCTGGT ATTGGCTCCG
GCAACACCGT CTAACACCGC GCGTGCGGGC GGCATTGTGT TACCGATCAT CAACAGCGTG
GCGGTGGCTT TGGGGTCCGA ACCGGAAAAA AGTCCGCGTC GTGTCGGACA TTACCTGATG
ATGTCCATTT ACATGGTCAC CAAAACCACC AGCTATATGT TCTTTACCGC AATGGCGGGG
AACATTCTGG CGCTGAAAAT GATCAACGAC ATTCTGCACC TGCAAATTAG CTGGGGTGGA
TGGGCGCTGG CCGCCGGATT GCCGGGCATC ATTATGCTGC TGGTCACCCC GCTGGTGATT
TACACCATGT ATCCACCAGA AATTAAGAAG GTGGATAACA AAACCATCGC TAAAGCGGGC
CTTGCCGAAC TGGGACCGAT GAAAATCCGC GAAAAAATGC TGCTCGGTGT CTTTGTGCTG
GCGCTGCTGG GCTGGATTTT CAGTAAGTCT CTGGGGGTTG ATGAATCTTC AGTGGCTATT
GTTGTTATGG CGACCATGCT GCTGCTGGGT ATCGTTACCT GGGAAGACGT GGTTAAAAAT
AAAGGCGGCT GGAATACCTT AATCTGGTAC GGCGGTATTA TCGGCTTAAG CTCCTTATTA
TCGAAAGTTA AATTTTTCGA ATGGTTAGCT GAAGTCTTTA AAAATAACCT GGCATTTGAT
GGTCACGGTA ACGTTGCTTT CTTCGTTATT ATTTTCCTCA GCATCATCGT GCGTTATTTC
TTCGCTTCCG GTAGTGCCTA TATCGTTGCT ATGTTACCGG TATTTGCCAT GCTGGCGAAC
GTCTCCGGCG CACCGTTAAT GTTAACCGCG CTGGCACTGT TGTTCTCCAA CTCCTATGGC
GGCATGGTTA CTCACTATGG CGGCGCGGCA GGTCCGGTCA TCTTTGGCGT GGGTTATAAC
GATATTAAAT CCTGGTGGTT GGTCGGTGCG GTACTGACGA TATTAACCTT CCTGGTGCAT
ATCACCCTCG GCGTGTGGTG GTGGAATATG CTGATCGGCT GGAACATGCT GTAA
 
Protein sequence
MNKKSLWKLI LILAIPCIIG FMPAPAGLSE LAWVLFGIYL AAIVGLVIKP FPEPVVLLIA 
VAASMVVVGN LSDGAFKTTA VLSGYSSGTT WLVFSAFTLS AAFVTTGLGK RIAYLLIGKI
GNTTLGLGYV TVFLDLVLAP ATPSNTARAG GIVLPIINSV AVALGSEPEK SPRRVGHYLM
MSIYMVTKTT SYMFFTAMAG NILALKMIND ILHLQISWGG WALAAGLPGI IMLLVTPLVI
YTMYPPEIKK VDNKTIAKAG LAELGPMKIR EKMLLGVFVL ALLGWIFSKS LGVDESSVAI
VVMATMLLLG IVTWEDVVKN KGGWNTLIWY GGIIGLSSLL SKVKFFEWLA EVFKNNLAFD
GHGNVAFFVI IFLSIIVRYF FASGSAYIVA MLPVFAMLAN VSGAPLMLTA LALLFSNSYG
GMVTHYGGAA GPVIFGVGYN DIKSWWLVGA VLTILTFLVH ITLGVWWWNM LIGWNML