Gene ECH74115_0967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0967 
Symbol 
ID6970213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp981316 
End bp982434 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content52% 
IMG OID643384987 
Productcitrate transporter family protein 
Protein accessionYP_002269487 
Protein GI209399705 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0471] Di- and tricarboxylate transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.276496 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGC CTTTTTTACG CACGCTGCAA GGCGATCGTT TTTTTCAGTT ATTAATTCTT 
GTTGGTATCG GATTAAGTTT TTTCGTGCCC TTTGCACCGA AATCCTGGCC TGCTGCTATC
GACTGGCACA CCATCATCAC CTTAAGCGGC CTGATGCTGC TGACCAAAGG TGTGGAGTTA
AGCGGTTATT TTGATGTGCT GGGGCGCAAA ATGGTGCGCC GCTTTGCTAC GGAGCGTCGG
CTGGCGATGT TTATGGTGCT GGCGGCGGCG CTGCTTTCTA CCTTTCTGAC CAACGATGTC
ACGCTGTTTA TTGTTGTTCC GCTGACTATC ACGCTAAAAA GGCTGTGTGA GATCCCGGTT
AATCGGCTGA TTATTTTTGA GGCGCTGGCA GTCAACGCTG GTTCGCTACT GACGCCAATT
GGCAACCCGC AAAATATTCT TATCTGGGGA CGTTCTGGTC TTTCGTTTGC CGGATTTATT
GCCCAAATGG CACCGCTGGC TGGCGCAATG ATGCTGACGC TCCTGCTGTT GTGCTGGTGT
TGTTTCCCTG GAAAGGCACT CCAATACCAT ACGGGGGTGC AAACACCGGA GTGGAAACCG
CGGCTGGTGT GGAGTTGTCT GGGGCTGTAT ATCGTCTTTC TGACGGCGCT GGAGTTCAAA
CAAGAGCTGT GGGGACTGGT GATTGTGGCG GCGGGCTTTG CGCTGCTGGC GCGTCGCGTG
GTGTTGAGTG TGGACTGGAC GCTGCTGCTG GTGTTTATGG CGATGTTTAT CGACGTCCAT
TTACTGACCC AGCTTCCAGC GTTGCAAGGC GTGTTGGGTA ACGTGAGTCA TTTATCTGAA
CCCGGATTAT GGTTAACGGC AATCGGTTTA TCGCAGGTGA TCAGTAACGT GCCGAGTACT
ATATTGTTGC TGAACTATGT GCCGCCGTCT TTATTACTGG CATGGGCGGT AAACGTAGGT
GGCTTTGGGT TATTACCCGG ATCGCTGGCA AATTTGATTG CGCTACGTAT GGCGAACGAT
CGCCGCATCT GGTGGCGTTT CCATCTCTAT TCAATACCGA TGCTGTTGTG GGCGGCGCTG
GTGGGATATG TTTTGTTGGT TATGATCCCG GCCTGGTAA
 
Protein sequence
MSLPFLRTLQ GDRFFQLLIL VGIGLSFFVP FAPKSWPAAI DWHTIITLSG LMLLTKGVEL 
SGYFDVLGRK MVRRFATERR LAMFMVLAAA LLSTFLTNDV TLFIVVPLTI TLKRLCEIPV
NRLIIFEALA VNAGSLLTPI GNPQNILIWG RSGLSFAGFI AQMAPLAGAM MLTLLLLCWC
CFPGKALQYH TGVQTPEWKP RLVWSCLGLY IVFLTALEFK QELWGLVIVA AGFALLARRV
VLSVDWTLLL VFMAMFIDVH LLTQLPALQG VLGNVSHLSE PGLWLTAIGL SQVISNVPST
ILLLNYVPPS LLLAWAVNVG GFGLLPGSLA NLIALRMAND RRIWWRFHLY SIPMLLWAAL
VGYVLLVMIP AW