Gene ECH74115_1016 details

Gene Information

Locus tag: ECH74115_1016
Symbol:
ID: 6970325
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1028575
End bp: 1029918
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 45%
IMG OID: 643385030
Product: PTS system ascorbate-specific transporter subunit IIC
Protein accession: YP_002269530
Protein GI: 209398665
COG category: [S] Function unknown
COG ID: [COG3037] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
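
The start, end, gene length, and protein length fields in this record agree with one another if the coordinates are read as 1-based and inclusive, and if the final codon of the CDS is a stop codon. A minimal Python sketch of that arithmetic follows. The function names are illustrative and not part of the database, and the inclusive-coordinate convention is an assumption rather than something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(1028575, 1029918)
print(length)                  # 1344
print(protein_length(length))  # 447
```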


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.279058
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGGGCG TTCCGACGAT GTTTGCCAAA TTCATAGATG TTATTCAAAC GTTTTTAACT 
GAACCGGCAA TTTTAATTGG TTTGCTGGTG GGTATTGGTT ATGCCCTTGA TAAGAAATCG
CCAATAAAAA TTATTACCGG CATGGTCAGC GCCATGGTGG GATTAATGAT GGTGTTATTT
GGTGGTTTCC AGTTTTCCGC AACATTTAAA CCGGTAGCAG AAGCCGTAAG TAAAGCCTAT
GGTGTTCACG GATATTTAAT GGATTCCTAC GCAATGAAAG CGGCAACGCA AATTGCATTG
GGGGATAATT TTGGTTATGT CGGGTATGTA TTTGTACTGG CGTTTTTTAC CAACCTGCTG
CTGGTATTAT TTGGTCGGTA TACTGGTGCC AAAGGGATAT TTTTGACAGG GAATACCGGT
GTTTCACATT CTCAGGCGGT ATTATGGTTG ATTGTCTTTT GGCTGGGTTT TAGCTGGACT
ACATCCATTA TTATTGCCGG AATATTAACG GGTGTATTCT GGGCGTTCTC CACCACGCTC
ATTGTTAAGC CCATTGCCAA AGTTACCAAA GATGCCGGTT TTACCATTGC TCATAACCAG
ATGTTGGGAT TGTGGTTTTT CTCCAAATTC GCCCATAAAT TTGGTGACCC TGAAAAGCAC
GATGCTGAAA ACCTGAAACT TCCCGGGTGG CTGGCGATTT TTAACCATAA CGTAACGGCT
ATCGCCATTG TGATGACGCT GTTTGTCGGT GGTTTCTTGC TATCAACGGG TATCGATAAT
GTCCAGTTAA TGGCGAAAGG CAAGCCCTGG TACATCTATA TCATCAACCT TGGCTTACAG
TTCTCCATGT ATATGGTCAT TCTGCTGCAG GGTGTGCGCA TGATGGTGGG CGAAATTAAC
GGTTCGTTTA AAGGCTGGCA AGATCGCTTT ATTCCTAACG CTATTCCTGC TGTCGATGTT
GCAGCGCTGT TACCTTTTTC ACCCAATGCC GCAACGTTAG GCTTCGTCTT CTGTACTTTT
GGCACCATTT TTTCGATGGG GATCTTATTA CTGGTTCACA GCCCAATTAT GGTATTGCCT
GGCTTTGTAC CTCTGTTTTT CTCTGGCGGT CCAATTGGCG TATTAGCAAA CCGCATGGGC
GGATACCGTT CCGTAATTAT CTGTACGTTC TTGCTGGGTA TTATTCAGAC CTTCGGTACG
GTGTGGGCTA TTCCGTTAAC CGGGCTTGCT GAAAATGGCG TCGGCTGGAC AGGAATATTT
GACTGGGCAA CCGTATGGCC TGCTATTTGT GAAGTTCTGA AATTTATCGC TGCAACATTC
CATCTTGGTC CTTACGCGGG TTAA
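
The gene sequence is printed in blocks of ten bases separated by spaces, so it needs to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC percentage, assuming the sequence contains only the letters A, C, G, and T. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listed above.
first_line = "ATGGAGGGCG TTCCGACGAT GTTTGCCAAA TTCATAGATG TTATTCAAAC GTTTTTAACT "
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_content(seq), 1))
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content field of the record.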
 
Protein sequence
MEGVPTMFAK FIDVIQTFLT EPAILIGLLV GIGYALDKKS PIKIITGMVS AMVGLMMVLF 
GGFQFSATFK PVAEAVSKAY GVHGYLMDSY AMKAATQIAL GDNFGYVGYV FVLAFFTNLL
LVLFGRYTGA KGIFLTGNTG VSHSQAVLWL IVFWLGFSWT TSIIIAGILT GVFWAFSTTL
IVKPIAKVTK DAGFTIAHNQ MLGLWFFSKF AHKFGDPEKH DAENLKLPGW LAIFNHNVTA
IAIVMTLFVG GFLLSTGIDN VQLMAKGKPW YIYIINLGLQ FSMYMVILLQ GVRMMVGEIN
GSFKGWQDRF IPNAIPAVDV AALLPFSPNA ATLGFVFCTF GTIFSMGILL LVHSPIMVLP
GFVPLFFSGG PIGVLANRMG GYRSVIICTF LLGIIQTFGT VWAIPLTGLA ENGVGWTGIF
DWATVWPAIC EVLKFIAATF HLGPYAG
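
The protein sequence can be cross-checked against the gene sequence by translating the CDS codon by codon. The sketch below builds a codon table by hand. NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as start codons, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative, and the function assumes an in-frame sequence containing only A, C, G, and T.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Codons enumerated in TCAG order line up with the amino-acid string above.
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Whitespace is ignored; any character other than A, C, G, T
    raises KeyError.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence listed above.
print(translate("ATGGAGGGCG TTCCGACGAT GTTTGCCAAA"))  # MEGVPTMFAK
print(translate("CATCTTGGTC CTTACGCGGG TTAA"))        # HLGPYAG
```

Both outputs match the first and last residues of the protein sequence in this record.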