Gene ECH74115_1203 details

Gene Information

Locus tag: ECH74115_1203
Symbol:
ID: 6970093
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1212392
End bp: 1213747
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 60%
IMG OID: 643385200
Product: tail fiber protein
Protein accession: YP_002269696
Protein GI: 209400674
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3064] Membrane protein involved in colicin uptake
TIGRFAM ID:
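
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1212392, 1213747)
print(length)                  # 1356
print(protein_length(length))  # 451
```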


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGCCC GCCGGTTCAG GCGGCTTTTT TGTGGGGTGA ATATGGCAGT AAAGATTTCA 
GGTGTACTGA AAGACGGCAC AGGAAAACCG GTAGAGAACT GCACCATTCA ACTGAAAGCC
AGACGGACCA GCAGCACGGT GGTGGTGAAC ACGGTGGCCT CTGAAAATCC GGATGAAGCC
GGTCGTTACA GCATGGACGT TGAGTACGGT CAGTACAGCG TCATTCTGTT GGTGGAAGGA
TTCCCGCCGT CACATGCCGG GACCATCACC GTGTATGAAG ATTCTCAACC GGGGACGCTG
AATGATTTTC TCGGTGCCAT GTCGGAGGAT GACGTCCGGC CGGAGGCACT GCGTCGTTTT
GAACTGATGG TGGAAGAAGC GGCGCGTCAC GCTGAGGAGG CGAAGAAGAA TGCCGGAGAG
GCGGAGACGT CCGCGAGGAA TGCCGGCATA TCAGCCAGTC AGGCAGAAGA GAGCGCGGCA
AATGCTGACA CTTCAGCAGG GGATGCATCG GAGTCAGCCC GGCAGGCGGC AGAAAGTGCA
GCCGCTGCAA AGCAGTCAGA GGAGGCGTCC TCGTCCTCGG CTTCTGCGGC CGCTCAAAAA
GCCAGTGAGT CATCACAAAG TGCAGCAGAA GCTGAATTGT CAAGAAAGAC GGCAGAAAGT
GCAGCCGGTA ATGCAGCCAG GGATGCAACG ACCGCAACAG AAAAAGCCCG GGAGTCAGCA
GAAAGCGCAC AGTCAGCGGA ACAAAGCAGG ATAGCGGCGG AAGAAGCCGT AAACCGAATC
CCCACCGTGG TGGGACCTCC CGGGCCAAAG GGGGAACCGG GTCCCGCGGG TCCTCAGGGG
CCGAAGGGAG ATAAAGGAGA GCGTGGCGAC ACCGGCCCGG CAGGGGCAAC CGGCGAACGG
GGACCGGCAG GTGATGCTGG TCCGGCAGGC CCGCAGGGGC CGAAAGGTGA CAGGGGAGAG
CGGGGAGAGA CCGGTCTGAC GGGAAATGCA GGTCCACAGG GTCCAAAGGG AGACACCGGG
GCAGCAGGCC CGGCAGGCCC ACAGGGACCG AAAGGAGAAA CAGGTGCGGC TGGCCCGGTG
GGGGCAACCG GACCTCAGGG ACCGAAGGGC GACCCGGGGG AGACACAAAT CCGTTTTCGT
CTGGGGCCGG CGAGCATTAT TGAGACAAAC AGCCATGGCT GGTTCCCGGG TACAGATGGT
GCGCTCATCA CCGGACTGAC CTTTCTTGCC CCCAAAGATG CCACACGGGT TCAGGTTTTT
TTTCAGCATT TGCAGGTCAG GTTTGGTGAC GGGCCGTGGC AGGATGTTAA GGGGCTGGAT
GAAGTGGGCA GTGATACAGG CAGAACAGGA GAATGA
 
Protein sequence
MTARRFRRLF CGVNMAVKIS GVLKDGTGKP VENCTIQLKA RRTSSTVVVN TVASENPDEA 
GRYSMDVEYG QYSVILLVEG FPPSHAGTIT VYEDSQPGTL NDFLGAMSED DVRPEALRRF
ELMVEEAARH AEEAKKNAGE AETSARNAGI SASQAEESAA NADTSAGDAS ESARQAAESA
AAAKQSEEAS SSSASAAAQK ASESSQSAAE AELSRKTAES AAGNAARDAT TATEKARESA
ESAQSAEQSR IAAEEAVNRI PTVVGPPGPK GEPGPAGPQG PKGDKGERGD TGPAGATGER
GPAGDAGPAG PQGPKGDRGE RGETGLTGNA GPQGPKGDTG AAGPAGPQGP KGETGAAGPV
GATGPQGPKG DPGETQIRFR LGPASIIETN SHGWFPGTDG ALITGLTFLA PKDATRVQVF
FQHLQVRFGD GPWQDVKGLD EVGSDTGRTG E
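
The gene and protein sequences above can be cross-checked with a minimal sketch that strips the 10-base grouping, computes GC content, and translates codons. It assumes the sequence is given in the coding orientation and in frame from its first base. The codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in its permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; stop codons become '*'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence listed above.
first_line = "ATGACAGCCC GCCGGTTCAG GCGGCTTTTT TGTGGGGTGA ATATGGCAGT AAAGATTTCA"
print(translate(first_line))  # MTARRFRRLFCGVNMAVKIS
```

Passing the full gene sequence to `translate` should give the listed protein followed by a trailing `*` for the TGA stop codon, and `gc_content` on the same input can be compared with the GC content field.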