Gene ECH74115_3508 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3508 
Symbol 
ID6967267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3252488 
End bp3254425 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content61% 
IMG OID643387309 
Producttail fiber protein 
Protein accessionYP_002271772 
Protein GI209398916 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.283696 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTGTTG TTGTTTCGGG GACGCTGAAA TCTCCTGATG GTGAGGCGAT ATCAGGAGCA 
AATATTACCC TGACGGCGCT GACAGTTTCA CCGGATGCGC TCAGCGGCAC CAGTGCGTCG
GCAGTGACCC GTGAAGGTGG ATATTACGGA ATGACGATGG ATCCGGGGGA GTATGCGGTT
TCGGTGACGG TGAAAGGGAA GACTGTTGTC TACGGACGTG TGCGTATTGA GGGGACCGAA
AGTACGGTGA CGCTCAATAT GCTGTTACGC CGCAGTCTTG TTGAGGTTAG CATACCCGGA
GAACTGCTGA CAGATTTCCG GCAGATACAG AATAATGTGG CTGATGACCT TGCCACTATT
CGTCGCCTGA ATGAAGACAC GGCGACAAAA AACACTCAGG CCACACAGTC AAAAGAAAGT
GCAGCAGCCA GTGCGAAGAG TGCATCTGAC AGTGCAAAGA CGGCAACCAG CAGGGCGGCT
GAAGCCGGAC AAAAAGCGAC TGATGCCACT GAGGCTGCGA CCCGTGCAGT CACAGCAGCG
GGGAATGCAG AGGAAAGCTC GACCCGTGCC GGAGAGTCTG AAAAAGCCGC CGGAGCTGAT
GCAGAAAAAG CCAGACAGCA TGCTGAAAAG GCCAGGCTGG CGCAGGAGAG CGCCGGAGAG
ATCCTTAAGC GGGCAGAGGC TGCCACTGTC AGTGCTGAAG AGGCCAGACG TATGGCTGAG
AATGCACGGG GGCCCCGGGG GCCTCAGGGA GAAACTGGTC CGAAGGGGGA TGTCGGTCCT
AAAGGCGAAA CAGGTCCAGT GGGCCCTCAA GGGCCCGCAG GGCCGAAAGG TGAGCGTGGT
GACGTTGGTG CTCAGGGGGC TGTAGGGCCT GCTGGTCCGC GTGGTGAGAA GGGCGAACAG
GGGGAGCGAG GACCGCAGGG AATACCAGGC CTGAAGGGGG ATACCGGAGA GCGGGGGCCT
AAAGGGGACC AGGGGGATAT GGGGCCAAAA GGCGAGAAAG GTGATCCGGG AGGTCCTGCA
GGCCCGCAAG GTCCTAAAGG CGAACGAGGA GAAGCCGGAC CACAGGGACC GATGGGAGCA
CGAGGTGAGC GTGGGGAGAC TGGCCCCCGA GGTGAACCTG GTCCTGCAGG TCCGAGAGGC
GAACGAGGAG AGACCGGACC TCAGGGACCT CGTGGAGAGC CAGGTCCGGC AGGCAGCGCT
GCAAATGTGG CTGATGCAAC GACGGCACAG AAGGGAATTG TGCAGTTAAG CAGCGCAACG
GACAGTGATG ATGAAACGAA GGCTGCCACC CCGAAAGCGG TGAAAGCGGC AATGGATGTG
GCAAATGAAG CGAAAACAAA GGCAGAAGAG GCTGCAGCAG GAGGTGGTGT TCCCGGTCCG
AAAGGAGATA AAGGGGACAC GGGGCCAGCA GGTCCGGCTG GGCCGAAGGG TGATAAGGGA
GAGCGCGGTG ACACCGGCCC TGTCGGGGCA ACCGGCGAAC GGGGACCGGC AGGTGATGCT
GGTCCGGCAG GCCCGCAGGG GCCGAAAGGT GACAGGGGAG AGCGGGGAGA GACCGGTCTG
ACGGGAAATG CAGGTCCACA GGGTCCAAAG GGAGATACCG GTGCGGCAGG CCCGGCAGGC
CCACAGGGAC CGAAAGGAGA AACAGGTGCG GCAGGCCCGG TGGGGGCGAC CGGACCTCAG
GGGCCGAAGG GCGACCCGGG GGAGACGCAA ATACGGTTCC GTCTGGGGCC GGGAAACATT
ATTGAGACAA ACAGCCATGG CTGGTTCCCG GATACAGATG GCGCACTCAT CACCGGACTG
ACCTTTCTTG ACCCCAAAGA TGCCACACGG GTTCAGGGTT TTTTTCAGCA TTTGCAGGTC
AGGTTTGGTG ACGGGCCGTG GCAGGATGTC AAGGGGCTGG ATGAAGTGGG CAGTGATACA
GGCAGAACAG GAGAATGA
 
Protein sequence
MSVVVSGTLK SPDGEAISGA NITLTALTVS PDALSGTSAS AVTREGGYYG MTMDPGEYAV 
SVTVKGKTVV YGRVRIEGTE STVTLNMLLR RSLVEVSIPG ELLTDFRQIQ NNVADDLATI
RRLNEDTATK NTQATQSKES AAASAKSASD SAKTATSRAA EAGQKATDAT EAATRAVTAA
GNAEESSTRA GESEKAAGAD AEKARQHAEK ARLAQESAGE ILKRAEAATV SAEEARRMAE
NARGPRGPQG ETGPKGDVGP KGETGPVGPQ GPAGPKGERG DVGAQGAVGP AGPRGEKGEQ
GERGPQGIPG LKGDTGERGP KGDQGDMGPK GEKGDPGGPA GPQGPKGERG EAGPQGPMGA
RGERGETGPR GEPGPAGPRG ERGETGPQGP RGEPGPAGSA ANVADATTAQ KGIVQLSSAT
DSDDETKAAT PKAVKAAMDV ANEAKTKAEE AAAGGGVPGP KGDKGDTGPA GPAGPKGDKG
ERGDTGPVGA TGERGPAGDA GPAGPQGPKG DRGERGETGL TGNAGPQGPK GDTGAAGPAG
PQGPKGETGA AGPVGATGPQ GPKGDPGETQ IRFRLGPGNI IETNSHGWFP DTDGALITGL
TFLDPKDATR VQGFFQHLQV RFGDGPWQDV KGLDEVGSDT GRTGE