Gene ECH74115_2230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2230 
Symbol 
ID6967152 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2118550 
End bp2119773 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content60% 
IMG OID643386118 
Producttail fiber protein 
Protein accessionYP_002270605 
Protein GI209399699 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3064] Membrane protein involved in colicin uptake 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.000000293165 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAGTAA AGATTTCAGG TGTACTGAAA GACGGCACAG GAAAACCGGT AGAGAACTGC 
ACCATTCAAC TGAAAGCCAG ACGTAACAGC GCCACGGTGG TGGTGAACAC GGTGGCCTCT
GAAAATCCGG ATGAAGCCGG TCGTTACAGC ATGGACGTTG AGTACGGTCA GTACAGCGTT
ATTCTGTTGG TGGAAGGGTT CCCGCCGTCA CATGCCGGGA CCATCACCGT GTATGAAGAT
TCTCAACCGG GGACGCTGAA TGATTTTCTC GGTGCCATGT CGGAGGATGA CGTCCGGCCG
GAGGCACTGC GTCGTTTTGA ACTGATGGTG GAAGAAGCGG CGCGTCACGC TGAGGAGGCG
AAGAAGAATG CCGGAGAGGC GGAGACGTCC GCGAGGAATG CCGGCATATC AGCCAGTCAG
GCAGAAGAGA GCGCGGCAAA TGCTGACACT TCAGCAGGGG ATGCATCGGA GTCAGCCCGG
CAGGCGGCAG AAAGTGCAGC CGCTGCAAAG CAGTCAGAGG AGGCGTCCTC GTCCTCGGCC
TCTGCGGCCG CTCAAAAAGC CAGTGAGTCA TCACAAAGTG CAGCAGATGC TGAGTTGTCA
AAAAAGACGG CAGAAAGTGC AGCCGGTAAT GCAGCCAGGG ATGCAACGAC CGCAACAGAA
AAAGCCCGGG AGTCAGCAGA AAGCGCACAG TCAGCGGAAC AAAGCAGGAT AGCGGCGGAA
GAGGCCGTAA ACCGAATCCC CACGGTGGTG GGGCCTCCCG GGCCAAAGGG GGAACCGGGT
CCCGCGGGTC CTCAGGGGCC GAAGGGAGAT AAAGGAGAGC GTGGAGACAC CGGTCCGGCA
GGGGCAACCG GTGAAAGGGG GCCGGCAGGT GATGCTGGTC CGGCAGGCCC GGCAGGCCCG
GCAGGCCCAC AGGGACCGAA AGGAGAAACA GGTGCGGCTG GCCCGGTGGG GGCAACCGGA
CCTCAGGGGC CGAAGGGCGA CCCGGGGGAG ACGCAAATAC GGTTCCGTCT GGGGCCGGGA
AACATTATTG AGACAAACAG CCATGGCTGG TTCCCGGATA CAGATGGCGC ACTCATCACC
GGACTGACCT TTCTTGACCC CAAAGATGCC ACACGGGTTC AGGGTTTTTT TCAGCATTTG
CAGGTCAGGT TTGGTGACGG GCCGTGGCAG GATGTTAAGG GGCTGGATGA AGTGGGCAGT
GATACAGGCA GAACAGGAGA ATGA
 
Protein sequence
MAVKISGVLK DGTGKPVENC TIQLKARRNS ATVVVNTVAS ENPDEAGRYS MDVEYGQYSV 
ILLVEGFPPS HAGTITVYED SQPGTLNDFL GAMSEDDVRP EALRRFELMV EEAARHAEEA
KKNAGEAETS ARNAGISASQ AEESAANADT SAGDASESAR QAAESAAAAK QSEEASSSSA
SAAAQKASES SQSAADAELS KKTAESAAGN AARDATTATE KARESAESAQ SAEQSRIAAE
EAVNRIPTVV GPPGPKGEPG PAGPQGPKGD KGERGDTGPA GATGERGPAG DAGPAGPAGP
AGPQGPKGET GAAGPVGATG PQGPKGDPGE TQIRFRLGPG NIIETNSHGW FPDTDGALIT
GLTFLDPKDA TRVQGFFQHL QVRFGDGPWQ DVKGLDEVGS DTGRTGE