Gene ECH74115_2058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2058 
Symbol 
ID6969379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1953661 
End bp1954722 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content50% 
IMG OID643385970 
Producthypothetical protein 
Protein accessionYP_002270459 
Protein GI209399084 
COG category[S] Function unknown 
COG ID[COG3391] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02276] 40-residue YVTN family beta-propeller repeat 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.132417 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.00041859 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCATTTAC GTCATCTGTT TTCATTGCGC CTGCGTGGTT CATTACTGTT AGGTTCATTG 
CTTGTTGCTT CATCATTCAG TACGCAGGCC GCAGAAGAAA TGCTGCGTAA AGCGGTAGGT
AAAGGTGCCT ACGAAATGGC TTATAGCCAG CAAGAAAACG CGCTGTGGCT CGCCACTTCG
CAAAGCCGCA AACTGGATAA AGGCGGCGTG GTTTATCGTC TTGATCCGGT CACTCTGGAA
GTGACGCAGG CGATCCATAA CGATCTCAAG CCGTTTGGTG CCACCATCAA TAACACGACT
CAGACGTTGT GGTTTGGTAA CACCGTAAAC AGCGCGGTCA CGGCGATAGA TGCCAAAACT
GGCGAGGTGA AAGGCCGTCT GGTGCTGGAT GATCGTAAGC GCACGGAAGA GGTGCGCCCG
CTGCAACCGC GTGAGCTGGT AGCTGATGAT GCCACGAACA CCGTTTACAT CAGTGGTATT
GGTAAAGATA GCGTGATTTG GGTCGTTGAT GGCGAGAATA TCAAACTGAA AACCGCCATC
CAGAACACCG GTAAAATGAG TACCGGTCTG GCACTGGATA GCAAAGGCAA ACGTCTTTAC
ACCACTAACG CTGACGGCGA ATTGATTACC ATCGACACCG CCGACAATAA AATCCTCAGC
CGTAAAAAGC TGCTGGATGA CGGCAAAGAG CACTTCTTTA TCAACATCAG CCTTGATACC
GCCAGGCAGC GTGCATTTAT CACCGATTCT AAAGCGGCAG AAGTGTTAGT GGTCGATACC
CGTAATGGCA ATATTCTGGC GAAGGTTGCG GCACCGGAAT CACTGGCTGT GCTGTTTAAC
CCAGCGCGTA ATGAAGCCTA CGTAACGCAT CGTCAGGCAG GTAAAGTCAG TGTGATTGAC
GCGAAAAGCT ATAAAGTGGT GAAAACGTTC GATACGCCGA CTCATCCGAA CAGCCTGGCG
CTGTCTGCCG ATGGCAAAAC GCTGTATGTC AGTGTGAAAC AAAAATCCAC TAAACAGCAG
GAAGCTACCC AGCCAGACGA TGTGATTCGT ATTGCGCTGT AA
 
Protein sequence
MHLRHLFSLR LRGSLLLGSL LVASSFSTQA AEEMLRKAVG KGAYEMAYSQ QENALWLATS 
QSRKLDKGGV VYRLDPVTLE VTQAIHNDLK PFGATINNTT QTLWFGNTVN SAVTAIDAKT
GEVKGRLVLD DRKRTEEVRP LQPRELVADD ATNTVYISGI GKDSVIWVVD GENIKLKTAI
QNTGKMSTGL ALDSKGKRLY TTNADGELIT IDTADNKILS RKKLLDDGKE HFFINISLDT
ARQRAFITDS KAAEVLVVDT RNGNILAKVA APESLAVLFN PARNEAYVTH RQAGKVSVID
AKSYKVVKTF DTPTHPNSLA LSADGKTLYV SVKQKSTKQQ EATQPDDVIR IAL