Gene ECH74115_3052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3052 
Symbol 
ID6972087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2826559 
End bp2828331 
Gene Length1773 bp 
Protein Length590 aa 
Translation table11 
GC content57% 
IMG OID643386884 
Productphage large terminase subunit GpP 
Protein accessionYP_002271352 
Protein GI209400321 
COG category[S] Function unknown 
COG ID[COG5484] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.680947 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.000000111956 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCATCA CCACAGACAC CACTCTTTTA CACGACCCGC GTCGTCAGGC GGCGCTGCTG 
TACTGGCAGG GATTTTCCGT GCCGCAGATT GCCGCCATGT TGCAGATGAA ACGCCCGACG
GTGCAGAGCT GGAAACAGCG CGACGGCTGG GACAGCGTTG CCCCCATCAG CCGTGTCGAA
ATGAGTCTGG AAGCGCGGCT GACCCAGCTC ATCATCAAAC CGCAGAAAAC CGGCGGTGAC
TTCAAGGAAA TTGACCTGCT CGGACGCCAG ATTGAACGAC TGGCACGGGT AAACCGCTAC
AGCCAGACCG GCAACGAGGC AGACCTTAAT CCGAACATCG CTAACCGCAA CAAAGGCGGG
CGGCGCAAAC CGAAAAAGAA TTTTTTCAGT GACGAGGCTA TCGAAAAGCT GGAGCAGATT
TTCTTTGAGC AGTCTTTCGA ATATCAGTTG CACTGGTATC GCGCCGGGCT TGAGCACCGC
ATCCGCGATA TCCTGAAATC CCGCCAGATT GGCGCGACGT TTTATTTTTC CCGCGAGGCG
CTGCTGCGCG CCCTGAAAAC CGGTCATAAC CAGATTTTTC TGTCGGCCAG TAAAACGCAG
GCGTATGTGT TCCGCGAATA CATCATCGCC TTTGCCCGGC TGGTTGACGT TGACCTGACC
GGTGACCCGA TTGTCCTGGG CAATAACAGC GCAAAACTGA TTTTTCTCGG CACCAACTCC
AACACCGCGC AGAGCCATAA CGGCGACCTG TACGTCGACG AGATTTTCTG GATACCGAAT
TTTCAGGTAC TGCGTAAGGT GGCATCAGGT ATGGCCTCAC AGAGTCACCT GCGCTCGACC
TATTTCTCCA CCCCGTCCAC GCTGGCGCAC GACGCCTACC CGTTCTGGTC GGGTGAACTG
TTCAACCGGG GACGCGCCAG CGCCGCCGAA CGCGTGGAAA TCGACGTCAG TCATAACGCT
CTTGCCGGTG GGCTTCTCTG TGCGGACGGT CAGTGGCGGC AGATTGTCAC CATTGAGGAC
GCCCTGAAAG GTGGCTGCAC GCTGTTCGAC ATTGAGCAGC TTAAACGCGA AAACAGCGCC
GACGATTTTA AAAACCTGTT CATGTGTGAA TTTGTTGACG ACAAGGCGTC GGTGTTCCCG
TTCGAGGAGC TGCAACGCTG CATGGTCGAC ACGCTGGAAG AATGGGAAGA CTATGCGCCG
TTTGCCGCGA ATCCGTTCGG CTCCCGCCCG GTCTGGATTG GTTACGACCC GTCACACCGT
GGCGACAGTG CCGGATGCGT GGTGCTGGCA CCGCCGGTGG TGGCCGGTGG CAAATTCAGA
ATACTTGAGC GTCACCAGTG GAAAGGCATG GACTTTGCCA CCCAGGCTGA ATCCATCCGC
AAACTCACCG AAAAATACAA CGTCGAATAC ATCGGAATTG ATGCCACCGG CCTCGGTGTC
GGCGTGTTCC TGCTCGTTCG CTCGTTCTAT CCCGCCGCAC GCGATATCCG CTACACGCCG
GAAATGAAAA CCGCAATGGT GCTCAAGGCA AAAGACGTTA TTCGCCGTGG CTGTCTGGAA
TATGACGTCA GCGCCACCGA CATCACCAGC TCGTTTATGG CTATCCGCAA GACCATGACC
AGCAGCGGAC GCAGCGCCAC CTATGAGGCC AGCCGCAGCG AGGAAGCCAG CCACGCCGAC
CTCGCCTGGG CGACCATGCA CGCCCTGTTA AATGAGCCAC TCACCGCCGG TATCAGCACC
CCGCTGACAT CCACCATTCT GGAGTTTTAC TGA
 
Protein sequence
MTITTDTTLL HDPRRQAALL YWQGFSVPQI AAMLQMKRPT VQSWKQRDGW DSVAPISRVE 
MSLEARLTQL IIKPQKTGGD FKEIDLLGRQ IERLARVNRY SQTGNEADLN PNIANRNKGG
RRKPKKNFFS DEAIEKLEQI FFEQSFEYQL HWYRAGLEHR IRDILKSRQI GATFYFSREA
LLRALKTGHN QIFLSASKTQ AYVFREYIIA FARLVDVDLT GDPIVLGNNS AKLIFLGTNS
NTAQSHNGDL YVDEIFWIPN FQVLRKVASG MASQSHLRST YFSTPSTLAH DAYPFWSGEL
FNRGRASAAE RVEIDVSHNA LAGGLLCADG QWRQIVTIED ALKGGCTLFD IEQLKRENSA
DDFKNLFMCE FVDDKASVFP FEELQRCMVD TLEEWEDYAP FAANPFGSRP VWIGYDPSHR
GDSAGCVVLA PPVVAGGKFR ILERHQWKGM DFATQAESIR KLTEKYNVEY IGIDATGLGV
GVFLLVRSFY PAARDIRYTP EMKTAMVLKA KDVIRRGCLE YDVSATDITS SFMAIRKTMT
SSGRSATYEA SRSEEASHAD LAWATMHALL NEPLTAGIST PLTSTILEFY