Gene ECH74115_3050 details


Gene Information

Locus tag: ECH74115_3050
Symbol:
ID: 6971965
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2824399
End bp: 2825472
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 55%
IMG OID: 643386882
Product: phage major capsid protein GpN
Protein accession: YP_002271350
Protein GI: 209400016
COG category:
COG ID:
TIGRFAM ID: [TIGR01551] phage major capsid protein, P2 family
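
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. Neither assumption is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2824399
END_BP = 2825472
GENE_LENGTH_BP = 1074
PROTEIN_LENGTH_AA = 357


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp):
    """Amino-acid codon count, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, the span from 2824399 to 2825472 covers 1074 bp, which is 358 codons: 357 amino acids plus the stop.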


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.0000148658
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGCCAGG AAACCCGCTT TAAATTTAAT GCCTACCTGT CCCGTGTTGC CGAACTGAAC 
GGCATCGACG CCGGTGATGT GTCGAAAAAA TTCACCGTTG AACCGTCGGT CACCCAGACC
CTGATGAACA CCATGCAGGA GTCCTCTGAC TTTCTGACCC GCATCAACAT TGTGCCGGTC
AGCGAAATGA AAGGGGAAAA AATTGGTATT GGTGTCACCG GCTCCATCGC CAGCACCACA
GACACCGCCG GTGGCACCGA GCGTCAGCCG AAGGACTTCT CGAAGCTGGC GTCAAACAAG
TACGAATGCG ACCAGATTAA CTTCGATTTT TATATCCGCT ACAAAACGCT TGACCTGTGG
GCGCGTTATC AGGATTTCCA GCTCCGTGTC CGTAACGCCA TTATCAAACG CCAGTCCCTT
GATTTAATCA TGGCCGGTTT TAACGGCGTG AGGCGTGCCG AAACCTCTGA CCGCAGCAGT
AACCAGATGC TGCAGGATGT GGCGGTCGGC TGGCTGCAGA AATACCGCAA TGAAGCCCCG
GCGCGCGTGA TGAGCAAGGT TACTGACGAG GAAGGTCACA CGACCTCTGA GGTCATCCGC
GTGGGTAAGG GCGGTGATTA TGCCAGCCTC GATGCACTGG TGATGGATGC GACCAACAAC
CTGATTGAGC CGTGGTATCA GGAAGACCCT GACCTTGTGG TGATTGTGGG GCGTCAGCTG
CTGGCGGACA AGTATTTTCC CATCGTCAAC AGGGAGCAGG ACAACAGCGA GATGCTGGCC
GCTGACGTCA TCATCAGCCA GAAACGCATC GGTAACCTGC CGGCGGTACG CGTCCCGTAC
TTCCCGGCGG ATGCGATGCT CATCACGAAG CTGGAAAACC TGTCCATCTA CTACATGGAT
GACAGCCATC GCCGCGTGAT TGTGGAAAAC CCGAAACTCG ACCGCGTGGA GAACTACGAG
TCAATGAACA TTGATTACGT GGTGGAAGAC TACGCCGCCG GTTGTCTGGT GGAAAAAATT
AAGGTCGGTG ACTTCTCCAC ACCGACTAAA GTGACCGCAG AGCCGGGAGC GTAA
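
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any per-base statistic such as GC content can be computed. The sketch below shows one way to do that. The helper names are hypothetical, and only the first row of the sequence is included as sample input, so the printed fraction describes that row and not the whole gene.

```python
def clean_sequence(text):
    """Remove all whitespace (block gaps, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence as displayed in this record.
FIRST_ROW = "ATGCGCCAGG AAACCCGCTT TAAATTTAAT GCCTACCTGT CCCGTGTTGC CGAACTGAAC"

if __name__ == "__main__":
    seq = clean_sequence(FIRST_ROW)
    print(len(seq), "bases in the first row")
    print("starts with ATG:", seq.startswith("ATG"))
    print("GC fraction of first row:", round(gc_fraction(FIRST_ROW), 3))
```

Pasting the full sequence into `gc_fraction` gives a value that can be compared with the GC content field of the record.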
 
Protein sequence
MRQETRFKFN AYLSRVAELN GIDAGDVSKK FTVEPSVTQT LMNTMQESSD FLTRINIVPV 
SEMKGEKIGI GVTGSIASTT DTAGGTERQP KDFSKLASNK YECDQINFDF YIRYKTLDLW
ARYQDFQLRV RNAIIKRQSL DLIMAGFNGV RRAETSDRSS NQMLQDVAVG WLQKYRNEAP
ARVMSKVTDE EGHTTSEVIR VGKGGDYASL DALVMDATNN LIEPWYQEDP DLVVIVGRQL
LADKYFPIVN REQDNSEMLA ADVIISQKRI GNLPAVRVPY FPADAMLITK LENLSIYYMD
DSHRRVIVEN PKLDRVENYE SMNIDYVVED YAAGCLVEKI KVGDFSTPTK VTAEPGA