Gene EcolC_2775 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2775 
Symbol 
ID6064857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3041798 
End bp3042856 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content55% 
IMG OID641602181 
ProductP2 family phage major capsid protein 
Protein accessionYP_001725730 
Protein GI170020776 
COG category 
COG ID 
TIGRFAM ID[TIGR01551] phage major capsid protein, P2 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.765557 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000193345 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAGAAGA ATACCCGCTT TGCTTTTAAC GCTTACCTGC AGCAGCTGGC GCGTCTGAAC 
GGTGTGGCAG TTGAAGAACT GTCCAGCAAG TTCACCGTAG AGCCGTCTGT ACAGCAGACG
CTGGAAGACC AGATTCAGCA ATCCGCCGCT TTCCTGACGC TGATTAACGT CACGCCAGTG
ACTGAGCAGT CTGGTCAGTT GCTGGGGCTG GGTGTTGGCA GCACCATTGC CGGAACCACT
GACACCACCG CGAAAGAGCG TGAACCTGTC GATCCGACGC TGATGGTCGA TGTGGAATAC
AAATGCGAGC AGACCAACTT TGACACGGTG CTGACCTACG CGAAGCTGGA CTTGTGGGCG
AAGTTTCAGG ATTTCCAGGT GCGTATCCGT GACGCCATCG TGAAACGTCA GGCACTGGAC
CGCATCATGA TCGGCTTTAA CGGCGTGAAG CGTGCGAAAA CCTCCAACCG TAGCGAAAAC
CCGCTGCTGC AGGATGTGAA CAAAGGCTGG CTGCAGAAAA TCCGTGAGGA TGCACCGGAT
CATGTCATGG GCAGCACCAC CACGGGCGGT GAAACCACAC CGGGTGCGGT GAAAGTCGGG
AAAGGTGGCG AATATGCCAA CCTGGACGCC GTGGTGATGG ATGCGGTCAA TGAGCTTATC
GACGTGGTCT ACCAGGACGA TGACGATCTG GTGGTGATTT GCGGTCGTGA ACTGTTGTCT
GACAAGTATT TCCCGCTGGT CAACAAAGAG CAGGAAAACA GTGAAAAACT GGCTGCCGAT
ATGATCATCA GCCAGAAACG CATGGGTGGC CTGCAGGCCG TGCGTGCGCC GTTCTTCCCG
CCGAATGCGC TGCTGATCAC CCGTCTGGAT AACCTGTCCA TCTACTGGCA GGAAGATACC
CGCCGCCGTT CAGTTATTGA CAACCCGAAA CGTGACCGGA TTGAAAACTT TGAATCCGTT
AACGAAGCCT ACGTGGTTGA GGACTACCGT TGCGCTGCAC TGGTGGAAAA CATCCAGATT
GGCGATTTCA GCGCCGCCGC AGCAGAAGCC GGAGCGTAA
 
Protein sequence
MKKNTRFAFN AYLQQLARLN GVAVEELSSK FTVEPSVQQT LEDQIQQSAA FLTLINVTPV 
TEQSGQLLGL GVGSTIAGTT DTTAKEREPV DPTLMVDVEY KCEQTNFDTV LTYAKLDLWA
KFQDFQVRIR DAIVKRQALD RIMIGFNGVK RAKTSNRSEN PLLQDVNKGW LQKIREDAPD
HVMGSTTTGG ETTPGAVKVG KGGEYANLDA VVMDAVNELI DVVYQDDDDL VVICGRELLS
DKYFPLVNKE QENSEKLAAD MIISQKRMGG LQAVRAPFFP PNALLITRLD NLSIYWQEDT
RRRSVIDNPK RDRIENFESV NEAYVVEDYR CAALVENIQI GDFSAAAAEA GA