Gene EcolC_2778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2778 
Symbol 
ID6064866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3045615 
End bp3046640 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content51% 
IMG OID641602184 
ProductPBSX family phage portal protein 
Protein accessionYP_001725733 
Protein GI170020779 
COG category[R] General function prediction only 
COG ID[COG5518] Bacteriophage capsid portal protein 
TIGRFAM ID[TIGR01540] phage portal protein, PBSX family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000105217 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGAAAGA GTAAGAAGAA CCGCACTGCG GCGACGAAAC AGATCCAGCT TAAAAGTCAA 
ACTACAGCCG AAGCATTCAG CTTCGGCGAT CCCGTTCCTG TTCTGGACCG CCGAGAACTG
CTGGATTATG TGGAATGCGT ACAGATGGAC CGCTGGTATG AGCCGCCCGT CAGCTTTGAC
GGACTGGCGC GCACCTTCCG CGCTGCCGTG CATCATAGTT CCCCGATTGC AGTAAAGTGC
AACATTCTGA CCAGCACCTA CATCCCTCAC CCGCTGCTCA GCCAGCAGGC TTTTTCGCGT
TTTGTGCAGG ACTATCTGGT ATTTGGTAAC GCCTACCTGG AGAAACGCAC GAACCGCTTC
GGTGAAGTTA TCGCCCTTGA ACCTGCCCTG GCAAAATACA CCCGACGCGG GTTAGACCTG
GATACCTACT GGTTTGTGCA ATACGGTATG ACCACGCAGC CATATCAGTT CACGAAAGGC
AGCATCTTTC ATCTGATGGA ACCGGACATC AACCAGGAGA TCTACGGCCT GCCCGGTTAT
CTTTCTGCCA TTCCGTCAGC CCTGCTCAAC GAGTCCGCCA CGCTGTTCCG CCGAAAGTAT
TACATTAACG GCAGTCATGC TGGCTTCATC ATGTACATGA CCGATGCTGC GCAGAACCAG
GAGGATGTGA ACAACCTCCG CAACGCAATG AAAAGCGCCA AAGGTCCAGG CAACTTCCGC
AACCTGTTTA TGTACTCACC TAACGGCAAA AAGGATGGTC TTCAGATTAT CCCGTTGTCA
GAAGTCGCGG CAAAGGATGA ATTTCTGAAC ATCAAGAACG TGAGCCGGGA TGACATGATG
GCGGCACACC GCGTGCCTCC GCAAATGATG GGTATCATGC CGAATAATGT TGGCGGGTTT
GGGGATGTGG AGAAGGCATC CACGGTTTTT GTACGTAATG AATTAAAGCC TCTTCAACAA
CGAATTAGAG AGGTGAACAA TTGGCTACAT GATGACGTAA TAAAATTCCA AGATTACTCC
TTGTAA
 
Protein sequence
MGKSKKNRTA ATKQIQLKSQ TTAEAFSFGD PVPVLDRREL LDYVECVQMD RWYEPPVSFD 
GLARTFRAAV HHSSPIAVKC NILTSTYIPH PLLSQQAFSR FVQDYLVFGN AYLEKRTNRF
GEVIALEPAL AKYTRRGLDL DTYWFVQYGM TTQPYQFTKG SIFHLMEPDI NQEIYGLPGY
LSAIPSALLN ESATLFRRKY YINGSHAGFI MYMTDAAQNQ EDVNNLRNAM KSAKGPGNFR
NLFMYSPNGK KDGLQIIPLS EVAAKDEFLN IKNVSRDDMM AAHRVPPQMM GIMPNNVGGF
GDVEKASTVF VRNELKPLQQ RIREVNNWLH DDVIKFQDYS L