Gene EcHS_A0923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0923 
Symbol 
ID5594297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp927547 
End bp928572 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content50% 
IMG OID640920093 
ProductPBSX family phage portal protein 
Protein accessionYP_001457660 
Protein GI157160342 
COG category[R] General function prediction only 
COG ID[COG5518] Bacteriophage capsid portal protein 
TIGRFAM ID[TIGR01540] phage portal protein, PBSX family 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAAGA GTAAGAAGAA CCGCGCTGCG GCGACGAAAC AGATCCAGCT TAAAAGTCAA 
ACTACAGCCG AAGCATTCAG CTTCGGCGAT CCCGTTCCTG TTCTGGACCG CCGAGAACTG
CTGGATTATG TGGAATGCGT ACAGATGGAC CGCTGGTATG AGCCGCCCGT CAGCTTTGAC
GGACTGGCGC GCACCTTCCG CGCTGCCGTG CATCATAGTT CCCCGATTGC AGTAAAGTGC
AACATTCTGA CCAGCACCTA CATCCCTCAC CCGCTGCTCA GCCAGCAGGC TTTTTCGCGT
TTTGTGCAGG ACTATCTGGT ATTTGGTAAC GCCTACCTGG AGAAACGCAC GAACCGCTTC
GGTGAAGTTA TCGCCCTTGA ACCTGCCCTG GCAAAATATA CCCGACGCGG GTTAGACCTG
GATACCTACT GGTTTGTGCA ATACGGTATG ACCACGCAGC CATATCAGTT CACGAAAGGC
AGCATCTTTC ATCTGATGGA ACCGGACATC AACCAGGAGA TCTACGGCCT GCCCGGTTAT
CTTTCTGCCA TTCCGTCAGC CCTGCTCAAC GAGTCCGCCA CGCTGTTCCG CCGAAAGTAT
TACATTAACG GCAGTCATGC TGGCTTCATC ATGTACATGA CCGATGCTGC GCAGAACCAG
GAGGATGTGA ACAACCTCCG CAACGCAATG AAAAGCGCCA AAGGTCCAGG CAACTTCCGC
AACCTGTTTA TGTACTCACC TAACGGCAAA AAGGATGGTC TTCAGATTAT CCCGTTGTCA
GAAGTCGCGG CGAAGGATGA ATTTCTGAAT ATCAAAAATG TCAGCCGCGA CGACATGATG
GCTGCGCACC GTGTACCGCC ACAAATGATG GGGATAATGC CTAATAATGT TGGGGGATTT
GGGGATGTGG AGAAAGCCTG CAAAGTATTT GTTAGAAATG AGTTAACAGT ATTACAAAAA
AAAATACTGG AACTGAACAC TTGGTTAGAT GATGATGTAA TTAATTTTAA TGAGTATATG
CTTTGA
 
Protein sequence
MGKSKKNRAA ATKQIQLKSQ TTAEAFSFGD PVPVLDRREL LDYVECVQMD RWYEPPVSFD 
GLARTFRAAV HHSSPIAVKC NILTSTYIPH PLLSQQAFSR FVQDYLVFGN AYLEKRTNRF
GEVIALEPAL AKYTRRGLDL DTYWFVQYGM TTQPYQFTKG SIFHLMEPDI NQEIYGLPGY
LSAIPSALLN ESATLFRRKY YINGSHAGFI MYMTDAAQNQ EDVNNLRNAM KSAKGPGNFR
NLFMYSPNGK KDGLQIIPLS EVAAKDEFLN IKNVSRDDMM AAHRVPPQMM GIMPNNVGGF
GDVEKACKVF VRNELTVLQK KILELNTWLD DDVINFNEYM L