Gene EcHS_A2092 details

Gene Information

Locus tag: EcHS_A2092
Symbol:
ID: 5594161
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2076052
End bp: 2077626
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 54%
IMG OID: 640921233
Product: lambda family phage portal protein
Protein accession: YP_001458777
Protein GI: 157161459
COG category: [R] General function prediction only
COG ID: [COG5511] Bacteriophage capsid protein
TIGRFAM ID: [TIGR01539] phage portal protein, lambda family
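
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names and the helper `record_is_consistent` are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2076052
end_bp = 2077626
gene_length_bp = 1575
protein_length_aa = 524

# Assumption: coordinates are 1-based and inclusive,
# so the span covered is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS contains exactly one stop codon,
# which is counted in the gene length but not in the protein.
codons = gene_length_bp // 3
expected_protein_length = codons - 1


def record_is_consistent() -> bool:
    """Return True when coordinates, gene length and protein length agree."""
    return (
        span == gene_length_bp
        and gene_length_bp % 3 == 0
        and expected_protein_length == protein_length_aa
    )


print(span, codons, expected_protein_length, record_is_consistent())
```

Under these assumptions the three length-related fields agree: the span equals the stated gene length, and the codon count minus the stop codon equals the stated protein length.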


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value: 0.778496
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGACGG GCGGTAGCCG GGTGCCATAT GACGCCGGGG ATTCCTTCAG CGATCAACTG 
GCGAACTGGC AGCCCGCACT ATGGTCACCG GATAACGAAA TTAATATCTA CCGTGACCGT
ATAGTTTCCC GTGCGCGGGA TCTGGTCCGA AATGATGGAT GGGCCAATGG TGCCATAACT
CGCCTGCTTG ATAATGCGGT CGGCGCCAAT TTCCGTCCGA TCATGAAGCC TGACTATCGT
GTATTACGGA TGATGACAGG TAATAAAAGT TTTGACGCAG TATGGGCGGA AGAGTACGGA
AAAGCACTCG CTTCCCACTG GAGAACCTGG GCATACGACA CAGGCCGTTA TTGTGACGTT
GAGCGCAAGC TAACCGTTCC ACAAATGTTG CGTCTGGCCT TTCGCCACAA GCTGATTGAT
GGCGATGCCC TGATGGTGCT TCAGTATCGC ACCGATCGCC TTGGACCAGG TAAGGGGCGT
TATGCCACGA CGGTGCAGGT TGTTGATCCC GACAGACTCA GTAACCCGCA GCAGAATTTT
GATATGCCGA ATATCCGCGG CGGCGTTGAA ATTGATGCTG ACGGCGCACC TGTGGCTTAT
CACATACGTG AAGCACATAT CGGTGACTGG TGGAGTGGCG CCAAAACAAT GACATGGCGA
CGAATCCCGC GCGAAACCGC ATGGGGGCGC CCGCACGTTG TGCACGACTT TGACCATGAG
CGTGGAGCTC AGCATCGGGG GAATGGCATT CTGACTCCAG TAGTGCAACG TCTGAAGATG
CTGGTGAAGT ACGACCAGAG CGAGCTGGAA GCAGCAATTC TGAATGCTAT CTTCGCCGCG
TATATTGAGT CTCCATACGA TCCCGAAATG ATCCAGTCCG CGCTGGGGGA AAACTTCGAA
GAGGGATTGG GAGCATACCA GGATGGTCGT GCAGAGTTTC ATAATGATCG CCGTTTGACG
CTGCAGAATG GCGCCCGTAT GCCGATCCTT TATCCAGGGG AGAGAATAAC AACGGTCAAC
GCTGCCCGCC CTTACAGCAA CTTTGAGGTT TTCGAGTCTG CAGTATTGCG TAATTTCTCA
TCCGGTACGG GGTTATCTCC TCAGCAGGTT ACACAGGACT GGTCTGATGT GAATTACAGC
TCTGCGCGAT CTTCCTTGCT GGAGGCATGG AAAACGCTCA CCCGCCGACG TGATGATTTC
TCCATGGGTA CCGCTCAGCC GCTTCTGACG GCCTTTGTGG AGGAAGTTCA CGATAACGAG
GATTTACCTC TACCTAATGA TGCCCCTGAT TTTGTTGATG CCCGGGCAGC GTATTCCCGT
GCGCGCTGGA TGGGGCCGGG GCGAGGATGG GTTGATCCGG TGGCAGAGAA AAAAGGCGCC
ATTCTCGGCC TCGATGCCGG CCTTTCCACT CTCGAAATTG AAGTGGGTGA AAACGTGGGT
GAGGACTGGG AAGAGATACT TGATCAGCGC CAGCGGGAAA TTGAGTCCTG CCTGAAGCGC
GGACTTCCAT TACCTAGCTG GGCGCAGGCG GACCAGTTCG CCAGCCAGAC AATTACCGAT
CCGGAGGAAA AGTGA
 
Protein sequence
MLTGGSRVPY DAGDSFSDQL ANWQPALWSP DNEINIYRDR IVSRARDLVR NDGWANGAIT 
RLLDNAVGAN FRPIMKPDYR VLRMMTGNKS FDAVWAEEYG KALASHWRTW AYDTGRYCDV
ERKLTVPQML RLAFRHKLID GDALMVLQYR TDRLGPGKGR YATTVQVVDP DRLSNPQQNF
DMPNIRGGVE IDADGAPVAY HIREAHIGDW WSGAKTMTWR RIPRETAWGR PHVVHDFDHE
RGAQHRGNGI LTPVVQRLKM LVKYDQSELE AAILNAIFAA YIESPYDPEM IQSALGENFE
EGLGAYQDGR AEFHNDRRLT LQNGARMPIL YPGERITTVN AARPYSNFEV FESAVLRNFS
SGTGLSPQQV TQDWSDVNYS SARSSLLEAW KTLTRRRDDF SMGTAQPLLT AFVEEVHDNE
DLPLPNDAPD FVDARAAYSR ARWMGPGRGW VDPVAEKKGA ILGLDAGLST LEIEVGENVG
EDWEEILDQR QREIESCLKR GLPLPSWAQA DQFASQTITD PEEK
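
The sequence blocks above are printed in groups of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, its GC content computed, and its translation compared with the protein sequence. The function names `clean_sequence`, `gc_content` and `translate` are hypothetical helpers, not part of any tool behind this record. The codon table is built from the standard amino acid assignments, which translation table 11 shares; alternative start codons are not handled, which is fine here only because the gene starts with ATG.

```python
from itertools import product

# Codon order: T, C, A, G at each position, last position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last fragments of the gene sequence, as printed in the record.
head = clean_sequence("ATGTTGACGG GCGGTAGCCG GGTGCCATAT")
tail = clean_sequence("CCGGAGGAAA AGTGA")
print(translate(head))  # MLTGGSRVPY
print(translate(tail))  # PEEK
```

Applied to the full gene sequence, the same functions give a figure to compare against the stated 54% GC content and a translation to compare against the 524 aa protein sequence.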