Gene EcHS_A2094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2094 
Symbol 
ID5594163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2077958 
End bp2080084 
Gene Length2127 bp 
Protein Length708 aa 
Translation table11 
GC content55% 
IMG OID640921235 
ProductPhage terminase large subunit (GpA) 
Protein accessionYP_001458779 
Protein GI157161461 
COG category[R] General function prediction only 
COG ID[COG5525] Bacteriophage tail assembly protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.0401019 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCACAA ACATATCTCA GTCCTCGGAG AACCAGAGTT TAACCCAGCG GAAGATTGAG 
CGTCTTCAAC TGAGTGTCCG GAAAGGGTGG ACACCGCCGC CGCGGATCAG CGTCCCGCAA
TGGGCCGATG ACTACCGGAA GCTGGCGAAA GAAGCTGGCA GTACCTCCGG GAACTGGGAA
ACATCAACGG TTGAAATTGC CCGCGGTCCT ATGCTGGCCG CGACGGAATC GGGCGTCCAC
ATTATCACCG TGATGTGCTG TACCCAGTTA ATGAAAACCG CGCTGCTGGA AAACCTGTTT
GGTTATTTTG CGCACCTCGA CCCATGTCCG ATTTTGCTCC TGCAGCCGAA GGAAGAGGCC
GCTGAGCAGT TTTCCAAAGA ACGCATCAGC CCGCTGGTTA GGGTAACGCC AGTTCTGCGT
AACATCATAG GTGACTCAAA GCAGAAGAGT TCAAAAGAAA CCATTCTGTA TAAAGCTTTC
ACCGGCGGAT TTCTGGCGCT GGCCGGCGCC GGTAGTCCAG ATAACCTTGC GCGCCGTCCG
ATCCGTGTTC TGCTGGCAGA TGAGGTGGAT AAATATCCGA TTACCCGTGA GGGCGATCCC
ATCGCTCTGG CGGAAGAGCG AACCGCCACA TTTGGCCTTA ACTGGCTGTC CGTGCGGGCT
TGTTCGCCGA CGATCGAAGA TGAAAGCCGG ATTGCTGACA GTTACGAAGA TTCAGATCAG
CGGCGGGCCT CTGTAGTTTG CCCCCATTGC GGGCACCGAC AGTTCCTTGA TTTCTTCAAA
CATGTTCAAT GGCCAAAAGA AGGTGATAAG CACCTGACCA AAGCGGCCAT GATCCATTGT
GAATGTTGTG GTGCTGGCTG GTCAGAGGGT GAGCGTCTGC GGGCATTACA GACAATCCGC
TGGCATCAGA CCAAACCGTT TGAATGTTGT GGTTCCCGCC ATTCACCATT AATGGAATAC
GACCAGAAAT GGCATGAAGG AGACGAGGGC AGTGTTGATG CCGTCTGGCG CTGGTCAGAG
TCGGAACGGC ATGCCGTATA CCGGGCGATT TGCCCGGACT GCGGGGCCGA GGCGCTGGAT
AATCACCACG CCGGGTACCA GGCGTCAAAA CTCTTCAGTC CCTGGCAAAA AGACAAGCCA
TCGGACATTG CAAAGAAATA CCTCGATGCG AAAGGGGATC CGGATAAGGA ACAGGCCTGG
TGGAACACCC AGATGGGGTT GCCGCACCGG CCTAACCACG GGAAACAGCT CCCGGTTGAT
GTCCTGCTGG CGCGCCGTGA AGTCTTCCCG GCCGTCGTTC CTGATGGCGT GGCATTGTTA
ACTGCGGGCG TCGATACTCA GGATGACCGA TTCGAAATCA CGATCACTGG CTGGGGGCGG
GACGAGGAAT CGTGGTCAGT TGCGCATGAC GTCATTTATG GCGATCTGGA GACTGAGGAA
CCCTGGAAGC GCCTCGATGC GTACCTGAAA CAGATATGGC GACGCGGCGA CGGGCGAGGG
CTGAATATTC TGGCTGCATG TATGGACTCC GGCGGTCACC ACACGCAAAA GGTTTATGAG
TTCTGCAAAG ATCGCCTTGG GCGCCGCGTC TGGGCTATCA AGGGCGAATC TGCGCAGGGT
GGGAAACGCA ACCCCGTCTG GCCAACCAAG CGACCGACAT CAAAAAGTAA AGCCAGCTTC
AGGCCAATTA TACTTGGCGT GAACTCTGCG AAAGATGTTG TCCGTGGTCG TCTGCATCTT
GAACCGCCAG CTTTAGGTAC TGCCGGTGCG GGCTATATGC ACTTCCCGGA TGATCGTGAC
CTCGGCTATT TCAACCAGCT GCTGGCCGAG CGACTGGTTT ATAAAGTGGT GGCCGGTCAG
CGATTCAGTG TCTGGGAGCC TATCCCCGGA CGGGCGAACG AAGCACTCGA CTGCCTCGTT
TACAGCTATG CCGCGTTGTG TGGGCTGAAA CATATGGGAT TAAAACTCAA TGTTCGGGCC
GCTAACCTTC AGGCCGATCC CGATAAGTTC CTGCCGGCGC CAGCCGAGCC AGAAGAAAAA
ATCAATTACG AATTACCGGG TGCCATCGTG GATGAGGCTA TGGCTCCTGT TAAGCGTAAG
AACATTTCTA AACTCCTGCC GCAATAA
 
Protein sequence
MSTNISQSSE NQSLTQRKIE RLQLSVRKGW TPPPRISVPQ WADDYRKLAK EAGSTSGNWE 
TSTVEIARGP MLAATESGVH IITVMCCTQL MKTALLENLF GYFAHLDPCP ILLLQPKEEA
AEQFSKERIS PLVRVTPVLR NIIGDSKQKS SKETILYKAF TGGFLALAGA GSPDNLARRP
IRVLLADEVD KYPITREGDP IALAEERTAT FGLNWLSVRA CSPTIEDESR IADSYEDSDQ
RRASVVCPHC GHRQFLDFFK HVQWPKEGDK HLTKAAMIHC ECCGAGWSEG ERLRALQTIR
WHQTKPFECC GSRHSPLMEY DQKWHEGDEG SVDAVWRWSE SERHAVYRAI CPDCGAEALD
NHHAGYQASK LFSPWQKDKP SDIAKKYLDA KGDPDKEQAW WNTQMGLPHR PNHGKQLPVD
VLLARREVFP AVVPDGVALL TAGVDTQDDR FEITITGWGR DEESWSVAHD VIYGDLETEE
PWKRLDAYLK QIWRRGDGRG LNILAACMDS GGHHTQKVYE FCKDRLGRRV WAIKGESAQG
GKRNPVWPTK RPTSKSKASF RPIILGVNSA KDVVRGRLHL EPPALGTAGA GYMHFPDDRD
LGYFNQLLAE RLVYKVVAGQ RFSVWEPIPG RANEALDCLV YSYAALCGLK HMGLKLNVRA
ANLQADPDKF LPAPAEPEEK INYELPGAIV DEAMAPVKRK NISKLLPQ