Gene EcHS_A0309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0309 
Symbol 
ID5593467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp312144 
End bp313559 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content51% 
IMG OID640919495 
ProductPBSX family phage terminase large subunit 
Protein accessionYP_001457081 
Protein GI157159763 
COG category[R] General function prediction only 
COG ID[COG1783] Phage terminase large subunit 
TIGRFAM ID[TIGR01547] phage terminase, large subunit, PBSX family 


Plasmid Coverage information

Num covering plasmid clones61 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCTCGA TTAATCCTAT CTTTGAACCG TTCATTGAGG CGCATCGCTA CAAAGTCGCC 
AAAGGCGGTC GAGGTAGCGG TAAGTCATGG GCAATTGCGA GGCTGCTTGT TGAAGCGGCG
CGTCGGCAGC CAGTGCGTAT TCTTTGCGCT CGTGAACTGC AAAACAGTAT CAGCGATTCG
GTAATCCGGT TGCTTGAAGA TACCATCGAG CGTGAAGGGT ATTCGGCTGA GTTTGAAATT
CAGCGTTCCA TGATTCGTCA TCTCGGAACG AATGCTGAAT TCATGTTCTA CGGCATCAAA
AACAACCCGA CGAAGATTAA ATCGCTCGAA GGCATTGATA TCTGCTGGGT GGAAGAAGCG
GAAGCGGTAA CGAAGGAATC ATGGGATATC CTGATACCAA CCATCCGTAA GCCGTTCTCT
GAAATATGGG TGAGCTTTAA CCCGAAGAAC ATACTCGACG ATACCTATCA GCGATTCGTT
GTAAATCCTC CCGATGATAT TTGCCTGCTG ACGGTGAACT ACACCGACAA CCCGCACTTT
CCTGAAGTTC TCCGTCTGGA GATGGAAGAG TGTAAACGCA GAAACCCGAC ACTGTATCGT
CACATCTGGC TTGGTGAGCC AGTAAGCGCA AGTGATATGG CAATCATCAA ACGTGAATGG
CTTGAAGCCG CAACCGATGC GCACAAGAAA CTCGGATGGA AAGCGAAAGG CGCTGTTGTC
TCTGCACATG ATCCATCAGA TACAGGGCCA GATGCTAAAG GTTACGCATC GCGCCACGGT
TCGGTAGTTA AGCGCATTGC CGAAGGTCTG CTGATGGACA TCAACGATGG TGCTGACTGG
GCTACTTCGC TGGCGATTGA AGACGGCGCT GACCACTACC TGTGGGATGG TGATGGTGTT
GGTGCCGGGC TACGCAGACA GACAACGGAA GCGTTCTCCG GCAAGAAAAT CACCGCCACG
ATGTTCAAGG GCAGCGAATC GCCATTCGAT GAAGATGCAC CGTATCAGGC CGGAGCATGG
GCCGATGAAG TCGTACAGGG CGACAACGTT CGCACTATTG GCGATGTATT CCGCAATAAG
CGAGCGCAAT TCTATTACGC GCTGGCTGAC AGGCTGTATC TGACATATCG GGCGGTTGTT
CATGGTGAGT ATGCAGACCC CGACGACATG CTGAGTTTCG ACAAAGAAGC GATAGGCGAG
AAGATGCTGG AGAAGCTGTT TGCAGAACTG ACGCAGATTC AGCGCAAATT CAATAACAAC
GGGAAGCTGG AGCTAATGAC TAAGGTCGAA ATGAAGCAGA AGCTCGGTAT TCCATCTCCT
AACCTGGCTG ATGCGCTGAT GATGTGTATG CATTGCCCGG AGTCGGCTGC GCAACCCGAC
TATTCCAGTT ACTCAATTCC TTGTGGTGTA GGTTGA
 
Protein sequence
MTSINPIFEP FIEAHRYKVA KGGRGSGKSW AIARLLVEAA RRQPVRILCA RELQNSISDS 
VIRLLEDTIE REGYSAEFEI QRSMIRHLGT NAEFMFYGIK NNPTKIKSLE GIDICWVEEA
EAVTKESWDI LIPTIRKPFS EIWVSFNPKN ILDDTYQRFV VNPPDDICLL TVNYTDNPHF
PEVLRLEMEE CKRRNPTLYR HIWLGEPVSA SDMAIIKREW LEAATDAHKK LGWKAKGAVV
SAHDPSDTGP DAKGYASRHG SVVKRIAEGL LMDINDGADW ATSLAIEDGA DHYLWDGDGV
GAGLRRQTTE AFSGKKITAT MFKGSESPFD EDAPYQAGAW ADEVVQGDNV RTIGDVFRNK
RAQFYYALAD RLYLTYRAVV HGEYADPDDM LSFDKEAIGE KMLEKLFAEL TQIQRKFNNN
GKLELMTKVE MKQKLGIPSP NLADALMMCM HCPESAAQPD YSSYSIPCGV G