Gene Ent638_2239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_2239 
Symbol 
ID5111220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp2428501 
End bp2430159 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content56% 
IMG OID640492423 
Productphage terminase 
Protein accessionYP_001176962 
Protein GI146311888 
COG category[R] General function prediction only 
COG ID[COG4626] Phage terminase-like protein, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.478973 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACAGT GGACAACAGC ATGCCCCGAC TGGGAATCCC TCCTGGTCGC AGGGGCGTCC 
ATTATTACGC CTCCGATCTT CCCTGACCAG GCAGAGCAGG CGCTGGGTAT TTTCCGGGAA
CTGCGTGTTT CCGACCTCCC GGGCAAACCC ACGTTCGGTG AGTGTTCTGA GGCCTGGGTG
TTTGACTTCG TGAAAGCCAT TTTCGGCGGT TATGAGGCTG ATACGGGAAA CCAGCTGATC
CGGGAATATG GTTTGCTGAT TTCCAAGAAG AACACCAAAT CGACCATCGC GGCGGGCATT
ATGCTGACCG CGCTGATCTT GTGCTGGCGT GAGGACGAGG AGCACCTGAT CCTGGCACCA
ACCAAGGAGG TCGCCGATAA CAGCTTTAAG CCTGCCGCCG GCATGATCCG CGCCGACGAA
GAATTGACCG ATATGTTCCA GATACAGGAT CATATTCGCA CCATTACCCA CCGGGTGACG
CGCAATACAT TAAAAGTTGT GGCTGCGGAT ACCGACACCG TGTCCGGTAA GAAGTCCGGT
CGCATCCTCG TGGATGAACT CTGGTTGTTC GGTAAGCGGG CGAACGCGGA AGCCATGTTT
ATGGAAGCGC TTGGCGGGCA GGTATCACGT AATGAAGGAT GGGTGATCTA CCTCACAACG
CAAAGTGATG AACCACCGGC GGGCGTGTTT AAAGAACGTC TAGATTACTG GCGCAATGTG
CGCGACGGCA AAATCATCGA TCCGAAAACG CTGGGCATTC TTTATGAGTT CCCGGAGAGC
ATGATCGATA GTAAGGCCTA TCTTGCACCT GAAAATTTCT ATATCACCAA CCCGAACATC
GGCCTGTCTG TCAGCCCCGA ATGGATAGCC GACAATCTCC GCAAGAATCA GGCAAAAACT
GATGGCACGC TGCAGCAGTT TCTGGCGAAG CACCTCAACA TTGAGATCGG CCTGAACCTG
CGAACCGACC GCTGGGCGGG TGTCGATTTC TGGGAGCAGC AGGCGCAGCG CGTAAGTTTT
GAAGATTTAC TGCGGCGCGC CGAGGTCATC ACTGTCGGGA TAGACGGCGG GGGGCTTGAT
GATCTGCTGG GCTTTTCAGC TATCGGACGT GACGCGGATA CGCGTGAATG GCTGTGCTGG
TGTCATGCCT GGGCGCATGA AATAGCGATC AGGCGTCGCA AAAGTGAAGA GTCAAGATTC
AACGATTTCG TGAAGGCCGG CGACCTTACC ATTGTGAAGC GTGTCGGTCA GGATACCGAA
GAAGTAGCGG AATATGTCAG CCGGATCCAC GTCGCGGAGC TGCTGGACAA GATAGGCATT
GACCCCTCAG GGGTCGGACA AATCCTTGAC GCGCTGATTG AGGCGGACAT TCCCGCCGAT
GCGGTGGTCG GCGTGAGTCA GGGCTGGCGC CTTGGTGGTG CGATCAAAAC CACAGAGCGC
AAGCTTGCCG AGGGGGTGCT GATCCATGCC GGACAGCCAC TGATGGCATG GTGCGTGGGT
AATGCCAGGG TTGAACCGAA GGGCAACGCC ATTCTCATCA CCAAACAGGC CAGCGGCAAG
GGCAAGATTG ACCCGCTTAT GGCGCTGTTC AACGCGGTAT CGCTGATGGC CCTTAACCCT
GAGGCGAAAA AACAGGACTA CCAGGTACTT TTCATATGA
 
Protein sequence
MAQWTTACPD WESLLVAGAS IITPPIFPDQ AEQALGIFRE LRVSDLPGKP TFGECSEAWV 
FDFVKAIFGG YEADTGNQLI REYGLLISKK NTKSTIAAGI MLTALILCWR EDEEHLILAP
TKEVADNSFK PAAGMIRADE ELTDMFQIQD HIRTITHRVT RNTLKVVAAD TDTVSGKKSG
RILVDELWLF GKRANAEAMF MEALGGQVSR NEGWVIYLTT QSDEPPAGVF KERLDYWRNV
RDGKIIDPKT LGILYEFPES MIDSKAYLAP ENFYITNPNI GLSVSPEWIA DNLRKNQAKT
DGTLQQFLAK HLNIEIGLNL RTDRWAGVDF WEQQAQRVSF EDLLRRAEVI TVGIDGGGLD
DLLGFSAIGR DADTREWLCW CHAWAHEIAI RRRKSEESRF NDFVKAGDLT IVKRVGQDTE
EVAEYVSRIH VAELLDKIGI DPSGVGQILD ALIEADIPAD AVVGVSQGWR LGGAIKTTER
KLAEGVLIHA GQPLMAWCVG NARVEPKGNA ILITKQASGK GKIDPLMALF NAVSLMALNP
EAKKQDYQVL FI