Gene Ent638_2236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_2236 
Symbol 
ID5111217 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp2424954 
End bp2426297 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content52% 
IMG OID640492420 
ProductHK97 family phage portal protein 
Protein accessionYP_001176959 
Protein GI146311885 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.759812 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGAATC CTTTCCGGAA AAAAGAGAAG GCGCTGCAAC AACCCGCCAG CCAGGGCGCG 
TGGACTTCAC TGTTAAGTTT TGTCCGTGAA CCCTTTGCCG GGGCCTGGCA GCGTAATCTT
GAAATTAATC AGAACACCGT GCTTTCCTTT CACGCTGTGT TCTCCTGCAT ATCCCTTATT
GCCAGTGATA TTGCCAAGAT GCCGCTGCGA ATGATGCAAC GTGACTCAAA GGGTATCTGG
AAAGAAAGCA ACAGCGGTAA AGTCGCGGCG ATTTACAGGC GCCCCAATGC ATTCCAGAAC
CGCATTCAGT TTTTTGAATG CTGGCTTAAC TCGAAGCTTT GCCACGGCAA TACCGTTGCC
CTGAAGATAC GTAATTCCCG CGGCGAAATC ACAGAGCTTC GCATTCTGGA CTGGAACAAA
GTGACGCCAC TGGTGGCAGG CGATGGCTCA GTTTTTTACC AGATAAACCC AGACAATATG
ACAGGGGTTG AATCATCGGT CACCGTTCCG GCGCGCGAAG TTATCCACGA CCGTTTCAAC
TGTCTTTTTC ATCCGCTTGT CGGACTGTCA CCCATCTATG CTGCCGGACT GGCTGCGATG
CAGGGTCATC ATATCCAGGA AAACTCAGCC TTCTTCTTCC GCAACGGTAG TAAGCCGAGC
GGGGTTATTG AAGTGCCTGG CAACATCACC GAGGAAAATG CGCGGATCCT TAAAGCGAAC
TGGGATACGG GCTACACAGG TGAGAATGCA GGTAAAACGG GCCTGCTGAG CAACGGTGCC
AAGTACAACA CGGTTTCCAT GTCAGCTGAT GATGCGAAGA TGGTCGAGCA ACTCCAGATG
TCAGCGAAAA TCGTTTGCTC GGCATTTCAT GTCCCCGCAT ATAAGGCCGG GATCGGTGAA
CTTCCTTCCT ATGACAATAT CGAAGCGCTG GAGCAGCAGT ATTACTCCCA ATGCCTGCAG
GCGCTAATTG AGTCGATAGA ACTGCTGCTG GATGAGGCAT TTGAACTGGA GGATGGTACC
GGCACCGAGT TTGATGTGAG TGCGCTGCTG CGTATGGACA GCGAACGCCG GATCAAAACG
CTTGGGGAAG GTGTAAAAAA CACTATTCTC ACGCCGAATG AGGCGCGGCG CAGTGAAAAT
CTGCCTCCGG TCACCGGCGG CGATGAACTG TATCTGCAGC AGCAGAACTT TAGCCTTGGC
GCACTGGCGC GCCGTGATGC GTCAGACGAT CCCTTTGGCA AAGCCAGCCA GCCATCACCA
TCAGCCAGTG AAGAAGGAAA GGCGTTATCA GAAGCTGAAC AATCGGCGGC CAAAGTCATG
ATCAGAGGAT TACTTATCAA ATGA
 
Protein sequence
MRNPFRKKEK ALQQPASQGA WTSLLSFVRE PFAGAWQRNL EINQNTVLSF HAVFSCISLI 
ASDIAKMPLR MMQRDSKGIW KESNSGKVAA IYRRPNAFQN RIQFFECWLN SKLCHGNTVA
LKIRNSRGEI TELRILDWNK VTPLVAGDGS VFYQINPDNM TGVESSVTVP AREVIHDRFN
CLFHPLVGLS PIYAAGLAAM QGHHIQENSA FFFRNGSKPS GVIEVPGNIT EENARILKAN
WDTGYTGENA GKTGLLSNGA KYNTVSMSAD DAKMVEQLQM SAKIVCSAFH VPAYKAGIGE
LPSYDNIEAL EQQYYSQCLQ ALIESIELLL DEAFELEDGT GTEFDVSALL RMDSERRIKT
LGEGVKNTIL TPNEARRSEN LPPVTGGDEL YLQQQNFSLG ALARRDASDD PFGKASQPSP
SASEEGKALS EAEQSAAKVM IRGLLIK