Gene Ent638_2238 details


Gene Information

Locus tag: Ent638_2238
Symbol:
ID: 5111219
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 2426502
End bp: 2428442
Gene Length: 1941 bp
Protein Length: 646 aa
Translation table: 11
GC content: 57%
IMG OID: 640492422
Product: peptidase U35, phage prohead HK97
Protein accession: YP_001176961
Protein GI: 146311887
COG category:
COG ID:
TIGRFAM ID: [TIGR01543] phage prohead protease, HK97 family
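
The coordinate and length fields in this record are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon; both are assumptions that happen to match the numbers listed here, not conventions stated by the record. The function names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 2426502
END_BP = 2428442


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1941 646
```

Both values agree with the Gene Length and Protein Length fields.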


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCTTA ATCGCGCATG CACCCTCATG ACGGTGAAGG CAGTAAACGA GGATGAGCGG 
GTTATTACCG GCGTCGCCTC CACGCCGTCG CCAGACCGTG ACGGGGACAT TATGGAGCCG
GAAGGGGCTA AATTCCGCAG CGACACGCCG TTCCTCTGGC AGCATGACCG CTCCCAACCT
ATCGGCACCT GTACCCCAAA AATGGTGAAG GGCGGGCTGG AAATCACTGC AAAACTTGTG
AAGCCAACCC CGGATATGCC GTCCCAGTTG GTGGCCCGCC TCGATGAGGC CTGGGCATCC
ATTAAAGCCG GGCTGGTTCG TGGTCTCTCT ATCGGCTTTC GCCCGATCGA ATACTCCTTT
CTGGATGAAG GCGGCATCCG CTTTTTGTCC TGGGATCTTC TTGAAGTCTC GGCAGTGACC
ATTCCGGCGA ACGCCGAATG CTCGATCAAT ACCGTTAAAT CTTTCGACCG CCAGTTACTC
GCCGCGGCAG GCAATGAAAA ACCGGTGGTT AAAGCAACTC AATCCGTTGG CGCTACAGCA
CTTAAAACCA ATATCAAAAA AGGAAATAAT CCTATGAATA TCGCAGAACA GATCAAAAGC
TTTGAAGCGA AGCGTTCGGC GCTGGCGGCG TCTCTCTCTG ACATTATGGC GAAGGCTGCC
GAAGCCGGGC GTACGCTGGA TACGGAAGAA GAAGAGAGCT ACGACAACAC CTCCGCCGAA
ATCAAATCCG TGGATGCGCA CCTCAAACGC CTGCGCGACA TGGAATCCAG CATTGCCCTT
ACCGCCAAAC CGGTCAGCAA AGCCGCCGGC GGTGATGTAG CCGTAGTGAC GACAAGCGCG
CCGGGCATCA TCCGCGTTGA GCAGAAACTG GAAAAAGGTA TCGCTTTCGC CCGCTTCGCC
AAAGCGCTGG CCGCTGCGAA CGGCAGCCGC TCCGAGGCGC TGGAAATCGC GCGTAAACAG
TACCCGGATG ATTCGAAACT GCATCACGTC CTCAAAGCAG CCGTCGGCGC AGGCACAACC
ACTGACCCAA CCTGGGCTGG CTCGCTGGTT GAATATCAGG AATATGCGCA GGACTTTGTG
GAGTTCCTGC GACCACAGAC CATCATTGGC CGCTTCGGAC AGGGTAATAT CCCGGCGCTG
CGCCGGGTGC CATTTAATAT CCGCATTCCG GCGCAGACTT CCGGCGGTTC AGCCAACTGG
GTAGGCCAGG GTAAGGCGAA GCCTCTGACG AAATTTGATT TCGAATCGAT CACCTTCAGC
TTTGCGAAAG TGGCCGCCAT CGCGGTGCTG ACCGACGAGC TGATCCGTTT CTCTAACCCG
GCGGCGGACG CGCTGGTGCG TAATGCGCTT GCTGAAGCTG TTATTGCGCG TCTGGATACT
GACTTCATTA ATCCAGCCAA GGCAGAGGTC GCTAACGTCT CTCCTGCCTC AGTAACTAAC
GGTATTTCAG CCATTCCCTC TACCGGCGAT CCGGATGCAG ACGCTGAAGC CGCATTCGCT
CAGTTTGTTG CAGCGAACCT GCAGCCAACT GGCGGGGTGT GGATTATGTC CAGCACTAAC
GCACTAGCAC TGTCTATGAA GAAAAATGCG CTGGGGCAGA AGATGTACCC GGAAATGACA
CTGCTGGGCG GCACCTACCA GGGACTTCCG GCGATCGTTT CCCAGTACGC TGGTACCAAC
CTTACGCTCC TCAACGCGCC TGATATTTAT CTGGCGGACG ATGGTGGCGT GGCGGTGGAC
ATGTCCCGTG AAGCATCACT TGAAATGCAA AGCGATCCGA CCGGGGACAG CGTCAACGGC
ACGGGTACCG AGCTGGTTTC CATGTTCCAG ACCAACAGCG TGGCTATCCG TGCCGAGCGC
TGGATCAACT GGAAACGCCG CCGTACTGCT GCCGTTGCGG TAATTTCCGG CGTGAATTAC
GGCACCACCC AGACCAGCTA A
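
The gene sequence is displayed in space-separated blocks of 10 bases, 60 per row, so it needs to be cleaned before any computation. The following is a minimal sketch of how a GC fraction like the one in the GC content field could be computed; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the display whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence as displayed in the record.
first_row = "ATGACGCTTA ATCGCGCATG CACCCTCATG ACGGTGAAGG CAGTAAACGA GGATGAGCGG"
seq = clean_sequence(first_row)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Passing the full sequence text instead of `first_row` gives the value for the whole gene.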
 
Protein sequence
MTLNRACTLM TVKAVNEDER VITGVASTPS PDRDGDIMEP EGAKFRSDTP FLWQHDRSQP 
IGTCTPKMVK GGLEITAKLV KPTPDMPSQL VARLDEAWAS IKAGLVRGLS IGFRPIEYSF
LDEGGIRFLS WDLLEVSAVT IPANAECSIN TVKSFDRQLL AAAGNEKPVV KATQSVGATA
LKTNIKKGNN PMNIAEQIKS FEAKRSALAA SLSDIMAKAA EAGRTLDTEE EESYDNTSAE
IKSVDAHLKR LRDMESSIAL TAKPVSKAAG GDVAVVTTSA PGIIRVEQKL EKGIAFARFA
KALAAANGSR SEALEIARKQ YPDDSKLHHV LKAAVGAGTT TDPTWAGSLV EYQEYAQDFV
EFLRPQTIIG RFGQGNIPAL RRVPFNIRIP AQTSGGSANW VGQGKAKPLT KFDFESITFS
FAKVAAIAVL TDELIRFSNP AADALVRNAL AEAVIARLDT DFINPAKAEV ANVSPASVTN
GISAIPSTGD PDADAEAAFA QFVAANLQPT GGVWIMSSTN ALALSMKKNA LGQKMYPEMT
LLGGTYQGLP AIVSQYAGTN LTLLNAPDIY LADDGGVAVD MSREASLEMQ SDPTGDSVNG
TGTELVSMFQ TNSVAIRAER WINWKRRRTA AVAVISGVNY GTTQTS
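
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is an illustration, not the database's own pipeline: it builds the standard codon assignments, which translation table 11 shares, and stops at the first stop codon. It does not handle the alternative start codons that table 11 permits; that simplification is an assumption that is harmless here because this gene begins with ATG. `translate_cds` is a hypothetical helper name.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# assignments and differs only in which codons may act as starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the display whitespace
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence in the record.
print(translate_cds("ATGACGCTTA ATCGCGCATG CACCCTCATG"))  # MTLNRACTLM
print(translate_cds("GGCACCACCC AGACCAGCTA A"))           # GTTQTS
```

The two outputs match the first and last residues of the protein sequence shown in the record.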