Gene Ent638_2600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_2600 
Symbol 
ID5113768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp2802925 
End bp2804151 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content51% 
IMG OID640492790 
ProductHK97 family phage portal protein 
Protein accessionYP_001177319 
Protein GI146312245 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0139387 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTATTTG ATGCCTTTTA CCGAAGTGAG TCGCTTGAAA ATCCCTCTAC ACCTATAACT 
GGCGACTCGG TCGATACGGA CGGGATCTTC AAGTCCGATG TATATGTCAG TCCTGAAACG
GCTATGAAAC TGGCAGCAGT CTACGCCTGT ATCTATGTCA TTTCTTCAAA TCTTGCTCAG
ATGCCACTGC ATGTAATGCG TAAGCACAAC GGCAAGGTTG AACCAGCGCG GGATCATCCT
GCGTTTTACC TGGTTCACGA CGAGCCGAAC ACCTGGCAAA CCAGCTACAA ATGGCGTGAG
CTGAAACAGC GTCACATCCT TGGCTGGGGT AATGGCTACA CCTGGGTAAA GCGCAGCCGC
CGCGGTGAGG TGACAGGTCT TGATTGCTGT ATGCCATGGG AAACGACCCT GATTAATACC
GGTGGTCGGT ATACCTACGG CTTGTACAAC GAAGAGGGTG CTTTCGCTAT TAGTCCTGAG
GATATGATTC ATATCCGCGC GCTGGGTAAC AATCAGAAAA TGGGGCTAAG TCCCATCATG
CAGCACGCCG AGACAATCGG CCTGGGGATG AGCGGACAGA AATACACGGA GAGTTTTTTC
AGCGGTAATG CGCGCCCAGC CGGGATAATT TCAGTTAAAA CCCCACTTCA GAGAGAAAGC
TGGGGTTGGC TGAAAGAGCA GTGGCAAAAA GCGTCACAGG CATTACGTAG CCAGGAAAAC
AAAACCCTGC TGCTACCAGC TGACCTTGAT TACAAAGCGC TGACTGTGTC TCCTGTTGAT
GCCCAGATCA TCGACATGTC AAAACTCAAC CGTTCCATGA TCGCCGGTAT TTTTAACGTG
CCGGCACACA TGATTAACGA CCTCGAAAAA GCCACCTTTA GCAATATCAC GCAGCAGGCC
ATTCAGTTTG TCCGCTACTC GATGATGCCC TGGGTGACGA ACTGGGAGCA GGAGCTTAAC
CGCCGCCTGT TTACCCGCGC CGAGCTGGCT GCTGGCTATT ACGTCCGTTT TAACCTGACT
GGCCTTTTGC GCGGTACGCC GCAGGAACGC GCCCAGTTCT ATCACTTCGC CATTACTGAT
GGCTGGATGA GTCGCAATGA AGCCCGCGCT TTCGAAGACA TGAATCCGGT TGACGGTCTG
GATGAAATGC TTGTCAGCGT GAATGCCGCT AACCCAGCGG ATGACTTTAA GAAACCAAAA
ACCGAAGAGG AAAAAACCGA TGAGTGA
 
Protein sequence
MLFDAFYRSE SLENPSTPIT GDSVDTDGIF KSDVYVSPET AMKLAAVYAC IYVISSNLAQ 
MPLHVMRKHN GKVEPARDHP AFYLVHDEPN TWQTSYKWRE LKQRHILGWG NGYTWVKRSR
RGEVTGLDCC MPWETTLINT GGRYTYGLYN EEGAFAISPE DMIHIRALGN NQKMGLSPIM
QHAETIGLGM SGQKYTESFF SGNARPAGII SVKTPLQRES WGWLKEQWQK ASQALRSQEN
KTLLLPADLD YKALTVSPVD AQIIDMSKLN RSMIAGIFNV PAHMINDLEK ATFSNITQQA
IQFVRYSMMP WVTNWEQELN RRLFTRAELA AGYYVRFNLT GLLRGTPQER AQFYHFAITD
GWMSRNEARA FEDMNPVDGL DEMLVSVNAA NPADDFKKPK TEEEKTDE