Gene Ent638_0442 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_0442 
Symbol 
ID5113613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp498926 
End bp501310 
Gene Length2385 bp 
Protein Length794 aa 
Translation table11 
GC content58% 
IMG OID640490610 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_001175181 
Protein GI146310107 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.327931 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCGTT ACAACCTCCT GACCGCCGGG CTTTTGCTCG GCTCTTCTGC TCTTGCGGCA 
CCGGCGGGCG ATTTGCCCCT CATGCCGTGG CCTGCCCACG TTGAGCGCCC AACGACGCAA
GGCGCGCTGG TTCTGAACGA TAAACTTTCT GTCAGCGTGA GCGGTGACGA TCTCGGTGAT
GCCGTGGACC GTCTGCGTCA GCGCATCGCG CTGCAAACCG GCTGGACGCT TCAGCCGCAG
GCTGTGAATC CGGATAAACC CACCATTCGC ATCGCTATCG CCAAAAAAGT TAACCCGCAA
CCTCTGCCCG ACAGCGATGA ACGTTACACG CTCACCGTCG ACGCCAACGG CGTCAATATC
GCCGCCAACA CCCGATTTGG TGCGCTGCGG GCGATAGAAA CGCTGCTCCA GCTCATTCAA
AACGGCGCGG AAAACACCTC GCTGCCGTGG GTGAAAATTG AAGATGCCCC GCGCTTCCCA
TGGCGCGGTC TGCTGCTCGA CTCCGCGCGT CATTTCATCC CGCTTGAAGA TATCAAACGG
CAGATCGACG GCATGGCGGC CGCCAAACTG AACGTGTTGC ACTGGCATTT AACCGACGAT
CAGGGCTGGC GATTTGCCTC GAAACGCTAT CCAAAACTGA CGCAACTGGC GAGCGACGGA
CTGTTTTACA CCTCTGATCA GATGCGTGAC ATCGTGCGCT ACGCCACCGC GCGCGGCGTG
CGCGTGGTGC CAGAAATCGA CATGCCGGGC CACGCGTCGG CGATTGCCGT GGCCTATCCG
GAGCTCATAA GCGCACCAGG GCCGTATGAA ATGGAACGCC ATTGGGGGGT GTTGAAACCG
GTTCTCGATC CGACAAAAGA AGCGACGTAT GCCTTTGCTG AGGCGATGGT GAGCGAACTG
GCGGCGATCT TCCCCGATCC GTATCTGCAT ATCGGCGGCG ATGAAGTTGA CGATACGCAG
TGGAAAGAAA ACAAAGCCAT TCAGCAATTT ATGCGCGACA ACAAACTTGC GGACAGCCAC
GCTTTACAGG CGTATTTCAA CCGCAAGCTG GAAACGATCC TTGAAAAACA TCACCGCCAG
ATGGTCGGCT GGGATGAGAT TTACCATCCG GATCTGCCCA AAAGCATTCT GATTCAGTCC
TGGCAGGGGC AGGACGCGCT CGGCGAAGTG GCGAAGCAGG GTTACAAAGG CATTCTCTCC
ACCGGTTTTT ATCTCGATCA GCCGCAAAGC ACGGCCTATC ACTATCGCAA TGAAATCGTG
CCGCAAGGCT TAAACGGCGT GGATATTATC GCCGATAACG ACAGCGCACA AAGCTGGACA
TTCACCATGC CGCGCCTGAA AGGCAAGCCG GTTGAGGGCA GCTTTACGCT GGTGAAAGCG
GTTTCTGGCT GGCGCGGATT TATTGATTTC AACGGTAAAT CCCGGCGTGC GGTGAATAAT
ATTGAGTGGC GTGATGACAA TCAGGTGACG TTCACCGTTG ATACCTGGAT GGGCGAAACG
CGCCCGGTGG TGAACGTCGC GGACGACAAG CTGACGGGCT ATTTCCTGGT GGGTAACGCG
CGCTATCCGA TTTCCGGTGC GCGTCTGGAT GACGTACCAA AAGGCACGCA ACCGGTGGTG
CCGGATGCCG ATCAGCAGGC TAATCTGATG GGCGGCGAAG CGGCGCTGTG GGCGGAAAAC
GTGGTCGCAC CGGTGCTGGA TATCAAGCTG TGGCCGCGCG CGTTTGCGGT GGCGGAGCGT
CTGTGGTCCG CGCAGGACGT GAAGGATGTC GACAATATGT ACACCCGTTT GCAGGCGATG
GACACCTGGA CGACGGTATC GGTCGGCCTT CAGCAGCACA GCCAGCAGCA GGCGTATTTC
ATACGTCTGG CGAATACGAC CGAGACGCTG CCGCTGCAGA TTCTCGCGCA GGCGCTGGAG
CCGGCGCAGT ATTACACCCG TCAGCATCTC AAATTCCAGG CCGGAAATTA TCATCAGTTT
GAGCCGCTAA ACCGTTACGC CGATGCGCTG AGCGCGGAGA GCAACACCGT GCGCCAGATG
AACAAATGGG CCGAACGCCT GGTCAGCGAT GCGGAAGACA CCGAAAGCGC AGAGGCGCTG
CGCCACGTGT TTACCCGCTG GCAAAGCAAT ACCAGCGATG CGCTGGCGCT GAGTGACAAT
AATTATCAGC TCAAAGCCAT CAAGCCCGTT ATTCAGGAGG TGGATAAGCT GGCATCGATT
GGCCTGCGGT TGGCCGACCT GGTGGCGCGA CAGGGTACGC TGGATGACAA GGAGATCGCT
TCTATTCAGA AGGAGTTGGA TAAGGCCGCG GAGATTCAGG ATGAAGTGGT GATTGCGGCG
GTTTATCCGG TTGAGACGTT GCTAAGGGCG ACAAGGAATC AGTAA
 
Protein sequence
MLRYNLLTAG LLLGSSALAA PAGDLPLMPW PAHVERPTTQ GALVLNDKLS VSVSGDDLGD 
AVDRLRQRIA LQTGWTLQPQ AVNPDKPTIR IAIAKKVNPQ PLPDSDERYT LTVDANGVNI
AANTRFGALR AIETLLQLIQ NGAENTSLPW VKIEDAPRFP WRGLLLDSAR HFIPLEDIKR
QIDGMAAAKL NVLHWHLTDD QGWRFASKRY PKLTQLASDG LFYTSDQMRD IVRYATARGV
RVVPEIDMPG HASAIAVAYP ELISAPGPYE MERHWGVLKP VLDPTKEATY AFAEAMVSEL
AAIFPDPYLH IGGDEVDDTQ WKENKAIQQF MRDNKLADSH ALQAYFNRKL ETILEKHHRQ
MVGWDEIYHP DLPKSILIQS WQGQDALGEV AKQGYKGILS TGFYLDQPQS TAYHYRNEIV
PQGLNGVDII ADNDSAQSWT FTMPRLKGKP VEGSFTLVKA VSGWRGFIDF NGKSRRAVNN
IEWRDDNQVT FTVDTWMGET RPVVNVADDK LTGYFLVGNA RYPISGARLD DVPKGTQPVV
PDADQQANLM GGEAALWAEN VVAPVLDIKL WPRAFAVAER LWSAQDVKDV DNMYTRLQAM
DTWTTVSVGL QQHSQQQAYF IRLANTTETL PLQILAQALE PAQYYTRQHL KFQAGNYHQF
EPLNRYADAL SAESNTVRQM NKWAERLVSD AEDTESAEAL RHVFTRWQSN TSDALALSDN
NYQLKAIKPV IQEVDKLASI GLRLADLVAR QGTLDDKEIA SIQKELDKAA EIQDEVVIAA
VYPVETLLRA TRNQ