Gene TM1040_2739 details


Gene Information

Locus tag: TM1040_2739
Symbol:
ID: 4077611
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2883772
End bp: 2885655
Gene Length: 1884 bp
Protein Length: 627 aa
Translation table: 11
GC content: 61%
IMG OID: 638008064
Product: Beta-N-acetylhexosaminidase
Protein accession: YP_614733
Protein GI: 99082579
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3525] N-acetyl-beta-hexosaminidase
TIGRFAM ID:
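
The coordinate and length fields above are consistent with each other. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2883772, 2885655)
print(length)                     # 1884
print(protein_length_aa(length))  # 627
```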


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATTTTC ACCTCGATAG CCTCTGGCAT GCCGAAGAAG GCGCGATGGA GTTCGCGCTG 
ACCAATTGTG GCACAACCCC CGTGACCAAC CCGCGCCTCG TTTACGCAAC GCTCACGCGG
TGTTTGCGGC CTTCGAACTG CACCGGGGCA CGGCTTGTGC GGCGGCAGGC AAACTTTCAC
GAATACGCCT CGGACGAGGG ATTCGTGCTC GCACCGGGAG AGACTTGGCG GTTCACGGAG
CATAGCCTCA CCCGTCCGGC GCTTCATTCC AATGAGGGGC CAAAATCAGC GGGAGTCCTC
TTGGAGGATG ACACGCTCGT GCTGGCATTT GCCGGGGACT TGCAGGCCTC TGTGGTGGAA
GGTCACGCAA CAGCTGTGCA AAATGCGGCT CTAACATGCG GGATTCTGCC CGAACCCAAG
CGTGTCGCTA TCTCGAACTG GGCTGAGACC GCCCCTGTGC ATCTCGCGCT TCAGAGTGAT
GACGCAGCGG TGATGCAGTT GGTATCGCGC GTCTCTGAGC TGACTCGCCG TTTGCATCCG
CTTGCGCCGA CACCGTTTGT GTTGACCGCC AAGGAGGACG TGGCACTGCT CTGCATTACG
CCGGATCAGA CACTCCCCGC AGATGGCTAT CGTATCACTT GGGACGAAGG GAAAACCACC
CTGCATCACG GCAGCGGGCG GGGGCTGTTT TACGGTCTGG TGTCGCTGGC GCAGATGCTC
ACCCATGCCC ACGCTGAGCC TCAGCGCTAT GGCGTGCCGC TCAGCGGAGA GATCGAGGAC
GCCCCCCGCC ACGGTTGGCG CGGTGCGCAT CTGGATGTCT CGCGCCAGTT TTACCCGCTC
GATCAGGTTC TGCGCTACGT GGACATCATG GCGTGGCACA AGATGAACCG GTTCCATTGG
CACCTGACTG ACGATGAAGG CTGGCGGCTG GAGATCAAAG CCTATCCGCA GCTCACTGAG
ACCGCCGCAC ATACCGGCAT GGACCTGCCC GTCTTGCCGC AGCTTGGCCC AGACATGACC
GGGCAGAGCG GTTTTTACAC CCAGGACGAG GCCCGTCAGG TGGTGAAACA CGCCGCGCAG
TTCGGCATCG AAGTAATGCC GGAGATCGAC GTGCCCGGTC ACTGTGCTTG CGTTCTGGGC
GCGTTGCCTG ATCTGGTCGA TCCCGAAGAG CCCGAGAGCT ACTGGTCGGT GCAGGGGTTT
GCCAACAACG CGCTCAACCC GGCGATCGAG GAGTCTTATA CCTTTGCCGA GACCGTTTTG
GCCGAGGTCT GCGAGATCTT CCCGTTTGAG GTCGTTCATG TGGGGGGCGA TGAGGTGGCC
GAGGGCGCTT GGATGCAATC GCCCAAAGCG CAGGCGATGA TGCGCGAAAC GGGTCTAAAG
GACACGCCGC AATTGCAGGC TTATTTCCTG CGTCACATCC AGACCTATCT GGCGGGGCTT
GGTCGCAAGC TTGGCGGTTG GGAAGAGGTG GCCCATGGCG GTGGTCTTGA TCCCGAGCAC
AGCCTTTTGT TTGCCTGGAC CACAATCGAG AAAACCGCGG AGCTGGCGCA AGAAGGCTAT
GACGTCATCA GCACGCCTGG GCAGGCCTAC TACCTTGATA TGGCGCTGTC GGATGCGTGG
TATGCACCGG GCGCCAGCTG GGCGGGTTTC ACCCCGCTCG ACAAGACTTA TGCGTTTGAG
GCCGACAATG GCGACCCAGT GCTTCAGGGG CGGCTCAAAG GTGTGCAGGC CTGCGTCTGG
AGTGAGCATC TGACCACAAT GGCCCGGCGC AATCACATGA TCTTTCCGCG CCTCAGCGCC
ATTGCAGAGG CCGGGTGGAG CGCAGCCGAA AACAAAGCCT ATGACCGGTT CAAGTCGCTT
GCAGAGTTGA TGCCGCGCCT CTGA
 
Protein sequence
MHFHLDSLWH AEEGAMEFAL TNCGTTPVTN PRLVYATLTR CLRPSNCTGA RLVRRQANFH 
EYASDEGFVL APGETWRFTE HSLTRPALHS NEGPKSAGVL LEDDTLVLAF AGDLQASVVE
GHATAVQNAA LTCGILPEPK RVAISNWAET APVHLALQSD DAAVMQLVSR VSELTRRLHP
LAPTPFVLTA KEDVALLCIT PDQTLPADGY RITWDEGKTT LHHGSGRGLF YGLVSLAQML
THAHAEPQRY GVPLSGEIED APRHGWRGAH LDVSRQFYPL DQVLRYVDIM AWHKMNRFHW
HLTDDEGWRL EIKAYPQLTE TAAHTGMDLP VLPQLGPDMT GQSGFYTQDE ARQVVKHAAQ
FGIEVMPEID VPGHCACVLG ALPDLVDPEE PESYWSVQGF ANNALNPAIE ESYTFAETVL
AEVCEIFPFE VVHVGGDEVA EGAWMQSPKA QAMMRETGLK DTPQLQAYFL RHIQTYLAGL
GRKLGGWEEV AHGGGLDPEH SLLFAWTTIE KTAELAQEGY DVISTPGQAY YLDMALSDAW
YAPGASWAGF TPLDKTYAFE ADNGDPVLQG RLKGVQACVW SEHLTTMARR NHMIFPRLSA
IAEAGWSAAE NKAYDRFKSL AELMPRL
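
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate a coding sequence. It assumes the amino-acid assignments of translation table 11 match the standard code (table 11 differs in which codons may act as starts) and that translation stops at the first stop codon. All function and variable names are hypothetical.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(text: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
fragment = clean("ATGCATTTTC ACCTCGATAG CCTCTGGCAT")
print(translate(fragment))  # MHFHLDSLWH
```

Applying `clean` to the full gene sequence and passing the result to `gc_fraction` and `translate` gives values that can be compared with the GC content and protein sequence listed in this record.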