Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2739 |
Symbol | |
ID | 4077611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 2883772 |
End bp | 2885655 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638008064 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_614733 |
Protein GI | 99082579 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATTTTC ACCTCGATAG CCTCTGGCAT GCCGAAGAAG GCGCGATGGA GTTCGCGCTG ACCAATTGTG GCACAACCCC CGTGACCAAC CCGCGCCTCG TTTACGCAAC GCTCACGCGG TGTTTGCGGC CTTCGAACTG CACCGGGGCA CGGCTTGTGC GGCGGCAGGC AAACTTTCAC GAATACGCCT CGGACGAGGG ATTCGTGCTC GCACCGGGAG AGACTTGGCG GTTCACGGAG CATAGCCTCA CCCGTCCGGC GCTTCATTCC AATGAGGGGC CAAAATCAGC GGGAGTCCTC TTGGAGGATG ACACGCTCGT GCTGGCATTT GCCGGGGACT TGCAGGCCTC TGTGGTGGAA GGTCACGCAA CAGCTGTGCA AAATGCGGCT CTAACATGCG GGATTCTGCC CGAACCCAAG CGTGTCGCTA TCTCGAACTG GGCTGAGACC GCCCCTGTGC ATCTCGCGCT TCAGAGTGAT GACGCAGCGG TGATGCAGTT GGTATCGCGC GTCTCTGAGC TGACTCGCCG TTTGCATCCG CTTGCGCCGA CACCGTTTGT GTTGACCGCC AAGGAGGACG TGGCACTGCT CTGCATTACG CCGGATCAGA CACTCCCCGC AGATGGCTAT CGTATCACTT GGGACGAAGG GAAAACCACC CTGCATCACG GCAGCGGGCG GGGGCTGTTT TACGGTCTGG TGTCGCTGGC GCAGATGCTC ACCCATGCCC ACGCTGAGCC TCAGCGCTAT GGCGTGCCGC TCAGCGGAGA GATCGAGGAC GCCCCCCGCC ACGGTTGGCG CGGTGCGCAT CTGGATGTCT CGCGCCAGTT TTACCCGCTC GATCAGGTTC TGCGCTACGT GGACATCATG GCGTGGCACA AGATGAACCG GTTCCATTGG CACCTGACTG ACGATGAAGG CTGGCGGCTG GAGATCAAAG CCTATCCGCA GCTCACTGAG ACCGCCGCAC ATACCGGCAT GGACCTGCCC GTCTTGCCGC AGCTTGGCCC AGACATGACC GGGCAGAGCG GTTTTTACAC CCAGGACGAG GCCCGTCAGG TGGTGAAACA CGCCGCGCAG TTCGGCATCG AAGTAATGCC GGAGATCGAC GTGCCCGGTC ACTGTGCTTG CGTTCTGGGC GCGTTGCCTG ATCTGGTCGA TCCCGAAGAG CCCGAGAGCT ACTGGTCGGT GCAGGGGTTT GCCAACAACG CGCTCAACCC GGCGATCGAG GAGTCTTATA CCTTTGCCGA GACCGTTTTG GCCGAGGTCT GCGAGATCTT CCCGTTTGAG GTCGTTCATG TGGGGGGCGA TGAGGTGGCC GAGGGCGCTT GGATGCAATC GCCCAAAGCG CAGGCGATGA TGCGCGAAAC GGGTCTAAAG GACACGCCGC AATTGCAGGC TTATTTCCTG CGTCACATCC AGACCTATCT GGCGGGGCTT GGTCGCAAGC TTGGCGGTTG GGAAGAGGTG GCCCATGGCG GTGGTCTTGA TCCCGAGCAC AGCCTTTTGT TTGCCTGGAC CACAATCGAG AAAACCGCGG AGCTGGCGCA AGAAGGCTAT GACGTCATCA GCACGCCTGG GCAGGCCTAC TACCTTGATA TGGCGCTGTC GGATGCGTGG TATGCACCGG GCGCCAGCTG GGCGGGTTTC ACCCCGCTCG ACAAGACTTA TGCGTTTGAG GCCGACAATG GCGACCCAGT GCTTCAGGGG CGGCTCAAAG GTGTGCAGGC CTGCGTCTGG AGTGAGCATC TGACCACAAT GGCCCGGCGC AATCACATGA TCTTTCCGCG CCTCAGCGCC ATTGCAGAGG CCGGGTGGAG CGCAGCCGAA AACAAAGCCT ATGACCGGTT CAAGTCGCTT GCAGAGTTGA TGCCGCGCCT CTGA
|
Protein sequence | MHFHLDSLWH AEEGAMEFAL TNCGTTPVTN PRLVYATLTR CLRPSNCTGA RLVRRQANFH EYASDEGFVL APGETWRFTE HSLTRPALHS NEGPKSAGVL LEDDTLVLAF AGDLQASVVE GHATAVQNAA LTCGILPEPK RVAISNWAET APVHLALQSD DAAVMQLVSR VSELTRRLHP LAPTPFVLTA KEDVALLCIT PDQTLPADGY RITWDEGKTT LHHGSGRGLF YGLVSLAQML THAHAEPQRY GVPLSGEIED APRHGWRGAH LDVSRQFYPL DQVLRYVDIM AWHKMNRFHW HLTDDEGWRL EIKAYPQLTE TAAHTGMDLP VLPQLGPDMT GQSGFYTQDE ARQVVKHAAQ FGIEVMPEID VPGHCACVLG ALPDLVDPEE PESYWSVQGF ANNALNPAIE ESYTFAETVL AEVCEIFPFE VVHVGGDEVA EGAWMQSPKA QAMMRETGLK DTPQLQAYFL RHIQTYLAGL GRKLGGWEEV AHGGGLDPEH SLLFAWTTIE KTAELAQEGY DVISTPGQAY YLDMALSDAW YAPGASWAGF TPLDKTYAFE ADNGDPVLQG RLKGVQACVW SEHLTTMARR NHMIFPRLSA IAEAGWSAAE NKAYDRFKSL AELMPRL
|
| |