Gene Information |
Locus tag | TM1040_3148 |
Symbol | |
ID | 4075020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 128830 |
End bp | 129972 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638004651 |
Product | N-acetylglucosamine 6-phosphate deacetylase |
Protein accession | YP_611384 |
Protein GI | 99078126 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1820] N-acetylglucosamine-6-phosphate deacetylase |
TIGRFAM ID | [TIGR00221] N-acetylglucosamine-6-phosphate deacetylase |
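The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes its stop codon (the gene sequence in this record ends in TGA). The function names `gene_length` and `protein_length` are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS that includes its stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(128830, 129972)
print(length_bp)                  # 1143
print(protein_length(length_bp))  # 380
```

Under these assumptions the computed values agree with the Gene Length (1143 bp) and Protein Length (380 aa) fields.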
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00301964 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCATC CCCTTACGAC CTACCTCGGC GGGCCGATCT TTGATGGCAA GCACGTGCTG CAAGGCTTTG GCGCGCAGTT TCGCGAGGGC GCGCTTGTGG CGCTTGCCCC GGTGGCGGAG CTGCAAGAGC AGGGGGAGGT GATTGACCTA GGTGGGGACC TTCTGTCGCC CGGCTATGTC GACCTGCAAG TCAACGGCGG CGGGGGGGTG ATGCTGGGCG ATGCGCCCAA TGTCGAGACC ATTCGCAAGA TCTGCGCGGC GCATCGCAGC CTGGGTGCCA CCACGATCCT GCCGACGCTG ATCACCGACA CCGCCGAAAA GACCCGTGCG ACACTTGAGG CCGGGATTGC CGCGCATGAG GCCGGCGTGC GTGGGTTTGG CGGGTTGCAT CTGGAAGGTC CACATCTGTC GGTTGCTCGC AAGGGCGCCC ATGACGCCAA CCTGATCCGC GCAATGGATG ACAGCGACCT TGCTGCGATC TGCACGGCGG CTGCACGCTT GCCCAAGCTC AAGGTCACGG TGGCGGCAGA AAGCGTCACC CCGGAGCAGG TGATGCGGAT GGTCGAAGCG GGTGTGCTGG TATCGCTTGG TCACACGGAT GCGCCCTTTG ATACCTGCGT GGACTATGTG CGGGCCGGTG CGCGCTGTGC CACCCATCTG TTCAACGCCA TGAGCCAGCT TGGCAACCGG GCGCCGGGGC TGGTGGGAGC GGTGCTTGAT ACCGCAGAGC TTTCGGCGGG TGTGATTGCG GATGGGATCC ATGTACATCC TGCAAGCCTG CGCGCCGCCT GGCAGGCAAA GCGGCGCGGC CCCGGGCACC TGTTCCTCGT CTCGGACGCG ATGGCAGTTG CCGGGACCGA GGATCGCGAA TTCCTGCTCG AAGGCCGCCG GATCACGCGC AGTGACGGAC GGCTGTGCCT GTCGGATGGG ACTTTGGCTG GTGCGGATCT TGATCTGACC ACGGCCCTGC GGGTTCTGGT CAGCCAATGC GATGTGCCGC TCGCCGAGGG GCTAGAGGCG GCAACATCTG TGCCCGCCGC CCTGATCGGC AAGTCGGTGG ATCTGACGCA GCCGGGACAG AAGCAGGTGG ATATGATCCG CATCAAGCCG GAGCTCAGCG CCGCCGCGCC GGTACTGCCC TGA
|
Protein sequence | MMHPLTTYLG GPIFDGKHVL QGFGAQFREG ALVALAPVAE LQEQGEVIDL GGDLLSPGYV DLQVNGGGGV MLGDAPNVET IRKICAAHRS LGATTILPTL ITDTAEKTRA TLEAGIAAHE AGVRGFGGLH LEGPHLSVAR KGAHDANLIR AMDDSDLAAI CTAAARLPKL KVTVAAESVT PEQVMRMVEA GVLVSLGHTD APFDTCVDYV RAGARCATHL FNAMSQLGNR APGLVGAVLD TAELSAGVIA DGIHVHPASL RAAWQAKRRG PGHLFLVSDA MAVAGTEDRE FLLEGRRITR SDGRLCLSDG TLAGADLDLT TALRVLVSQC DVPLAEGLEA ATSVPAALIG KSVDLTQPGQ KQVDMIRIKP ELSAAAPVLP