Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_3553 |
Symbol | |
ID | 8449172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 3902195 |
End bp | 3903367 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 645042630 |
Product | glycosyltransferase, MGT family |
Protein accession | YP_003202866 |
Protein GI | 258653710 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR01426] glycosyltransferase, MGT family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.000992125 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0041282 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGGATGC TGGTCAGCTT CGCCGGCGGC ACCGGGCACT TCCTGCCGCT GGTTCCGCTG GCCCGGGCGG CCCGCGCCGA GGGCGACGCG GTCCTGGTCA CCGGCCAGGC GGCGCTGCTG CCCACCGTGA CGGCGGCCGG GTTCACCGCG GTCGACAGCG GCGGCACCAC CCTGGCCGAC CCGGCGGCCC GGCGCGACCT CGCGCGCGTG GACCGGGCGG CCGAGGCGGC CGTGATCCTC GACGTCTTCG CCGGCTCTCT GGCCCGGTCC CGCGCCGCGC GACTCATGGC CATCGCCCGG CACTGGCGTC CGGACGTAAT CGTGCACGAC GAGATGGACT TCGGCGCCGC CCTGACCGCC GAGAGCCTGG GCCGGCCGCG GGTCGAGATG ACCGTGTTGC TGGCCGGCGG AACCGTCGAT CGGCGGCACT TGACGACTCG GATCGAGCGC ACCCGCCGAT CCATGGGCCT GACGGCGCGG CCGGGCTCCC GGCGGCTCAC CCTGGTGCCG GCGCCGCCCG GATTCCGCGA TCCGGCCGAT CCGCTGCCGC CGCCCGTGCT CTGGATCCGC CCCGACGTGC TGGAACCGGT CCCGAATGAA CAGGATCCGG CGACCCGACG CACGCTGGCC TGGCTCGCCC GCCAACCGGC GCGGCCGCGA ATCCTGTTCA CGCTGGGCAC GATCTTCCAT CAGGAATCCG GCGACCTGTT CAGCCGGGCG GTGGCCGGAT TGAGCCAATT GGACGCCTCG ATCGTGGTGA CGGTGGGGCG CGAAATCGAC CCGACCGAGC TGGGTCCGAT GCCCCCGCAC GTGCACGTGG AGCGCTTCGT TCCGCAGGCG TCCGTGCTCC CGCACTGCGA TCTGGTGGTC TGCCACGCCG GCTCGGGCAG CGTCATGGGC GCCCTGGCGT TCGGCCGGCC GATGCTGCTG CTGCCGATGG GCGCCGACCA GCCGGCCAAC GCGGACCGCT GCGCGGACCT GGGCGTCGCG ACGGTGCTCG ATCCCCTGCT CGCCACCGTC GACGACGTGA CCACGGCGGC GCGAGAGCTG TTGCTCGATC CGACATTCCG CCGGCGCGCC GCGTCGTGGC GATCGGCCGC TGCCGGCCTC CCCACGGCGG CGCAGGCTCT GGACCGGGTT CGTCGCCTGG TCGACTTCAC ACCGTCCTCT TGA
|
Protein sequence | MRMLVSFAGG TGHFLPLVPL ARAARAEGDA VLVTGQAALL PTVTAAGFTA VDSGGTTLAD PAARRDLARV DRAAEAAVIL DVFAGSLARS RAARLMAIAR HWRPDVIVHD EMDFGAALTA ESLGRPRVEM TVLLAGGTVD RRHLTTRIER TRRSMGLTAR PGSRRLTLVP APPGFRDPAD PLPPPVLWIR PDVLEPVPNE QDPATRRTLA WLARQPARPR ILFTLGTIFH QESGDLFSRA VAGLSQLDAS IVVTVGREID PTELGPMPPH VHVERFVPQA SVLPHCDLVV CHAGSGSVMG ALAFGRPMLL LPMGADQPAN ADRCADLGVA TVLDPLLATV DDVTTAAREL LLDPTFRRRA ASWRSAAAGL PTAAQALDRV RRLVDFTPSS
|
| |