Gene Information |
Locus tag | Namu_3530 |
Symbol | |
ID | 8449149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 3879716 |
End bp | 3880873 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 645042608 |
Product | glycosyl transferase group 1 |
Protein accession | YP_003202844 |
Protein GI | 258653688 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
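
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. Below is a minimal Python sketch, assuming 1-based inclusive coordinates (an assumption that matches the listed gene length) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(3879716, 3880873)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1158 385
```

Both results agree with the Gene Length and Protein Length fields listed in the record.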
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0000188303 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00829445 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGAGC AACGACCACG GGTGGTGGTG GTGGCCGAGG CCGCACCGAT GCGCGGGGGG ATCGCCACCT TCGCCGAGAC CATCACCGCC GATCCCCTGC TGGCCCGCGA CTACGACGTC GAGTTGCTCA ACACGGCCCG GGTGGCCACT CGGGAGGGCG GCCGGTTCAA CCTGGACAAC GTCCGCTACG CGCTGGCCGA TGCCTGGCGG GTGTTCAAGG CCGCCCGCCA GGCCGACATC GTGCACCTGC AACTGGTGGC CGACCCGGGC CTGCCCGCGC TGCGCGCCGC CGCGCTGAAC CTGGCCGCCT CGGCCGGCCG GGCCAAGCTC ATCGCGCACG TGCATTCCGC CGTGGGCAAT GCCGGGCGCC CCGAGATCGC CGGCTACGGA CGGGTCGACC GGATGGCCCT GCGCACGCTG CGGCGAGCCC GCCTGGTGTG CACCGTGTCC ACCGTCGGCA CCGCCACCAT GCAGGCCCTG GCCCCTCGCA CCTGGGTCGA GACCGTCGAC AACGCGGTCA ATCTCGACGA CTTCCCGGCC GGCACGGCCG ACACCGAGCC GCCCACCGTG CTGTTCGTCG GCGTCATCTG CCAGCGCAAG GGCCTGTTCG AACTGGCCCG GGCGGCCCGG CTGCTGCACG AGCGCGGCAT CCACGACTGG AACCTGGTGG TGGTCGGCGG CCAGGGCCCG ACCCCGCAGG CCGAGTACGA CCAGATCGTG GCCGAGTTCG ACGCCGCCGG ATTGCGCGCG GCGATGGTCG GCCCGGAATA CGGCGACCAG ATCAAGGCCC GGCTGGCCTC GGCCGACATC TTCGTGCTCC CCTCGTTTCT GGAGGGTCAA CCGATCGCGA TCATCGAGGC CATGGCCTCC GGGCTGGCCG TCGTGGGCAC CTCCATCGGC GCGGTGCCCG ACCTGATCCG CGACGGCGTC GAGGGCCGGG TGGTCGAGCC CGGGGACGCG CCGGCCCTGG CCGACGCGCT GGCCCAGGTC ATCGGCGATC AGGACGCCCG CCGGCAGATG GGCCACGCGG CCCGTGAGCG GGCCGAACGC TCGCACGGGC TCGAGCAACT GTCCGCGCGG CTGAACTCCC TGTACCGGGC GGTGCTCTCG GGCCGGCGCA GCGCTGCGGT CGAGCGGTCG GTGACCGTCC CGCAATGA
|
Protein sequence | MSEQRPRVVV VAEAAPMRGG IATFAETITA DPLLARDYDV ELLNTARVAT REGGRFNLDN VRYALADAWR VFKAARQADI VHLQLVADPG LPALRAAALN LAASAGRAKL IAHVHSAVGN AGRPEIAGYG RVDRMALRTL RRARLVCTVS TVGTATMQAL APRTWVETVD NAVNLDDFPA GTADTEPPTV LFVGVICQRK GLFELARAAR LLHERGIHDW NLVVVGGQGP TPQAEYDQIV AEFDAAGLRA AMVGPEYGDQ IKARLASADI FVLPSFLEGQ PIAIIEAMAS GLAVVGTSIG AVPDLIRDGV EGRVVEPGDA PALADALAQV IGDQDARRQM GHAARERAER SHGLEQLSAR LNSLYRAVLS GRRSAAVERS VTVPQ
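
The sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be removed before any analysis. The following Python sketch, offered under stated assumptions rather than as the database's own method, strips the spacing, computes the GC fraction, and translates a coding sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one; NCBI translation table 11 uses the same codon-to-amino-acid assignments and differs only in the permitted start codons, which this sketch does not model.

```python
from itertools import product

# Standard genetic code in NCBI ordering (bases T, C, A, G).
# Assumption: table 11 shares these assignments; alternative start codons are ignored.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove display whitespace (the 10-character blocks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# The first three blocks of the gene sequence listed above.
prefix = "ATGAGCGAGC AACGACCACG GGTGGTGGTG"
print(translate(prefix))  # MSEQRPRVVV
```

The translated prefix matches the first block of the listed protein sequence. Applied to the full gene sequence, `translate` should reproduce the whole protein sequence, and `gc_fraction` should come out close to the GC content field.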