Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2799 |
Symbol | |
ID | 8448412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 3070503 |
End bp | 3071483 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 645041891 |
Product | protein of unknown function DUF199 |
Protein accession | YP_003202133 |
Protein GI | 258652977 |
COG category | [S] Function unknown |
COG ID | [COG1481] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR00647] conserved hypothetical protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0000000867405 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00000150657 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCGATGA CGGCCTCGGT CAAGGATGAA CTGAGCCGGG TGGTGGTCAG CAAGGTCTCC GCCCGTAAGT CGGAGGTGTC GGCGCTGCTG CGCTTCGCCG GGGGTCTGCA TATCGTGGCC GGCCGGGTGA TCGTCGAGGC CGAACTGGAC ACCGGCGCGG TCGCCCGGCG GCTGCGCAAG GAGATCGCCG AGCTGTACGG GCATGCCGCC GAGGTGCAGG TGCTGGCCGC CTCCGGAATC CGCAAGAGCA CCCGCTACGT GGTCCGGGTG GTCGGCGGCG GCGACTCGCT GGCCCGGCAG ACCGGTCTGA TCGACCAGCG GGGACGGCCG GTGCGCGGGC TGCCGCCGCA GATCGTGGCC GGCTCGGTGG GCGATGCGGA GGCGGCCTGG CGCGGCGCGT TCCTGGCCCA CGGCTCGCTG ACCGAACCGG GCCGCAGCGC GTCCCTGGAG ATCACCTGCC CCGGCCCCGA GGCCGCCCTC GCCCTGGTCG GCGCGGCCCG GCGGATGGGC ATCGGGGCCA AGGCGCGCGA GGTGCGGGGC GCCGACCGGG TACTGGTCCG GGACGGCGAC GCGATCGCCG CGCTGCTGAC CCGGCTGGGC GCGCACTCCT CGATGATGGC CTGGGAGGAG CGGCGGATGC GCCGGGAGGT CCGGGCCACC GCGAACCGGC TGGCCAACTT CGACGACGCC AACCTGCGCC GGTCGGCACG GGCCGCGGTG GCCGCCTCGG CCCGGGTCGA GCGGGCCCTG GAGATCCTCG GCGAGGACGC CCCCGATCAT CTGAGCCAGG CCGGACGGCT GCGGATCGCG CACGGTCAGG CCTCGTTGGA GGAGCTCGGT CAGCTGGCCG ACCCGCCGAT GACCAAGGAC GCCGTGGCCG GCCGGATCCG GCGTTTGCTG ACCATGGCCG ATCGTCGGGC CGCCGAACTG GGCATTCCCG ACACCGAGTC CGCCGTCACC GCAGACATGC TCGACTCCTG A
|
Protein sequence | MAMTASVKDE LSRVVVSKVS ARKSEVSALL RFAGGLHIVA GRVIVEAELD TGAVARRLRK EIAELYGHAA EVQVLAASGI RKSTRYVVRV VGGGDSLARQ TGLIDQRGRP VRGLPPQIVA GSVGDAEAAW RGAFLAHGSL TEPGRSASLE ITCPGPEAAL ALVGAARRMG IGAKAREVRG ADRVLVRDGD AIAALLTRLG AHSSMMAWEE RRMRREVRAT ANRLANFDDA NLRRSARAAV AASARVERAL EILGEDAPDH LSQAGRLRIA HGQASLEELG QLADPPMTKD AVAGRIRRLL TMADRRAAEL GIPDTESAVT ADMLDS
|
| |