Gene Information |
Locus tag | Namu_2145 |
Symbol | |
ID | 8447756 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 2368541 |
End bp | 2369422 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 645041268 |
Product | dihydropteroate synthase |
Protein accession | YP_003201512 |
Protein GI | 258652356 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0294] Dihydropteroate synthase and related enzymes |
TIGRFAM ID | [TIGR01496] dihydropteroate synthase |
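The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates (which is what the reported values imply) and a single terminal stop codon that is not counted in the protein length; the function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no amino acid.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(2368541, 2369422)
    print(length)                     # 882
    print(protein_length_aa(length))  # 293
```

Both results agree with the Gene Length (882 bp) and Protein Length (293 aa) rows of the record.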
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.00151259 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00437395 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGGGA CAGCGCACCC GGCGGTGGTC CTGCGGGGAC GGCCGCTGCC CGCCGACCGC GCGGCCGTCA TGGCCATCGT GAACCGCACC CCTGACTCGT TTTTCGATGC CGGCGCCACC TTCGGGGAGA CCGCCGCGAT GGAGGCGGTG CGCCGCGCCG TCGACGAGGG CGCCGACCTG GTGGACATCG GCGGGGTGAA GGCCGGTGTC GGGCCCGAGG TCACCGAGTC GGACGAGATC GATCGGGTGG TCCCGTTCGT CGCCCGGGTG CGGGAGCGCT TTCCGGAGTT GCCGATCAGC GTGGACACCT GGCGGGCCGG GGTCGCGGTG GCCGCCTGCG AGGCCGGGGC GGACCTGATC AACGACACCT GGGCCGGCGC CGATCCGGAC CTGGCCCACG TCGCGGCGCG CTTCGGGGCG GGCATCGTCT GCTCGCACAC CGGGGGTGCG GCTCCCCGGA CCAATCCCTA TCGGGTGAGC TATCCGGACG TGGTCGCCGC GGTCATCGAG CAGACCACCA CGGCGGCCCG GCGAATGGTC GAGCTGGGGG TGCCGCGGGC CGGTGTCCTG ATCGACCCGA CGCACGACTT CGGCAAGAAC ACCTGGCACA GCCTGGAGCT GGTACACCGC ACCGGAGAGC TGGTGGCCAC CGGCTGGCCG GTGCTGATGG CCCTGTCCAA CAAGGACTTC GTCGGCGAGA CCCTGGACCT GCCGGTCGAC CAACGGCTCG AAGGCACCCT GGCGGCCACC GCGATCGCCG TGTGGCAGGG CGCCGCGGTC GTCCGCGCCC ATCAGGTGCG GGCCACCCGC CGGGTGGTGG AGATGGCCGC CGCCGTGGCC GGTACCCGGG CGCCGCTGGC CCCACGCCGT GGCCTGGTCT GA
|
Protein sequence | MTGTAHPAVV LRGRPLPADR AAVMAIVNRT PDSFFDAGAT FGETAAMEAV RRAVDEGADL VDIGGVKAGV GPEVTESDEI DRVVPFVARV RERFPELPIS VDTWRAGVAV AACEAGADLI NDTWAGADPD LAHVAARFGA GIVCSHTGGA APRTNPYRVS YPDVVAAVIE QTTTAARRMV ELGVPRAGVL IDPTHDFGKN TWHSLELVHR TGELVATGWP VLMALSNKDF VGETLDLPVD QRLEGTLAAT AIAVWQGAAV VRAHQVRATR RVVEMAAAVA GTRAPLAPRR GLV
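The GC content and the protein sequence can both be recomputed from the gene sequence. The following is a minimal sketch, not the pipeline that produced this record: the helper names are hypothetical, the codon-to-amino-acid mapping is the standard one (which translation table 11 shares for elongation codons), and alternative start codons such as GTG or TTG are not handled specially, which is acceptable here because this gene starts with ATG.

```python
from itertools import product

# Standard codon mapping, bases ordered T, C, A, G; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    head = "ATGACCGGGA CAGCGCACCC GGCGGTGGTC"
    print(translate(head))  # MTGTAHPAVV
```

Passing the full Gene sequence field to `translate` and `gc_fraction` allows the result to be compared against the Protein sequence and GC content fields of the record.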