Gene Namu_4151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4151 
Symbol 
ID8449777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4585831 
End bp4587282 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content73% 
IMG OID645043200 
Product3,4-dihydroxy-2-butanone 4-phosphate synthase 
Protein accessionYP_003203429 
Protein GI258654273 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase 
TIGRFAM ID[TIGR00505] GTP cyclohydrolase II
[TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0115594 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00443199 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCAGCA CGGCGCCCGG AGCGCAGAAC CAGGACCTGG GCTTCGCCGG GATCCCCGCG 
GCCGACTTCA CCGAGGCGCC CCGGCGGCCC CGGGCCGACC GCGGCCCGGC CGGCGACGCG
GCCGTCGACA CGGGGTACGC CGGGATCCCG TTCGAGGACG GCACCGCGGC GCCGCCCCGG
CCCTGGGACC CGAACCGGCC CAAGACCGAG GCCGGCTTCG ACACCATCGA TTTCGCCCTG
GCCGAGCTGG CCGCCGGCCG CGCGGTCGTC GTTGTCGACG ACGAGGACCG GGAGAACGAG
GGTGACCTGA TCTTCGCCGC CGAGCTGGCC ACCCCCGAGC TGATGGCCTT CACGGTGCGC
CACTCCTCGG GCGTGGTCTG CGTCGGTCTG ACCGGAGACG CCTGCGACCG GCTCGACCTG
CCGCCGATGT ACCACCGCAA CCAGGACCGC AAGTCGACCG CGTTCACCGT CAGCGTCGAC
GCCAAAGAGG GCGTCACCAC CGGCATCTCG GCGGCGGAAC GGGCGCACAC GGTGCGCCTG
CTGGCCGACC CGGCGGCCAC CGACGAGGAC CTGTCGCGGC CCGGCCACGT CTTCCCGCTG
CGCGCCCGCG ACGGCGGCGT GCTGGTCCGC CCCGGGCACA CCGAGGCCGC CGTCGACCTG
GCCGCCCTGG CCGGGCTGCA GCCGGCCGGT GCCCTGTGCG AGATCGTCAA CCACGACGGC
TCGATGTCCC GGCTGCCCGA CCTGCAGGTC TTCGCCCGCC GGCACCGGCT CGCGCTGATC
TCCATCGCCG ACCTGATCGC CTACAAGCGG GCCCGCGAGG TGCAGATCCG CAAGGTCGCC
AGCGCCCGGC TGCCGCTGCC GCAGGGCGTG TTCACGGCCG TCGGCTACAT CAGCACGGTC
ACCGGCCGGG AGCTGATCGC ATTGGTGGCC GGCGAGATCG GCGACGGCCG GGACGTGCTG
GTGCGCGTGC ACTCGGAGTG CCTGACCGGT GATGTGCTCG GATCGCTGCG CTGCGACTGC
GGTCCGCAGC TGCAGGCCGC GCTGCAGGCG GTCGCCGACG AGGGGCGCGG CGTGGTGCTC
TACATCCGTG GGCACGAGGG CCGGGGGATC GGTCTGCTGG ACAAGCTGCG GGCCTACGAG
CTGCAGGACG CCGGGGCGGA CACGGTCGAT GCGAACCTGC AGCTGGGCCT GCCGTCCGAC
TCGCGCGAGT ACGGCACCGG CGCCCAGGTG CTGGCCGATC TGGGCATCAC CTCGATGCGG
CTGCTGACCA ACAACCCGGC CAAGCGGGCC GGGCTGGAGG GCTACGGCCT GTCGATCAAC
GGCCGGGTGT CGTTGCCGGC CCACGTCAAC CCCGAGAACC TGCGGTACCT GACCACCAAG
CGGGACCGGA TGGGGCACGA GTTGGACGGG CTGGACGGGA CGGACATCCT GTACGGCGAG
GGACACGCGT GA
 
Protein sequence
MTSTAPGAQN QDLGFAGIPA ADFTEAPRRP RADRGPAGDA AVDTGYAGIP FEDGTAAPPR 
PWDPNRPKTE AGFDTIDFAL AELAAGRAVV VVDDEDRENE GDLIFAAELA TPELMAFTVR
HSSGVVCVGL TGDACDRLDL PPMYHRNQDR KSTAFTVSVD AKEGVTTGIS AAERAHTVRL
LADPAATDED LSRPGHVFPL RARDGGVLVR PGHTEAAVDL AALAGLQPAG ALCEIVNHDG
SMSRLPDLQV FARRHRLALI SIADLIAYKR AREVQIRKVA SARLPLPQGV FTAVGYISTV
TGRELIALVA GEIGDGRDVL VRVHSECLTG DVLGSLRCDC GPQLQAALQA VADEGRGVVL
YIRGHEGRGI GLLDKLRAYE LQDAGADTVD ANLQLGLPSD SREYGTGAQV LADLGITSMR
LLTNNPAKRA GLEGYGLSIN GRVSLPAHVN PENLRYLTTK RDRMGHELDG LDGTDILYGE
GHA