Gene Namu_3224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3224 
Symbol 
ID8448838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3554178 
End bp3555566 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content70% 
IMG OID645042303 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_003202544 
Protein GI258653388 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000000420086 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000147183 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAACTGGA CGATCGACAT CCCGGCGGAC ATCCTGCCGA GTCTGCCCCC GCTGCCGCCG 
GAGCTGCGGG CCCGGCTGGA CGACGCGCTG TCCCGGCCCG CCGCGCAGCA ACCGGAGTGG
CCCGACCCCG AGCAGGTGCT CGCCGTCCGG GCCGTCCTGG AGGCGGTGCC CCCGGTGACG
GTGCCCGGCG AGGTCGACAA GCTCGCCGAC CAGCTGGCCG CGGTGGCCCG CGGTGAGGCC
TTCCTGCTGC AGGGCGGCGA CTGCGCCGAG ACCTACGTCG ACAACACCGA GCCGCACATC
CGCGGCAACA TCCGCACCCT GCTGCAGATG GCCGTCGTGC TGACCTACGG CGCCTCACTG
CCGGTGGTCA AGGTGGCCCG CATCGCCGGC CAGTACGCCA AGCCCCGTTC CTCCAACATC
GACGCGCTGG GCCTGCCGTC CTACCGCGGC GACATCATCA ACTCGCTGTC CACCACCCCG
GAAGCGCGGA TCCCCGATCC GTCCCGGATG GTGCGCGCCT ACGCGAACTC CTCCGCGGCG
ATGAACCTGG TCCGCGCGGT CACCGCCACC GGCATGGGCG ACCTGGCCCG GGTCCACGAG
TGGAACCAGG AATTCGTCCT GACCTCGCGC GCCGGCGAGC GGTACGAGCG GGTGGCCAAG
GAGATCGACC GGGCCATGCG GTTCATGAGT GCGTGCGGCG TGACCTCGCA TTCACTGCAC
CAGGTCGACA TCTTCTCCTC GCACGAGGCG CTGCTGCTGG ACTACGAGCG GGCCATGCTG
CGGATGGACA CCAGCCACGA CGAGCCCCGG CTCTACGACC TGTCCGGGCA CTTCCTGTGG
GTGGGCGAGC GGACCCGGCA GCTGGACGGC GCGCACATCG CGTTTGCCCA GCTGCTGTCC
AACCCGATCG GGCTCAAGAT CGGCCCGAGC ACCACCCCGG AGATGGCCGT CGAGTACGTC
GAGCGGCTCG ACCCGCGCAA TCAGGCCGGC CGGCTCACGC TGATCAGCCG GATGAGCAAC
ACCAAGATCC GCGACGTGCT GCCGCCGATC ATCGAAAAGG TGGAGGCGTC CGGGCACCAG
GTCATCTGGC AGTGCGACCC GATGCACGGC AACACCCACG AGTCGCCGAC CGGCTACAAG
ACCCGCCACT TCGACCGCAT CGTGGACGAG GTGCAGGGCT TCTTCGAGGT GCACAACGAG
CTGGGCACCC ACCCGGGTGG CATCCACGTG GAGCTGACCG GCGAGGACGT CACCGAGTGC
CTGGGCGGGG CCCAGGAGAT CTCCGACGAC GACCTGGCCG GCCGCTACGA GACGGCGTGC
GACCCGCGGC TGAACACCCA GCAGTCGCTG GAGCTGGCCT TCCTGGTCGC GGAGATGCTG
CGCGGCTAG
 
Protein sequence
MNWTIDIPAD ILPSLPPLPP ELRARLDDAL SRPAAQQPEW PDPEQVLAVR AVLEAVPPVT 
VPGEVDKLAD QLAAVARGEA FLLQGGDCAE TYVDNTEPHI RGNIRTLLQM AVVLTYGASL
PVVKVARIAG QYAKPRSSNI DALGLPSYRG DIINSLSTTP EARIPDPSRM VRAYANSSAA
MNLVRAVTAT GMGDLARVHE WNQEFVLTSR AGERYERVAK EIDRAMRFMS ACGVTSHSLH
QVDIFSSHEA LLLDYERAML RMDTSHDEPR LYDLSGHFLW VGERTRQLDG AHIAFAQLLS
NPIGLKIGPS TTPEMAVEYV ERLDPRNQAG RLTLISRMSN TKIRDVLPPI IEKVEASGHQ
VIWQCDPMHG NTHESPTGYK TRHFDRIVDE VQGFFEVHNE LGTHPGGIHV ELTGEDVTEC
LGGAQEISDD DLAGRYETAC DPRLNTQQSL ELAFLVAEML RG