Gene Nmar_0125 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0125 
Symbol 
ID5774384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp114842 
End bp115831 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content26% 
IMG OID641315745 
ProductdTDP-glucose 4,6-dehydratase 
Protein accessionYP_001581463 
Protein GI161527637 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1088] dTDP-D-glucose 4,6-dehydratase 
TIGRFAM ID[TIGR01181] dTDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000000000332507 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAAAATTT TAGTTACAGG AGGATTAGGA TTTATTGGTA GTAATTTTAT AATTAATTAT 
TTAAACGAAT TCCCTGAACA TACCATAATT AATTTAGATA ATGAAAATCA TGGAGCCAAC
CATCAAAATT TGATTTCAAT ACAAAAGAAA AATAATTATG AATTCGTTAA AGGAGATATC
ACAAATCATA AACTAATGAA AAATTTGATT TCTATATCTG ATGCAATAGT AAATTTTGCA
GCAGAATCCC ATGTTGATCG AAGTATTTCA GATGCAACAC CATTCATAAA CTCAAATATT
TTAGGGGTAT TTACAATTCT AGAAATTTTA AAAAAAGAAA AAGAAAAAAG GTTAGTTCAG
ATATCAACAG ATGAAGTTTT TGGAAGTTTA AAAAAAAATA GCGCAAATGA GAATTTCAAA
TTAAATCCAT CCAGCCCATA TTCATCATCC AAAGCTTCAG CAGAATTGTT AGTTAATTCT
TATTTTGTAA CATATGAAAT AGATACAGTA ATAACACGTT GTACTAATAA TTATGGACCT
AGACAATTTC CTGAAAAATT AATACCAAAA ACTATTCTAT TAGCAATGCA AAAGCAAAAA
ATTCCAATAT ACGGAAATGG GAAAAATATT AGAGATTGGA TTCATGTTGA TGATCATTGT
AATGCAGTCA AAGAAGTTTT ACATAAAGGA AAATCTGGAG AATCATATAA CATTTCAGCC
CAAAATGAAT TGGATAATAT TCAAATTGTT ACAAATATTT TGGAAAAAAT GGGATTGAAT
GATGATTATT TAGAATTTGT AGAAGATAGA CCTGGGCATG ATTTTAGATA TAGTTTAGAT
TCATCAAAAA TAAGAAATGA ATTAAAATGG AAAGAAGAAA CAAGCTTTGA AGATGGAATT
GAAAAAACAA TTGATTGGTA TGTTAAAAAT CAAGAATGGT GTAACGGTAT TAATAAAGAG
ATTTTAAAAA AAGCCAAATG GAATAACTAA
 
Protein sequence
MKILVTGGLG FIGSNFIINY LNEFPEHTII NLDNENHGAN HQNLISIQKK NNYEFVKGDI 
TNHKLMKNLI SISDAIVNFA AESHVDRSIS DATPFINSNI LGVFTILEIL KKEKEKRLVQ
ISTDEVFGSL KKNSANENFK LNPSSPYSSS KASAELLVNS YFVTYEIDTV ITRCTNNYGP
RQFPEKLIPK TILLAMQKQK IPIYGNGKNI RDWIHVDDHC NAVKEVLHKG KSGESYNISA
QNELDNIQIV TNILEKMGLN DDYLEFVEDR PGHDFRYSLD SSKIRNELKW KEETSFEDGI
EKTIDWYVKN QEWCNGINKE ILKKAKWNN