Gene Nmar_0124 details


Gene Information

Locus tag: Nmar_0124
Symbol:
ID: 5774359
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 113741
End bp: 114796
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 27%
IMG OID: 641315744
Product: glucose-1-phosphate thymidyltransferase
Protein accession: YP_001581462
Protein GI: 161527636
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1209] dTDP-glucose pyrophosphorylase
TIGRFAM ID: [TIGR01208] glucose-1-phosphate thymidylyltransferase, long form
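
The start, end, gene length, and protein length fields are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon; neither convention is stated on this page, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 113741
END_BP = 114796


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the script prints 1056 and 351, matching the Gene Length and Protein Length fields.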


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000000000322504
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGAAAGGAA TAATTTTACA TGGTGGTCAT GGAACACGAC TAAGGCCTTT AACCCATACA 
GGACCAAAAC AACTACTTCC AATTGCAAAT AAACCAATGT CTCAATACTG TATTGAATCC
ATGAAGAATG CAGGGATTAC AGAAATTGCC ATCATTATTG GAGGTATAGC TTCTAAAAAA
GTCGAAGAAT ATTATGGAAA TGGAGAGAAA TTTGGAGTAA AAATCACGTA TATTTCACAA
GAAGCGCCAA AAGGTATTGC TCATGCAATA AATCTATGCA AAGATTTTGT TAAAGATGAT
AAATTCCTTG TATTTTTAGG AGACAATATT TTAAAAAAAG AAATTTTGGA ATACAAAACC
AATTATGAAA ATTCTGATGC AGATGCACTA TTGTTATTAT GTGAAGTAGA TAACCCTACA
CAATTTGGAA TTGCAGATGT TAAAGATAAT AAAATTATCA AGATCATGGA AAAACCAAAG
GATCCACCAA CAAATCTTGC AGTTACAGGA ATTTATTTTC TAAATAAAAA AATTTTTGAA
ATTATTGATA TCTTAAAACC TTCATGGAGA AACGAGTTAG AGATTACTGA TGCACTACAA
TTATTGATGG AAAAAGGAAA TAAAATTATC TTTGACACTG TAACTGATTA TTGGAAAGAT
ACAGGAACTC CAAATGATAT TTTACATGCA AATAAAGAAA TTCTTCAAGA TATTTCTCAA
GAATTTTTGG GAGAAAAAGA ACAAACTCAA ATTGATGGTG TTTGTGTTTT AAAAGAAAAA
TCATTGCTAA AAAATGTAAA AATAATTGGA CCAGTCTTAA TTGGAAAAAA TTGTATTATT
AATAATAATT CAGTTATTGG TCCTAATGTT AGTATTGGAG ATAATTGTAA AATTTCAAAA
AGTAAAATTG AGAATTCAAT AATTATGAAT AATTGTGAAA TTAATTCAAA TATAAAAATT
TCAGATAGTA TAATTGCTTT TGATTGTCAG ATTTTTCAAG AAAAAAATGA AAAGAATGTT
TTGCTTCTAG GTGAAGGAAC AAAAATTTGG ATTTAA
 
Protein sequence
MKGIILHGGH GTRLRPLTHT GPKQLLPIAN KPMSQYCIES MKNAGITEIA IIIGGIASKK 
VEEYYGNGEK FGVKITYISQ EAPKGIAHAI NLCKDFVKDD KFLVFLGDNI LKKEILEYKT
NYENSDADAL LLLCEVDNPT QFGIADVKDN KIIKIMEKPK DPPTNLAVTG IYFLNKKIFE
IIDILKPSWR NELEITDALQ LLMEKGNKII FDTVTDYWKD TGTPNDILHA NKEILQDISQ
EFLGEKEQTQ IDGVCVLKEK SLLKNVKIIG PVLIGKNCII NNNSVIGPNV SIGDNCKISK
SKIENSIIMN NCEINSNIKI SDSIIAFDCQ IFQEKNEKNV LLLGEGTKIW I
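
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch, not the tool used to generate this page: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), and the helper names `translate` and `gc_fraction` are hypothetical. Only the first and last lines of the gene sequence are used as sample input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_line = "ATGAAAGGAA TAATTTTACA TGGTGGTCAT"
last_line = "TTGCTTCTAG GTGAAGGAAC AAAAATTTGG ATTTAA"
print(translate(first_line))  # first ten residues of the protein
print(translate(last_line))   # final residues, stop codon excluded
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the Protein sequence and GC content fields.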