Gene Nmar_0048 details


Gene Information

Locus tag: Nmar_0048
Symbol:
ID: 5774119
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 37114
End bp: 38277
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 31%
IMG OID: 641315665
Product: thiamine biosynthesis ATP pyrophosphatase-like protein
Protein accession: YP_001581386
Protein GI: 161527560
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0301] Thiamine biosynthesis ATP pyrophosphatase
TIGRFAM ID:
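
The coordinate and length fields in this record are consistent with one another. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 37114
END_BP = 38277


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1164
print(protein_length(length_bp))  # 387
```

Both results match the Gene Length and Protein Length fields listed for Nmar_0048.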


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGATTACCC GAAGTTTTGT CCTAAATTAT AAACAAGTAG ATTTCAAAGT AGAAAATATG 
AATGAAACAT CATATGTTGT AGTTTTTCCG ACTATATTTT CAAAAAATAA AATTCCACAG
TTAATTTCCA ATATCAAAAA GATTCTCAAA ATAAAAAATC AAGAATTCAA ATCAGTAAAG
AGAGACGGAG AGATTATTTT AGTAGATGCA AATGATCCAG TATTTGCATC ATCTGCAATC
AACATGCTTT TTGGAATTCA AGAGGTTGCA ATTGCAAGAC AGAAAAAAAA TGATTATCAA
GAAATTGTTT CTGAAATCAC TTCTGTAGGG GGAAATTTAC TTCTAAAAGG TGAAAAATTT
CTAGTCAAAG TTGAAGGAAT ATCAAAAGGA TTTCTTGTAA AGGACGTAGA GATTGCAGCT
ACATCAAGTA TTATAGAAAA AAAATCAAAG CTTGGTGCAC ATCCAGGAAC AGAACAAGAT
TACGACAAAT TATTGTATAC ATATCTAACA AAGAATAATG CATACATCTG CATTTTTTCA
GATAAAGGAA AAGGCGGTAT CCCATATCAA TCACAAAGTG AGAAAACAAT TTGTGCAGTA
TATGATGAAT TATCTGCAGT TTCTTGTTAT GAGACCATAA AGCAAGGATA CGATACAAAG
GTAATAGTCT GTTATAGACA AAAATCAGAG TTGATGAACC TAGCTAAAAC ACTAAACCAA
ATAATTCCAA GACTAGTTCA AGAGAAAATC GAATTAGAGT TCTTTCATCT GAAAATAAAT
CCAAAAGGTG TCAAAAATTA TCTAACATAT GTAAATTCAG TTGTAGAAAT AATGCTACAG
TCTTCAATTA AACGAGTGTC TTTAGCTATA TCACCACAAA TATTTTCATC AGATTTTCTG
GATAACGCAC TAAAACTAGT ATTTTCAAAA AAGAAGATTC CCCTAGTTCC ATTAGCTGGA
GTAGACACAA ACTTGTTTGA TGAAGCAAAA GAGATTGGAC TGGAGAGAAA TATCAAGAAA
TTAGAAAATA TTGCAGCCAT TAGTTCAGAT GAAATTCCTG TTTTTGTGAA AAAAGAGGTA
GAAAAAGCAC TCAAAACAAA AAAAGTGATT TCCATTCAGG CTGGACCAAA CAATGTACAT
GATATTTTGG ATTCGCTAGA ATAG
 
Protein sequence
MITRSFVLNY KQVDFKVENM NETSYVVVFP TIFSKNKIPQ LISNIKKILK IKNQEFKSVK 
RDGEIILVDA NDPVFASSAI NMLFGIQEVA IARQKKNDYQ EIVSEITSVG GNLLLKGEKF
LVKVEGISKG FLVKDVEIAA TSSIIEKKSK LGAHPGTEQD YDKLLYTYLT KNNAYICIFS
DKGKGGIPYQ SQSEKTICAV YDELSAVSCY ETIKQGYDTK VIVCYRQKSE LMNLAKTLNQ
IIPRLVQEKI ELEFFHLKIN PKGVKNYLTY VNSVVEIMLQ SSIKRVSLAI SPQIFSSDFL
DNALKLVFSK KKIPLVPLAG VDTNLFDEAK EIGLERNIKK LENIAAISSD EIPVFVKKEV
EKALKTKKVI SIQAGPNNVH DILDSLE
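
The gene sequence begins with TTG while the protein begins with M. Under translation table 11 the internal codon assignments are the same as in the standard genetic code, and the initiator codon is read as methionine. The sketch below illustrates this on the first row of the gene sequence. It assumes the input starts at the true start codon and is in frame; `translate_cds` and `gc_fraction` are hypothetical helper names, not functions from an existing library.

```python
# Standard codon assignments (shared by NCBI tables 1 and 11),
# listed in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(dna: str) -> str:
    """Remove the spacing used in the record layout and upper-case the bases."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Assumption: the first codon is the initiator, so it is written as M
    whatever its usual assignment (TTG normally encodes L).
    """
    seq = clean(dna)
    residues = []
    for index in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[index:index + 3]]
        if amino_acid == "*":
            break
        residues.append("M" if index == 0 else amino_acid)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence from the record above.
FIRST_ROW = "TTGATTACCC GAAGTTTTGT CCTAAATTAT AAACAAGTAG ATTTCAAAGT AGAAAATATG"
print(translate_cds(FIRST_ROW))  # MITRSFVLNYKQVDFKVENM
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.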