Gene Nmar_1733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1733 
Symbol 
ID5773739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1589437 
End bp1590453 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content29% 
IMG OID641317387 
ProductRNA-3'-phosphate cyclase 
Protein accessionYP_001583067 
Protein GI161529241 
COG category[A] RNA processing and modification 
COG ID[COG0430] RNA 3'-terminal phosphate cyclase 
TIGRFAM ID[TIGR03399] RNA 3'-phosphate cyclase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.254233 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTTT TAAAAATTAA TGGAGCACAT GGTGAAGGAG GAGGACAAAT AATTCGTTCT 
GCAATTACTC TTTCATGTAT TACAAAACAA CCAATTCACA TTGAGAATAT TAGGAAAAAT
AGAAAAGTTT CTGGATTAAA ACCTCAACAT CTTACAGCAA TTAAAATTCT AAAAAAAATT
TCAGATTGTA AAGTAATTGG TGATGAAATT GGTTCTACGG AATTAAAATT CATTCCAGGA
GAAATTAAAA GTTCAAAATT ATCTGAAGAT GTAGGAACAG CAGGAAGTAT TTCACTAATT
TTACAAGTTT TAATTCCAAT AGTAGCAATA TCACAAAAGA ATCTTGAGAT TTCAATCAAA
GGTGGTACAG ATGTACAATG GAGTCCAACA ATGTTCTATA CACAACATAT TTTGAGAGAA
GTATATTCAA GAATGGGGAT TAATTTTTCA TTTGAATTAA AGAAGAGGGG ATACTATCCA
AAAGGAAATG GAGAAGTTAA TTTAGAAATT AATCCATCAA ACGTTAAAGC AATTTCATTA
TCAAAAAGAA AAACAAATCG GGTAAAAATT CTATGTACAT TTTCAAAAAT TCCAAATGAA
AAAATTGAAA GTGAAATTAA AAAAATTAAA GAAAAACTAA ATGAAAGTTT TGTTGTTGAC
GTAGAAATCA AAGAAGAAGC ATTAGACTCT GGTGCGTCTT TATTAGTTTA CAGTATTGAT
GAGAATTCAA TTATAGGAAT TGATTCATTA TTTGATAAAA AAATAGAGAG TTTTGACGTA
GATCTTGATG GATTTTTAGA AGATATAGCA GTGGATGAGA ATCTGGCAGA CATGATTGTA
GTTCCAGCAA GTGTGGCCAA AGGCAAGACA ATCTTTCAAG TCAAAAAAAT TACAAAACAC
TTGGAAACCA ACTTGTTTGT AACATCAAAA ATTTCTGGTT GCAAATATGG AATAGGCAAA
TTACCTAATG GTTTTGAAGT AATTATTGAA GGAACATCAT ACTCCAGCAT CAAGTAA
 
Protein sequence
MDFLKINGAH GEGGGQIIRS AITLSCITKQ PIHIENIRKN RKVSGLKPQH LTAIKILKKI 
SDCKVIGDEI GSTELKFIPG EIKSSKLSED VGTAGSISLI LQVLIPIVAI SQKNLEISIK
GGTDVQWSPT MFYTQHILRE VYSRMGINFS FELKKRGYYP KGNGEVNLEI NPSNVKAISL
SKRKTNRVKI LCTFSKIPNE KIESEIKKIK EKLNESFVVD VEIKEEALDS GASLLVYSID
ENSIIGIDSL FDKKIESFDV DLDGFLEDIA VDENLADMIV VPASVAKGKT IFQVKKITKH
LETNLFVTSK ISGCKYGIGK LPNGFEVIIE GTSYSSIK