Gene Nmar_0223 details


Gene Information

Locus tag: Nmar_0223
Symbol:
ID: 5774582
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 197758
End bp: 198627
Gene Length: 870 bp
Protein Length: 289 aa
Translation table: 11
GC content: 31%
IMG OID: 641315843
Product: formyl transferase domain-containing protein
Protein accession: YP_001581557
Protein GI: 161527731
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0788] Formyltetrahydrofolate hydrolase
TIGRFAM ID: [TIGR00639] phosphoribosylglycinamide formyltransferase, formyltetrahydrofolate-dependent
TIGRFAM ID: [TIGR00655] formyltetrahydrofolate deformylase
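
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; both are assumptions, not statements made by the record. The function names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 197758
end_bp = 198627
gene_length_bp = 870
protein_length_aa = 289


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)        # True
print(computed_protein_length == protein_length_aa)  # True
```

Under these assumptions, 198627 - 197758 + 1 gives 870 bp, and 870 / 3 - 1 gives 289 aa, which agrees with the listed values.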


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAAAT CTTCAAAAAA CAAAGTCATG AAAAAGACAG TAGTTGGAAT AACAGTTGTT 
GGTAAAGATA GAGAAGGTAT TGTAGCTTCA TTTACAAATT TTGCATTCTC AAAAGGAGGA
AACATTGAGA AAGTAAATCA GAATGTAATC AAGGGCCTTT TTGGAATGTA TCTAGAAGTT
TCTTTTGCAA AAGCAGTTAA TGTAAAAAAA TTTGATGCAG AAATTCAAAC TTTGGCTAAA
AAAGAAAAGA TGGATGTAAG TACTCATCAT GAAACAAATT CGCAAAAAAA TATTGCAGTT
TTTGTGACAA AAGAACCATT ATGTCTACAA ACAATTCTTG CAAAATCAAA ATCACTAAAA
GGAAAAATCT CAGTAATTAT AGGTACTGAA AAGACACTTG AATCATTAGC AAAGAAAGCA
AAGATTCCAT TTGTTGCAGT TGAAGAGAAG AATCAACAAA AGGCAGAAGA AAAAATTATT
CAGATTTGTA AAAAATACAA TATTGATTTG ATCTCACTTG CAAGATACAT GAGAATTCTT
AGTCCTAACT TTGTTTGGAG ATATCCAAAT AGAATTATCA ACATACATCC ATCATTATTG
CCAGCATTTC CTGGTGCACT AGCATATGCA CAAGCTTATG AAAGAGGTAC AAAGATTGTA
GGAGTTACAT CCCATTACGT AACTGAAAAC TTGGATCAAG GACCAATAAT TTTCCAAGAT
TCTTTCAAAG TAGATCCAAA TGATACTTTA GAGAAAATAA AATCAAAGGG GCAAAAATTA
GAGGCAGATA CATTATTCAA AGCAATGAAA ATGCATTTAG AAAACAAACT AGATGTTCGT
TGGAGAAAGG TTCACATCAA ATCAAAGTGA
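
A GC content figure such as the one in the record can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in the 10-base blocks shown above; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first row of the sequence is used here as sample input, so the result is not the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGAGAAAAT CTTCAAAAAA CAAAGTCATG AAAAAGACAG TAGTTGGAAT AACAGTTGTT"

print(len(clean_sequence(first_row)))  # 60
print(round(gc_percent(first_row), 1))
```

Passing the full 870-base sequence to `gc_percent` would give the value to compare against the record's GC content field.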
 
Protein sequence
MRKSSKNKVM KKTVVGITVV GKDREGIVAS FTNFAFSKGG NIEKVNQNVI KGLFGMYLEV 
SFAKAVNVKK FDAEIQTLAK KEKMDVSTHH ETNSQKNIAV FVTKEPLCLQ TILAKSKSLK
GKISVIIGTE KTLESLAKKA KIPFVAVEEK NQQKAEEKII QICKKYNIDL ISLARYMRIL
SPNFVWRYPN RIINIHPSLL PAFPGALAYA QAYERGTKIV GVTSHYVTEN LDQGPIIFQD
SFKVDPNDTL EKIKSKGQKL EADTLFKAMK MHLENKLDVR WRKVHIKSK
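
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a plain-Python illustration, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons); the `translate` helper is hypothetical, and only the first and last rows of the gene sequence are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGAGAAAAT CTTCAAAAAA CAAAGTCATG AAAAAGACAG TAGTTGGAAT AACAGTTGTT"
last_row = "TGGAGAAAGG TTCACATCAA ATCAAAGTGA"

print(translate(first_row))  # MRKSSKNKVMKKTVVGITVV
print(translate(last_row))   # WRKVHIKSK
```

Both outputs match the corresponding start and end of the protein sequence listed above. The last row is in frame because every preceding row holds 60 bases, a multiple of three.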