Gene Nmar_1563 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1563 
Symbol 
ID5773372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1431790 
End bp1432785 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content32% 
IMG OID641317216 
Productdiphthamide biosynthesis protein 
Protein accessionYP_001582897 
Protein GI161529071 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1736] Diphthamide synthase subunit DPH2 
TIGRFAM ID[TIGR00322] diphthamide biosynthesis protein 2-related domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATTGTAA TAGATGAGAA AAGGATTTTT CAGGAGATAG AAGAAAAAAA TCCTGCATCA 
GTTTCATTAA ATGGGCCAGA TGGAATTTTG CCACAAGTAC AAGAAACTGC AAAAAATATT
ACAAAAAAAT TTGGCATTCC AGCATATGTT CTAGCTGATA CAACTTGGGG AACGTGTGAT
TTGAATTCAA ATGGTTCCAA AGTTCTTGGT GCAGAAATTC AATTCAACAT AGGTCATACA
ATAAACACTG AGACATATGA AAAAAATTTG ATTCTAATTG ATGCTTATGA TGATGTAGAA
TTTGACAGTG TAGCAAAAAA ATGTGCAGAA TTACTGAAAG GAAAAGTAAT TTCTCTAGTA
ACAGATAGTC AACACTTGCA TCAAGTAGAT AAAGTTGAAA AAATTTTAAC AGAAAATGGA
ATTACAGTAA AGATTGGAAA AGGTAAAGGA CAGTTAAATG ATGGACAAGT ATTTGGTTGT
GAATTTTATC CTGCAACAGA TCTAAAAAAA GAAGTAGATG CATACGTTTT CTTGGGACAA
AGTAATTTTC ATGCATCGGG AATTGCATTA TCCACAAATC TGCCAACATA TGTTTTAGAT
CCTTACTTTA ATGAAGTAAG AGAGGTTACA GATTTTGCAC AGAAATTAAA AAAGAAAGCT
ACTCTTGCAA TATACAAAGC AGCTGAAGCA AAAACTTTTG GAGTGATTGT GGGATTGAAA
GAGGGTCAAT TATCAAAGGT TTTTGCATTA AAAATCAAAG AAGAGTTGGA GGCAGAAGGA
AAAGAAGTTC AATTATTTGC ATTGACAGAC ATAACAAATG ACAGATTAAG AAATCTAAAA
GGAATTGATG CTTTTATTCA GGTTGCATGT CCTAGAATTT CTACAGACAA TCAGTTTGAC
AAGCCTGTTT TATCATCACC TCAGGCTAAT GCATTGTTAA AAATTCTTAG AAATGAAAGT
ATTGAGGGGT ATTTAGAAAT CCCACATTGG TTATGA
 
Protein sequence
MIVIDEKRIF QEIEEKNPAS VSLNGPDGIL PQVQETAKNI TKKFGIPAYV LADTTWGTCD 
LNSNGSKVLG AEIQFNIGHT INTETYEKNL ILIDAYDDVE FDSVAKKCAE LLKGKVISLV
TDSQHLHQVD KVEKILTENG ITVKIGKGKG QLNDGQVFGC EFYPATDLKK EVDAYVFLGQ
SNFHASGIAL STNLPTYVLD PYFNEVREVT DFAQKLKKKA TLAIYKAAEA KTFGVIVGLK
EGQLSKVFAL KIKEELEAEG KEVQLFALTD ITNDRLRNLK GIDAFIQVAC PRISTDNQFD
KPVLSSPQAN ALLKILRNES IEGYLEIPHW L