Gene Nmar_0406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0406 
Symbol 
ID5773471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp364893 
End bp365891 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content36% 
IMG OID641316035 
Productpseudouridylate synthase subunit TruB 
Protein accessionYP_001581740 
Protein GI161527914 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0130] Pseudouridine synthase 
TIGRFAM ID[TIGR00425] rRNA pseudouridine synthase, putative
[TIGR00431] tRNA pseudouridine 55 synthase
[TIGR00451] uncharacterized domain 2 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.189117 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCTAA AACAATTAGA GAATCTAATT GAAGTTGATC AAGATATTAC TGATGATGCA 
TATGGCACAT ACTATGATAA AAGAACAATA GAACAATTAC TAAACTATGG AATTATTTTA
CTAGACAAAC CTCCTGGACC TACAAGTCAT GAGACAGTAG CATGGACCAA AAGAATTTTG
AAATTACCAA AGATTGGACA TAGTGGAACG CTAGACCCAC AAGTTTCAGG AGTACTTCCT
TTAGGGTTGG GTGAGGCAAC AAAAGCTCTA GGTGTATTGC TGTTTGGACC TAAAGAATAC
CATGCACTAG GACGTGTCCA CTCGCTTCCA TCAAAAGAAA AACTACATGA GGTAATCGAA
TCATTAACTG GAGAAATTTA CCAAAAACCT CCACAACGTT CCGCAGTAGT TAGACAAACA
AGAACTAGAA CAATTTACGA ATTTGAAGTA TTAGAACAAA AAGAAAGACT GTTACTAACT
AGAGTCCTAT GTGAGGCTGG AACATACATT AGAAAATTAT ACTATGATCT AGGTGAAATT
CTAGGACCTG GTGCAACCAT GATTGAGCTT AGAAGAACTA GAGTTGATCA ATTCAGAGAA
ACTGATGGAT TGGTGACCCT CCATGAACTT GCAAACGCAT TTGCTTTATG GGAAGAAAAG
AAAGATGATT CTAAACTAAA GAGTATGATA CAGCCTGTAG AACATGCACT AAGTGAGTTA
AAATCAGTAG TAATTCGTGA TTCTGCAATT GATGCAATGT GCCATGGTGC ACAACTAGCA
ATTCCTGGAA TTTTACAAAT ATCTCCTAGT TTGAACAAAG GTGACATTGT TGGAATTTAC
ACACAAAAAG GAGAAGCAGT TGCATTAGCC GAAGCAACAA TGTCTGGTCA GGAAATTCAA
GATGCAGTAA AAGGATATGC ATTTGAAACA AAGAGAATTA TCATGGCTCC AAACACTTAT
CCTAAAAAAT GGAGAACAAA ACCATCTTCT AAAGAATAA
 
Protein sequence
MTLKQLENLI EVDQDITDDA YGTYYDKRTI EQLLNYGIIL LDKPPGPTSH ETVAWTKRIL 
KLPKIGHSGT LDPQVSGVLP LGLGEATKAL GVLLFGPKEY HALGRVHSLP SKEKLHEVIE
SLTGEIYQKP PQRSAVVRQT RTRTIYEFEV LEQKERLLLT RVLCEAGTYI RKLYYDLGEI
LGPGATMIEL RRTRVDQFRE TDGLVTLHEL ANAFALWEEK KDDSKLKSMI QPVEHALSEL
KSVVIRDSAI DAMCHGAQLA IPGILQISPS LNKGDIVGIY TQKGEAVALA EATMSGQEIQ
DAVKGYAFET KRIIMAPNTY PKKWRTKPSS KE