Gene Nmar_1198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1198 
Symbol 
ID5773259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1092421 
End bp1093617 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content30% 
IMG OID641316842 
ProducttRNA pseudouridine synthase TruD 
Protein accessionYP_001582532 
Protein GI161528706 
COG category[S] Function unknown 
COG ID[COG0585] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00094] tRNA pseudouridine synthase, TruD family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATACCTG ATTTAGATTC TAAAATAGGA ATTTCTGTTT ATAGTACAAA ATTTGACGGA 
ATTGGGGGAA AAATTCGTAC TACTCCTGAA GACTTTGAAG TCTCTGAAAA AATTTTAGAA
AAAACTCTGA ATTCAATTAA TCAAGAAGAA GGATATGCTG TTTACAAATT AAAAAAGAAA
AGAATTGATA CAAATCATGC ACTATCTGAT ATTTTTAGAA AGAAAGGTCT TAGACTAAAG
TCTCTTGGAT TAAAGGATGC ATCTGCAATC ACTGAACAGT TTGTTTGTTC TGGAAATAAA
GGTAAATCAA TTGAAGATTA TTCAACTGAA AAATATTCTT TGAAAAAAAT TGGCTTTGTA
AAAAAACCTC TTTCAAAAAA AGATATGATT GGGAATCATT TTAAAATCAA AATCTCTGAT
TGTACCAATA AACTATCTTC ATTTGAAGAG TTTAACAATG TCTTGAATTT CTATGGGTAT
CAAAGATTTG GCTCAAAGAG GCCTGTAACT CATTTGATAG GTAAAGCAAT ACTTCAAAGA
GATTTTGATA AAGCAGTTGA ACTTGTTTTG TCTTTTACTT CTGATTATGA TTCAAAAGAA
AATAATGAAA TTCGACAAAA ACTTGTTGAT AAACAAAACT ACAATCAATA TTTTGAGAAA
ATACCAAAAC AAATGGATAT TGAAAGAATT GTCCTAAAAG AAATGATTGA ACATGATGAT
GCATTTCGTG CAATACGTGC AATTCCCGTA TCTTTGAGAA GATTTTACAT TCAGGCATAC
CAGTCGTTTA TTTTCAATAA ATCTCTGAGT ACTGCATTTA CTGATGGTGA AAATATGTTT
GAATCTGAAT CTGGTGATGT TTGTTTTGAT TTTAATGGAA TCATTGGAAA ATTTGTAAAG
GGATTAGATC AAAAATTGGC TATACCTTTT GTTGGATATT CATATTACAA AAAAACAAGA
TTTGATTATC ATATATCACA AATAATGCAG CAAGAAGAAA TAACTCCCAA AGACTTCTTT
ATCAAAGAGA TGCAAGAAGT AAGCAGTGAG GGAGGATTTC GACAAGCTGC AATAGATTGT
TCTGATTACT CGTCTCGTGA TGATGTTGTA GAATTTACTT TGTCAAGGGG ATCTTTTGCA
ACAATTTTGT TGAGAGAGAT TATGAAACCA TCTGATCCTA TTTATGCTGG TTTTTGA
 
Protein sequence
MIPDLDSKIG ISVYSTKFDG IGGKIRTTPE DFEVSEKILE KTLNSINQEE GYAVYKLKKK 
RIDTNHALSD IFRKKGLRLK SLGLKDASAI TEQFVCSGNK GKSIEDYSTE KYSLKKIGFV
KKPLSKKDMI GNHFKIKISD CTNKLSSFEE FNNVLNFYGY QRFGSKRPVT HLIGKAILQR
DFDKAVELVL SFTSDYDSKE NNEIRQKLVD KQNYNQYFEK IPKQMDIERI VLKEMIEHDD
AFRAIRAIPV SLRRFYIQAY QSFIFNKSLS TAFTDGENMF ESESGDVCFD FNGIIGKFVK
GLDQKLAIPF VGYSYYKKTR FDYHISQIMQ QEEITPKDFF IKEMQEVSSE GGFRQAAIDC
SDYSSRDDVV EFTLSRGSFA TILLREIMKP SDPIYAGF