Gene Nmar_0543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0543 
Symbol 
ID5773115 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp482942 
End bp484270 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content37% 
IMG OID641316176 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001581877 
Protein GI161528051 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.762964 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACTC AAATGACTGC AGCAAGACGC GGTGTTGCAA CTGATGAGAT GAAACAAGTT 
GCAAAAGATG AGGATGTTAC TCTTGATTGG TTAATCCCAA AAATTGCAAA GGGCTCTATA
ATTATTCCAA GTAATAACTG TAGACCTCAA AAAATTCATA ATGTTGGAAT TGGTAAGGGT
TTGAAAACCA AAGTAAACGT AAACATTGGA ACTTCTACAT TAAACGTAAA TCTAGAAGAA
GAGATTGAAA AAGCAAAAGT TGCTGTAAAA TATCATGCAG ATACTATGAT GGATCTTAGT
GATGGCGGTG ATGTAAAACA CATAAGAAAA ACTCTGTTAG AAACTGCTCC AATTACTTTT
GGCACTGTTC CAATTTATGA AGCATACAAC TATGGTGTTG AAGTACACAA AAACCCATTG
AATTTAACTG AAGATGATTA TCTAAACGCA TTTGAAAACA ATGCTAAAGA TGGCGTTGAT
TATACTACAA TTCACTGTGG AATTACAAAA GACATTGCAA AAAGAATTCT AAAGGTTCAG
AGATATGGTG GTGTTGTCAG TAAAGGAGGC ACCATAACTG CCGCATGGAT GTTAAAACAT
GACAAAGAAA ATCCCTACTT GACTCATTAT GATTATCTTG TGGAGATGGC AAAAAAATAT
GATGTGACTT TTAGCCTTGG AGATGCTCTT AGGCCAGGCT CAATTTTGGA CTCTCATGAT
GAATTACAAG TTCAAGAAAT GATTAACATC TCTCAGCTAA CAAAACGTGC ACATGAACAA
GATGTTCAAG TGATGGTTGA AGGTCCAGGC CATGTACCAT TAAACGAAGT TGCAGCAAAT
GTTAGACTGG CAAAGTCTTT GATTGGAGAT GTTCCATATT ATGTTCTAGG ACCTTTAGTA
ACAGATGTTG CATCTGGACA TGATCATATT GCAAGTGCAA TTGGTGCCGC TGTATCTGCA
AGTGAAGGTG TTGATCTTTT GTGTTATCTT ACTCCTTCAG AACATCTTGC ATTACCAAAC
GCTGAAGAAG TAAAGGCTGG ATTAATTGCA TATCGAATTG CAGCACATGC AGGTGATCTT
GTAAAAATTC GTGATAAAGC GATCAAATGG GATATGGAGA TGACTGAAGC TCGACGTACA
CTAGATTGGG AAAAACAACT TGCATTGTCT ATTGATCCTG AAGAAGCTGC TAAAATTCAC
AGTAGAACAG GCCAACACCC TGGCAATAAT GTTCCTTGTA CTATGTGCGG AGGTGCATGT
GTTTACATGA TGTTGCCTCA ACAAAAAAAA TACGAGAAAG AAAACGAAAA CCTACAACAA
ATTGAATAA
 
Protein sequence
MATQMTAARR GVATDEMKQV AKDEDVTLDW LIPKIAKGSI IIPSNNCRPQ KIHNVGIGKG 
LKTKVNVNIG TSTLNVNLEE EIEKAKVAVK YHADTMMDLS DGGDVKHIRK TLLETAPITF
GTVPIYEAYN YGVEVHKNPL NLTEDDYLNA FENNAKDGVD YTTIHCGITK DIAKRILKVQ
RYGGVVSKGG TITAAWMLKH DKENPYLTHY DYLVEMAKKY DVTFSLGDAL RPGSILDSHD
ELQVQEMINI SQLTKRAHEQ DVQVMVEGPG HVPLNEVAAN VRLAKSLIGD VPYYVLGPLV
TDVASGHDHI ASAIGAAVSA SEGVDLLCYL TPSEHLALPN AEEVKAGLIA YRIAAHAGDL
VKIRDKAIKW DMEMTEARRT LDWEKQLALS IDPEEAAKIH SRTGQHPGNN VPCTMCGGAC
VYMMLPQQKK YEKENENLQQ IE