Gene Information |
Locus tag | Nmar_0543 |
Symbol | |
ID | 5773115 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 482942 |
End bp | 484270 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641316176 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001581877 |
Protein GI | 161528051 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
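The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative only and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    # Values taken from the record for Nmar_0543.
    length = gene_length(482942, 484270)
    print(length, protein_length(length))
```

With the values in this record, the two functions give 1329 bp and 442 aa, which agrees with the Gene Length and Protein Length fields.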
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.762964 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACTC AAATGACTGC AGCAAGACGC GGTGTTGCAA CTGATGAGAT GAAACAAGTT GCAAAAGATG AGGATGTTAC TCTTGATTGG TTAATCCCAA AAATTGCAAA GGGCTCTATA ATTATTCCAA GTAATAACTG TAGACCTCAA AAAATTCATA ATGTTGGAAT TGGTAAGGGT TTGAAAACCA AAGTAAACGT AAACATTGGA ACTTCTACAT TAAACGTAAA TCTAGAAGAA GAGATTGAAA AAGCAAAAGT TGCTGTAAAA TATCATGCAG ATACTATGAT GGATCTTAGT GATGGCGGTG ATGTAAAACA CATAAGAAAA ACTCTGTTAG AAACTGCTCC AATTACTTTT GGCACTGTTC CAATTTATGA AGCATACAAC TATGGTGTTG AAGTACACAA AAACCCATTG AATTTAACTG AAGATGATTA TCTAAACGCA TTTGAAAACA ATGCTAAAGA TGGCGTTGAT TATACTACAA TTCACTGTGG AATTACAAAA GACATTGCAA AAAGAATTCT AAAGGTTCAG AGATATGGTG GTGTTGTCAG TAAAGGAGGC ACCATAACTG CCGCATGGAT GTTAAAACAT GACAAAGAAA ATCCCTACTT GACTCATTAT GATTATCTTG TGGAGATGGC AAAAAAATAT GATGTGACTT TTAGCCTTGG AGATGCTCTT AGGCCAGGCT CAATTTTGGA CTCTCATGAT GAATTACAAG TTCAAGAAAT GATTAACATC TCTCAGCTAA CAAAACGTGC ACATGAACAA GATGTTCAAG TGATGGTTGA AGGTCCAGGC CATGTACCAT TAAACGAAGT TGCAGCAAAT GTTAGACTGG CAAAGTCTTT GATTGGAGAT GTTCCATATT ATGTTCTAGG ACCTTTAGTA ACAGATGTTG CATCTGGACA TGATCATATT GCAAGTGCAA TTGGTGCCGC TGTATCTGCA AGTGAAGGTG TTGATCTTTT GTGTTATCTT ACTCCTTCAG AACATCTTGC ATTACCAAAC GCTGAAGAAG TAAAGGCTGG ATTAATTGCA TATCGAATTG CAGCACATGC AGGTGATCTT GTAAAAATTC GTGATAAAGC GATCAAATGG GATATGGAGA TGACTGAAGC TCGACGTACA CTAGATTGGG AAAAACAACT TGCATTGTCT ATTGATCCTG AAGAAGCTGC TAAAATTCAC AGTAGAACAG GCCAACACCC TGGCAATAAT GTTCCTTGTA CTATGTGCGG AGGTGCATGT GTTTACATGA TGTTGCCTCA ACAAAAAAAA TACGAGAAAG AAAACGAAAA CCTACAACAA ATTGAATAA
|
Protein sequence | MATQMTAARR GVATDEMKQV AKDEDVTLDW LIPKIAKGSI IIPSNNCRPQ KIHNVGIGKG LKTKVNVNIG TSTLNVNLEE EIEKAKVAVK YHADTMMDLS DGGDVKHIRK TLLETAPITF GTVPIYEAYN YGVEVHKNPL NLTEDDYLNA FENNAKDGVD YTTIHCGITK DIAKRILKVQ RYGGVVSKGG TITAAWMLKH DKENPYLTHY DYLVEMAKKY DVTFSLGDAL RPGSILDSHD ELQVQEMINI SQLTKRAHEQ DVQVMVEGPG HVPLNEVAAN VRLAKSLIGD VPYYVLGPLV TDVASGHDHI ASAIGAAVSA SEGVDLLCYL TPSEHLALPN AEEVKAGLIA YRIAAHAGDL VKIRDKAIKW DMEMTEARRT LDWEKQLALS IDPEEAAKIH SRTGQHPGNN VPCTMCGGAC VYMMLPQQKK YEKENENLQQ IE
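The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the method the database used: it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and the function name `translate` and its `to_stop` parameter are hypothetical. Whitespace is stripped so that the 10-base blocks used in this record can be pasted directly.

```python
from itertools import product

# Codon-to-amino-acid map for the standard code, built in TCAG order.
# Table 11 shares these assignments; only its start-codon set differs.
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W"
          "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR"
          "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a coding sequence read in frame from its first base."""
    seq = "".join(dna.split()).upper()  # remove the spaces between blocks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate("ATGGCAACTC AAATGACTGC AGCAAGACGC"))
```

Applied to the first 30 bases of the gene sequence, the routine returns MATQMTAARR, the first block of the protein sequence; the final codons ATT GAA TAA give IE followed by the stop.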