Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1340 |
Symbol | |
ID | 5774208 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1227903 |
End bp | 1228823 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641316985 |
Product | transcription factor TFIIB cyclin-related protein |
Protein accession | YP_001582674 |
Protein GI | 161528848 |
COG category | [K] Transcription |
COG ID | [COG1405] Transcription initiation factor TFIIIB, Brf1 subunit/Transcription initiation factor TFIIB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.000744308 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCGTCGG GAAGGTTTTT TGATCATATC ATGAGTTCAC AAGAAATCAA ATGTCCTAGA TGTGGAAAAA ACGCGCTAGT CACTGATGTT GAATCGTCAG AAATATTTTG TTCAAATTGC GGTATTGTGG TTGAAGAGAA AACTAATGAC CGCAGACCCG AAAGAGCATT TGCAAATTCA ACTACCGATA AATCCCATAC TGGCGACAAG ACATCTTTGA CAAGACATGA CAGGGGACTT AGCACTGTGA TTAACCCCTT TGACAAAGAT TCTGCTGGAA GTCCCTTGTC TGCGTCAATG AAATCATCCA TGACACGACT CCGGAAATGG GATAATCGAA GTCGCATAAA GACCAATGAT GACAGAAATT TGCAACAAGC ACTGCTGGAA TTGTCAAAAA TGAAAGAAAA ATTGTCCTTG TCTGACGCAA TTGCTGAAAA AGCATCGTAC ATCTATAGAA AGGCACTGGA GAAAAAACTG GTCAAGGGGC GTTCTATTGC ATCTCTTGTT GCAGCTTGTC TTTATGCCGC ATGCCGTGAA TCAGAAGCTC CCAGGACACT CCGAGAAGTT GCAGCATCCA TAGGAATTAA ACGCAAAGAA ATCTCTGCAA CATACCGGCT CATATTCAAA GAGTTAGACC TTAAAATGCC CGTAATTGAC TCTGTCTCCT GTATTGCAAA AATTGCAAGC AATGCAGAAC TGTCTGAGAA AACAAAAAGA TATGCCATAA AAATTCTAAA AAAGGCAGAA AAACAAAACA TGTCTGCAGG GAAGCATCCT ATGGGACTGG CTGCCTCTGC ATTATACCTG GCGTCAATAG ATTTGGAGGA ATTTAGGACT CAAAAAGAAA TTGCAGATGC AGCGGGAATC ACAGAGGTTA CTGTCAGAAA CAGATGCAAA GGTCTCAAAC AAATGATCTA A
|
Protein sequence | MSSGRFFDHI MSSQEIKCPR CGKNALVTDV ESSEIFCSNC GIVVEEKTND RRPERAFANS TTDKSHTGDK TSLTRHDRGL STVINPFDKD SAGSPLSASM KSSMTRLRKW DNRSRIKTND DRNLQQALLE LSKMKEKLSL SDAIAEKASY IYRKALEKKL VKGRSIASLV AACLYAACRE SEAPRTLREV AASIGIKRKE ISATYRLIFK ELDLKMPVID SVSCIAKIAS NAELSEKTKR YAIKILKKAE KQNMSAGKHP MGLAASALYL ASIDLEEFRT QKEIADAAGI TEVTVRNRCK GLKQMI
|
| |