Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0517 |
Symbol | |
ID | 5774068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 463382 |
End bp | 464338 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641316150 |
Product | zinc finger TFIIB-type domain-containing protein |
Protein accession | YP_001581851 |
Protein GI | 161528025 |
COG category | [K] Transcription |
COG ID | [COG1405] Transcription initiation factor TFIIIB, Brf1 subunit/Transcription initiation factor TFIIB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCAC TAGAAGAACA AAACTGTCCT GAATGTCAAG CAACAGTAGT TAATGACATG CAGAATGGTG AATTAATCTG CTCTGGTTGC GGTGTAGTAG TCGCAGATCA AGTAGCAGAT TATGGTCCAG AAACAATTAG TTCGAATCTT GAAGACAAGA TGAAATTAGC AAGAGCAACC GGACAAACAA CATATTCCCA ACACGATTTG GGAATTACGA CTGAGATAGC AATTAGTACA AAGGACTTTA GTGGAAAAAC AATCAACCAC GAAGTTGCAA ACCAAATGCA CAATCTCAGA AAGTGGCAAC AAAGAGTAAG AGTATCCTCA CCAAAAGAGA GACGACTTGC AAATGTTTTA ACAAAAATGG GAGAAACATG TGACGGTCTA GGTCTTTCAA AAAATGTGTT GGAGACTGCT TCTATGATAT ACAGAAACTT GGACGGACAT GTTGATGTGA AGGGAAAATC AGTAGTAAGC ATTACAGCAG CTACAATTTA CATGGCATGC AAACAATGTG ATGTAGTAAG ATCACTAGAA GAAATTATTC GTGGAATTTG TCCACCAAAA GACGTAAAAT CAAAAACAAA ACTTGCAGCA AGATACTACA GAACCATGGT TATGGAAATG GGACAACTAA CTGCTCCAGT AGTAACTATG GACAAATACA TCTCAAAGAT AGCAAACATG ACACAAACTG AGGTTAGAGT AGAGAGACTA GCCTTGGAAA TTGCAGAAAA AACAAAAGAC AGTAGTATTG CTGATGGAAA GGCTCCAAAT GGAATTGCTG CAGCATACCT GTATGTGTCA TCAGTCCTGC TTGGTCAAAA CGTACTCCAA AGAGACGTTT CAAGCATTGC AGGAGTAACT GAAGTTACTA TCAGAAATAG ATGTAAAGAG ATTCTAACAA ATTACAAACT CAAAATTACT TTGAGACCAT CTCTGGCCAA ATATTAA
|
Protein sequence | MNALEEQNCP ECQATVVNDM QNGELICSGC GVVVADQVAD YGPETISSNL EDKMKLARAT GQTTYSQHDL GITTEIAIST KDFSGKTINH EVANQMHNLR KWQQRVRVSS PKERRLANVL TKMGETCDGL GLSKNVLETA SMIYRNLDGH VDVKGKSVVS ITAATIYMAC KQCDVVRSLE EIIRGICPPK DVKSKTKLAA RYYRTMVMEM GQLTAPVVTM DKYISKIANM TQTEVRVERL ALEIAEKTKD SSIADGKAPN GIAAAYLYVS SVLLGQNVLQ RDVSSIAGVT EVTIRNRCKE ILTNYKLKIT LRPSLAKY
|
| |