Gene Information |
Locus tag | Pars_1976 |
Symbol | tfb |
ID | 5054452 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1770456 |
End bp | 1771457 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640469523 |
Product | transcription initiation factor IIB |
Protein accession | YP_001154175 |
Protein GI | 145592173 |
COG category | [K] Transcription |
COG ID | [COG1405] Transcription initiation factor TFIIIB, Brf1 subunit/Transcription initiation factor TFIIB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.525385 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAGTA CAAGCCTACC TTCTTCCGGC AAGCCCCTAA AGCTCCGCAT AAATCGAGAT AGTGAGGGGT ATTTAAGTCT TGTTACCGAG TCGGGTGAGA TCTACCGCTG TCCCATATGC GGCAATGACA GATTTGTCTA CAACTACGAG AGGGGTGAAG TTGTCTGTAT AGTGTGCGGT GCAGTTGTGC AGGAACAGCT ACTTGACCTC GGCCCAGAGT GGAGGGCTTT CACATCAGAG GAGAAGGGCC AGAGGGCGCG CACTGGCGCG CCGCTTACTA GGCTCATCTC TGAGGCGTTG ACCACAGTTA TCGATTGGCG AGACAAGGAC GTCTCCGGTA GGGAGCTGGA CATAAAGAGG AAGTTGGAGG TAATAAGGCT GAGGAAGTGG CAGACCAGGG CCCGTGTGCA GACCTCCTAC GAGAGGAACT TTATACAAGC GGCGCAGGAG CTAGAGAGAT TAAAGAGCTC CATGGGCGTG CCAAGGCCGT GCGTCGAGCA AGCCCTCGAG ATATACAGGC AGGCACTTGA AAAAGAGCTG GTGAGGGGCA GATCTGTCGA GGCGATGGCC GCGGCGGCGC TCTACATGGC GTGCCGCATG ATGAGGATGC CGAGACCACT GGACGAACTC GTGAGGTACA CAAAGGCATC TAGAAGAGAA GTGGCGAGGT GCTACAGGTT GTTGCTAAGA GAGCTGAACG TAAAGGTGCC TATAAGCGAC CCTGTACTCT ACATTTCCAG AATAGCAGAG CAACTGAAGC TCAGCGGCGA AGTTGTAAAG GCGGCAATCG ACATTCTGCA GAGGGCTAAA AAGGCCGGCA TCACGGCGGG GAAGGACCCA GCGGGTTTAG CCGCTGCCGC GGTTTATATA GCCTCGCTGA TGCATGGTGA TAACAGGACT CAGAAGGACT TCGCTGTGGC GGCCGGCGTG ACGGAGGTTA CTGTGAGAAA TAGGTACAAG GAACTGGCAA AGGCGCTTAA TATAAAGGTC CCTGTAAAGT AA
|
Protein sequence | MSSTSLPSSG KPLKLRINRD SEGYLSLVTE SGEIYRCPIC GNDRFVYNYE RGEVVCIVCG AVVQEQLLDL GPEWRAFTSE EKGQRARTGA PLTRLISEAL TTVIDWRDKD VSGRELDIKR KLEVIRLRKW QTRARVQTSY ERNFIQAAQE LERLKSSMGV PRPCVEQALE IYRQALEKEL VRGRSVEAMA AAALYMACRM MRMPRPLDEL VRYTKASRRE VARCYRLLLR ELNVKVPISD PVLYISRIAE QLKLSGEVVK AAIDILQRAK KAGITAGKDP AGLAAAAVYI ASLMHGDNRT QKDFAVAAGV TEVTVRNRYK ELAKALNIKV PVK
|
| |
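The length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the source database: the function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical, and it assumes that coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon (the gene sequence above ends in TAA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC content as a percentage; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp fields of this record.
    length = gene_length(1770456, 1771457)
    print(length)                  # 1002, matching the Gene Length field
    print(protein_length(length))  # 333, matching the Protein Length field
```

Passing the full Gene sequence field to `gc_percent` would give a value to compare against the GC content field; the result is not quoted here because it was not computed as part of this record.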