Gene Pars_1976 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1976 
Symboltfb 
ID5054452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1770456 
End bp1771457 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content54% 
IMG OID640469523 
Producttranscription initiation factor IIB 
Protein accessionYP_001154175 
Protein GI145592173 
COG category[K] Transcription 
COG ID[COG1405] Transcription initiation factor TFIIIB, Brf1 subunit/Transcription initiation factor TFIIB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.525385 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAGTA CAAGCCTACC TTCTTCCGGC AAGCCCCTAA AGCTCCGCAT AAATCGAGAT 
AGTGAGGGGT ATTTAAGTCT TGTTACCGAG TCGGGTGAGA TCTACCGCTG TCCCATATGC
GGCAATGACA GATTTGTCTA CAACTACGAG AGGGGTGAAG TTGTCTGTAT AGTGTGCGGT
GCAGTTGTGC AGGAACAGCT ACTTGACCTC GGCCCAGAGT GGAGGGCTTT CACATCAGAG
GAGAAGGGCC AGAGGGCGCG CACTGGCGCG CCGCTTACTA GGCTCATCTC TGAGGCGTTG
ACCACAGTTA TCGATTGGCG AGACAAGGAC GTCTCCGGTA GGGAGCTGGA CATAAAGAGG
AAGTTGGAGG TAATAAGGCT GAGGAAGTGG CAGACCAGGG CCCGTGTGCA GACCTCCTAC
GAGAGGAACT TTATACAAGC GGCGCAGGAG CTAGAGAGAT TAAAGAGCTC CATGGGCGTG
CCAAGGCCGT GCGTCGAGCA AGCCCTCGAG ATATACAGGC AGGCACTTGA AAAAGAGCTG
GTGAGGGGCA GATCTGTCGA GGCGATGGCC GCGGCGGCGC TCTACATGGC GTGCCGCATG
ATGAGGATGC CGAGACCACT GGACGAACTC GTGAGGTACA CAAAGGCATC TAGAAGAGAA
GTGGCGAGGT GCTACAGGTT GTTGCTAAGA GAGCTGAACG TAAAGGTGCC TATAAGCGAC
CCTGTACTCT ACATTTCCAG AATAGCAGAG CAACTGAAGC TCAGCGGCGA AGTTGTAAAG
GCGGCAATCG ACATTCTGCA GAGGGCTAAA AAGGCCGGCA TCACGGCGGG GAAGGACCCA
GCGGGTTTAG CCGCTGCCGC GGTTTATATA GCCTCGCTGA TGCATGGTGA TAACAGGACT
CAGAAGGACT TCGCTGTGGC GGCCGGCGTG ACGGAGGTTA CTGTGAGAAA TAGGTACAAG
GAACTGGCAA AGGCGCTTAA TATAAAGGTC CCTGTAAAGT AA
 
Protein sequence
MSSTSLPSSG KPLKLRINRD SEGYLSLVTE SGEIYRCPIC GNDRFVYNYE RGEVVCIVCG 
AVVQEQLLDL GPEWRAFTSE EKGQRARTGA PLTRLISEAL TTVIDWRDKD VSGRELDIKR
KLEVIRLRKW QTRARVQTSY ERNFIQAAQE LERLKSSMGV PRPCVEQALE IYRQALEKEL
VRGRSVEAMA AAALYMACRM MRMPRPLDEL VRYTKASRRE VARCYRLLLR ELNVKVPISD
PVLYISRIAE QLKLSGEVVK AAIDILQRAK KAGITAGKDP AGLAAAAVYI ASLMHGDNRT
QKDFAVAAGV TEVTVRNRYK ELAKALNIKV PVK