Gene Pars_2236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2236 
Symbol 
ID5055833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp2003681 
End bp2005033 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content55% 
IMG OID640469789 
ProductTIP49-like protein 
Protein accessionYP_001154434 
Protein GI145592432 
COG category[K] Transcription 
COG ID[COG1224] DNA helicase TIP49, TBP-interacting protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.385068 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTCTA TTAAAATTGA AGAGGTAAAC ACCTCCCTGG AGAGGTTTGC GGCCCACAGC 
CACATTAAGG GGCTGGGAGT TAGAGATGGG AAAGTCCAAT TTGTGGCTGA CGGCTTTGTG
GGGCAGACCG AGGCGCGGGA GGCCGCTTAC ATAATTGTCC AGATGATTAA GGAGGGAAAG
TTCGCAGGGA GGGGTGTACT TATCGTCGGC CCCCCGGGCA CGGGGAAGAC GGCGCTGGCT
TTGGGAATAG CCCGGGAGCT GGGCCCGGAG ACGCCCTTTG TGGCGATCTC CGGCGGCGAG
ATATACTCGC TTGAGGTGAA GAAGTCGGAG TTTTTGATGA GGGCTCTGAG GAGGGCCATA
GGGATTAGGA TTAGGGAGTG GAGGAAGGTA TATGAGGGAG AGCTGAGGTC TATTGATATT
AGATATGGGC GCCACCCCTA CAACCCATAC CTGCAGAGGG TGTTGGGGGC CACCATCAAG
CTGAGGACTC GGGACGAGGA GAAGACGCTG AGGGTGCCGG CTGAGATTGC GCAACAGCTA
ATTGAGCTCG GCGTGGAGGA GGGGGACGTG ATTATGATAG ACGAGGAGAC AGGCGCTGTG
TCGCTGGTGG GCAGAGGCGA GGGCGGCGAG CAATACGACG TGGGTAGGAG GAGGATCGAG
CTCCCCAAGG GGCCCGTCTA CAAGGAGAAG GAGATAGTCA GGTTCTACAC GCTTCACGAT
GTGGACATGT CCCTGGCCAG GCAGAGGGGG CTGATCTCGG CCATGCTGTT CGGCTTTGCC
GAGGAGGTAA AGGAGATCCC AGAAGAGATT AGGAGGCAGA GCGACGAAAT AGTCAAGAAA
GTACTAGAGG AAGGCAAGGC GGAGCTGGTG CCAGGAGTGT TGTTCATCGA CGACGTCCAC
CTCCTCGATA TAGAGAGCTT CTCATTCTTA ATGAGGGCTA TGGAGACGGA GTTTGCCCCC
ATCATAATTA TGGCCACCAA TAGGGGGATT GCAAGGATTA GGGGTACTGA CATAGAGGCG
CCGCACGGAA TCCCCCAGGA CATGCTGGAT AGACTCGTCA TTATTCGTAC TCGGCCCTAT
ACGGCTGAGG AGATACGCGA GATTATCTCC ATAAAGGCGA ATGAGCAGAA GGTACCGCTG
ACCAAAGAGG CCCTTGATCT CCTCACATCA ATAGGCGTAG ACCACTCGCT GAGGTACGCC
CTCCAGTTGT TGACGCCGGC TTATATAGTC GCAAAAGAAC GCGGCAAGGG GTCTGTGGGC
AGAGAGGAGA TAGAAGAGGT GAGGAGGCAT TTCGTTTCGG TGAAGGAGTC CGTGGAGTAC
GTGAAGTCGC TGGAGGAGAA GTTTTTAAGA TAG
 
Protein sequence
MSSIKIEEVN TSLERFAAHS HIKGLGVRDG KVQFVADGFV GQTEAREAAY IIVQMIKEGK 
FAGRGVLIVG PPGTGKTALA LGIARELGPE TPFVAISGGE IYSLEVKKSE FLMRALRRAI
GIRIREWRKV YEGELRSIDI RYGRHPYNPY LQRVLGATIK LRTRDEEKTL RVPAEIAQQL
IELGVEEGDV IMIDEETGAV SLVGRGEGGE QYDVGRRRIE LPKGPVYKEK EIVRFYTLHD
VDMSLARQRG LISAMLFGFA EEVKEIPEEI RRQSDEIVKK VLEEGKAELV PGVLFIDDVH
LLDIESFSFL MRAMETEFAP IIIMATNRGI ARIRGTDIEA PHGIPQDMLD RLVIIRTRPY
TAEEIREIIS IKANEQKVPL TKEALDLLTS IGVDHSLRYA LQLLTPAYIV AKERGKGSVG
REEIEEVRRH FVSVKESVEY VKSLEEKFLR