Gene Pars_0026 details


Gene Information

Locus tag: Pars_0026
Symbol:
ID: 5055222
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 19392
End bp: 20507
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 55%
IMG OID: 640467606
Product: hypothetical protein
Protein accession: YP_001152295
Protein GI: 145590293
COG category: [S] Function unknown
COG ID: [COG1415] Uncharacterized conserved protein
TIGRFAM ID:
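
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS is complete, unspliced, and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 19392
END_BP = 20507
GENE_LENGTH_BP = 1116
PROTEIN_LENGTH_AA = 371


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for the values listed in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed values agree: 20507 - 19392 + 1 = 1116 bp, and 1116 / 3 - 1 = 371 aa.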


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGACTTG CCGGAACGGC GGATTTGCCT CTTCACGACG GTACAGTGCC GTACTGGTTG 
CTTTCCAGGA TGAAGAAGCT GGCATCGCTT GTATTAACCA TTATGCACGA TATATACGGC
CCAGACGGCA TTGTGGAGAG GTTTGCCCAT CCTGTATTTT TCCAGGCGTT TAATGATCTC
ATCGGGATGG ATTGGGACAG CTCCGGCAGT ACTACCGTCA CTACTGCCGT TGTTAAGGAG
GCGTTGTCCA AGTCCGAGAT CCCGGTTAGA GTGGCAGGGG GCAAGGGGAG GCAGGCCTTA
AACGCGCCGA ACGAACTGGC GGAGATATCC CGGCAATTTA ATCTAGACGC CGAGAGGCTA
ATCGCCACTT CTAGGCTAGT GGCGAAAGTG GACAACGTAT TGGTGCAAGA CGGCTACGAC
TTGTACCACC ACGCCTTCTT CGTCTCCTCC ACCGGCAGGT GGGCGGTGGT ACAACAAGGC
TTAAATCCAG AAGTTAGGAT GGCTCGTCGG TACCACTGGC TCGCTACTGA AAACTACTTT
GACAGCCCCC ACACCGGCGT GGTGGGCGTA AAGCGCAATA GAGTGCTCAA CTTGGCGTCG
GCAAAGAGCA GAGAGAATAG GTCCGTCATC CTAGAACTCG TAAACGAGGG GGCAACGAAG
GTGGCGAAAT ACCTCGCGCT TCTACGCGGA CAAGCGACTC TCTTCGACGT ACCAAGATAC
CACCCCTATA CGAAAATCGA CATAGAAGTA AGGACAGTTG TGAAGAATTT GCCCCCGCCC
AAGTCGGTAA CCGACTTCAA GGAGCTTCTT CTGCAATACC GCGTGGGACC TAAAACCCTC
CGGGCCCTTT CGCTGGTGGC GGAGCTTGTG TTTAAGACCC CCGCCGACTG GAACGACCCG
GCAACGGACC CATTCAAATT CGCCTTCGCA GTAGGCGGAA AGGACGGCAT ACCCTACCCC
GTTGATAGAA GGACATACGA CGAGCTCATA GCTATACTCG ACGTCGTGGT GGACAAGGCA
AGGAGTGATC CAGGCCTCTA CCGCTACCTT TCTCACCTAG CCAAGAAGGC CGAGGCGTGG
AGATACCCCC AAGACAAGAA AAAGCCGACG CTTTAA
 
Protein sequence
MRLAGTADLP LHDGTVPYWL LSRMKKLASL VLTIMHDIYG PDGIVERFAH PVFFQAFNDL 
IGMDWDSSGS TTVTTAVVKE ALSKSEIPVR VAGGKGRQAL NAPNELAEIS RQFNLDAERL
IATSRLVAKV DNVLVQDGYD LYHHAFFVSS TGRWAVVQQG LNPEVRMARR YHWLATENYF
DSPHTGVVGV KRNRVLNLAS AKSRENRSVI LELVNEGATK VAKYLALLRG QATLFDVPRY
HPYTKIDIEV RTVVKNLPPP KSVTDFKELL LQYRVGPKTL RALSLVAELV FKTPADWNDP
ATDPFKFAFA VGGKDGIPYP VDRRTYDELI AILDVVVDKA RSDPGLYRYL SHLAKKAEAW
RYPQDKKKPT L
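
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The sketch below shows one way to strip the layout, compute a GC fraction, and translate a coding sequence. It makes several assumptions: the codon assignments are those of the standard genetic code (which translation table 11 shares), the first codon is always reported as M because table 11 permits alternative initiators such as GTG, and the input contains only A, C, G, and T. All function names are illustrative.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(text: str) -> str:
    """Translate a CDS up to the first stop codon.

    Simplification: the first codon is reported as M whatever it
    encodes internally, on the assumption that it is a valid initiator.
    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = clean_sequence(text)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append("M" if pos == 0 else amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "GTGAGACTTG CCGGAACGGC GGATTTGCCT"
print(translate(first_blocks))  # MRLAGTADLP
```

The ten residues produced from the first thirty bases match the start of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of this record.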