Gene Pars_1339 details

Gene Information

Locus tag: Pars_1339
Symbol:
ID: 5054156
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1205088
End bp: 1206095
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 57%
IMG OID: 640468885
Product: hypothetical protein
Protein accession: YP_001153554
Protein GI: 145591552
COG category: [S] Function unknown
COG ID: [COG1817] Uncharacterized protein conserved in archaea
TIGRFAM ID:
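
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that Gene Length counts the terminal stop codon; both assumptions fit the values in this record but are not stated by the source. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(1205088, 1206095)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the reported Gene Length (1008 bp) and Protein Length (335 aa).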


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00496321
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.353078
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTATTGGC GCTTCATGGT TAGGTTTCTC TCCGACGCGT TGACGCCGAA GCAGGCACGT 
ATCGCCGCGT TGCTCAAGCT TGAGGGGGCC AAGCGCGGCG TTGAGGTGGA GATAACGTGC
CGCCACTACA TGCATGTTTC AGACATACTC GACATGTACG GCGTCTCTTA TAGATGTTTT
GGACAATACG GCCTCACTGT ATACGAAAAG CTTGTGTACG GCATCGAGAG ACAGAGGGAG
TTGGCCGAGG TGGCGAGGCA GGTAGACGGA ATGCTGGGCT TCCCATCCCC AGACGCGGCG
AGGGTGGTGT TTGGGCTGGG AAAGCCCGTG TTGGTGCTCA ACGACACCCC CCACGCAACT
CACGTAAATA GGCTAGTCAT ACCGCTTTCG GAAGCTCTCG TAGCACCCGC GGCCATCCCC
GAGGAGATGT GGCGCCCCTA CTGCCCCAGG AAAGTTGTCA CTTTCGACGG GGTATTCGAG
TATATGTGGA CGTCGAGGTT TAAACCTGAT GAGTCTGTGG TGAAGAGCCT CGGCTTGGAG
CCAGGCGGAT ACGTGGTTTT TAGGCCGGAG GAGAGGTATG CGGCGTATTA CAAGTGGGAA
TACACAGAGC TTCGCATAAA GCTGGCTAGG GCTGTGGAGG GCCTTGGTTA CAATGTAGTT
AACGTGCCGC GCTATCCGGA CCAGGTGCTG GAGGGGGCCA TCAACTTGAC TAGGGCTGTG
GATCACTTGC AACTGGCATA CTTCTCGGCG GGGGTTATAA CTGGGGGCGC CTCGATGGCC
ACAGAAGCTG CGCTTCTAGG CGTGCCTGCG TTGTCCTATT TCCCCCAGAG CTACTACGTA
GATCGTTATC TTGCAGAGAA GGGAGCCCCG CTTTACCGGT GCGACAGCTT AGAGACTTGC
CTCTCGAGTC TCAGAGAGAT GTTGCGCCGC GGCAGGTCTG CGCCAGTAAG GCTTGAAGAC
CCCGCCGGGA TTATTTTCGA TGCGGCACTA AGCGCTGTTT CAAGATAA
 
Protein sequence
MYWRFMVRFL SDALTPKQAR IAALLKLEGA KRGVEVEITC RHYMHVSDIL DMYGVSYRCF 
GQYGLTVYEK LVYGIERQRE LAEVARQVDG MLGFPSPDAA RVVFGLGKPV LVLNDTPHAT
HVNRLVIPLS EALVAPAAIP EEMWRPYCPR KVVTFDGVFE YMWTSRFKPD ESVVKSLGLE
PGGYVVFRPE ERYAAYYKWE YTELRIKLAR AVEGLGYNVV NVPRYPDQVL EGAINLTRAV
DHLQLAYFSA GVITGGASMA TEAALLGVPA LSYFPQSYYV DRYLAEKGAP LYRCDSLETC
LSSLREMLRR GRSAPVRLED PAGIIFDAAL SAVSR
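
The sequences above are printed in blocks of ten characters across several lines, so they need to be joined before any length or composition check. The following is a minimal sketch of that clean-up plus a simple GC fraction; the helper names are hypothetical, and whether the database derives its GC content field in exactly this way is an assumption.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, copied verbatim for illustration.
first_line = "GTGTATTGGC GCTTCATGGT TAGGTTTCTC TCCGACGCGT TGACGCCGAA GCAGGCACGT"
dna = clean_sequence(first_line)
print(len(dna), round(gc_fraction(dna), 2))
```

Passing the full gene sequence to `clean_sequence` and `gc_fraction` gives values that can be compared with the Gene Length and GC content fields reported in the record.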