Gene Pars_2198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2198 
Symbol 
ID5054862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1969749 
End bp1970750 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content59% 
IMG OID640469750 
Productband 7 protein 
Protein accessionYP_001154396 
Protein GI145592394 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0330] Membrane protease subunits, stomatin/prohibitin homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTATA TACCGGTCCA GGTACGGGCT AGTCGAGTGC CTCGGAGGGC CGCTACGCTG 
TTCGTGGCCT TATTCCTAGC GCTAGTCATA GCCGCGGTGG TCGCCGCGCT TTCTGTATAC
AGCCTACCTG CCGGCGTTGT TGCCGTCGTG GTGGACCCAG TTTCTGGCAC CATCAGCAAG
CCTGTTCTCG GACCTGCGCT GGGGGTGAAA GCGCCCTGGG CTTACCTAAT TGAGGACACC
TACGCCATAG AGATTTTAGA GTTCGCCCAA AAGGAGAAGG CCACGGGGAA GTGGGTCTTC
TCGGCGCCGG AGGTGTTGAC TAAGGATGGG GTCGTTGTCA CGGTGGAGAT GGTGGTGCGC
TACAGAATAG TGCCGGAGCG TTTCGATGAG CTTATAAAGA GGTTTCCCCA AGTGGACTAC
GACGACAAGG TGTTGGTTCC GAAGGCCAGG CAACTTATAC GCGACATAAT TTCGAAGGTC
ACTCTGGACG AATTAATAGC GAGCCGCGAC GTAATTGCAA AGCAGATCGA AGAGACGTAC
AAAACCGCCG TGGAGAACGA CCCCGCTGTG GCGGGGCTTG TGGCAATCCT CGATGTCAAT
GTCCAGAACT TCGTCCTCCC GCAACAGATT ACAGACGCCA TAAACCGCAA AGTGGCGGCT
CAGCAAGACG CCATCCGCGC CCAGTTCGAG AGGCAGAGGG TTGAGGAGCT CGCCCGCGCC
AACTTCACCA GGACGGTCCT CGCCGCCATG GCTGAGGCTA ACGCCACGAT AACGAGGGCG
AGGGCCCAGG CCATGCAAGT CATGTTGGTG GCCAACGCTA CTAGGACCGC CATTGAGATG
ATTATAAGAG CCGCCGGCGC CAACGCCACA GAGGCGGCTA GGCTGGCCGA GCTGTACATA
TACCTAGCCG GGCTGAGGGA GGTGGCGCAG ACGGGCAACG TCCAGATTGT GGCCGTCTCG
GGAGGGGGGC AGGTGGTGCC GGTAATTCCG CTGGCCCGAT GA
 
Protein sequence
MSYIPVQVRA SRVPRRAATL FVALFLALVI AAVVAALSVY SLPAGVVAVV VDPVSGTISK 
PVLGPALGVK APWAYLIEDT YAIEILEFAQ KEKATGKWVF SAPEVLTKDG VVVTVEMVVR
YRIVPERFDE LIKRFPQVDY DDKVLVPKAR QLIRDIISKV TLDELIASRD VIAKQIEETY
KTAVENDPAV AGLVAILDVN VQNFVLPQQI TDAINRKVAA QQDAIRAQFE RQRVEELARA
NFTRTVLAAM AEANATITRA RAQAMQVMLV ANATRTAIEM IIRAAGANAT EAARLAELYI
YLAGLREVAQ TGNVQIVAVS GGGQVVPVIP LAR