Gene Pars_2115 details


Gene Information

Locus tag: Pars_2115
Symbol:
ID: 5055229
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1890749
End bp: 1891864
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 55%
IMG OID: 640469667
Product: PhoH family protein
Protein accession: YP_001154313
Protein GI: 145592311
COG category: [T] Signal transduction mechanisms
COG ID: [COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase
TIGRFAM ID:
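
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon that is not counted in the protein length; neither assumption is stated in the record. The function name check_cds_lengths is hypothetical.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check CDS coordinates against the reported lengths.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is excluded from the protein length (assumptions, not stated
    in the record).
    """
    span = end_bp - start_bp + 1              # inclusive span on the replicon
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields for Pars_2115.
checks = check_cds_lengths(1890749, 1891864, 1116, 371)
print(checks)
```

Under these assumptions, 1891864 - 1890749 + 1 = 1116 bp, and 1116 / 3 = 372 codons, of which 371 encode amino acids.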


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTGACA AGATTAAGCC AATGACGGTA GGACAGGAGA GGGCTACGAA CGTCTTAAAA 
GACCCCGAGA ACGAGTTAAT CGGGCTGTTT GGCCCCACGG GCACTGGGAA GTCCCTGCTA
AGCATTGCCT ACGGGATCTG GGCTGTGGAG AACGGAAAGG CAAAGAGGTT TATCATAGCG
AGGCCTATTG TCGACGTTGC TACTGGTGAG GTTCTAACGC CGGAGAGACT CGGCGAGATG
TATTACAAGA TCGCCGCGGC GTATCTCGAG GACATCTTGG GCCCATATGC CGAAAGGGGG
TACATCGAGA AGCTGATAAA AGAGGAAAAG GTAATTGTGA CAGACGTCTC CTACCTAAGG
GGGCGCACCT TTGACGACAG CGTGATATTC CTAGACGACG CCCAAAACGT CAAGCCGGAG
AGCGCCGCGG AGATTTTAAT CCGCCTGGGG CGGGGCAGCC GGCTGATAGT GGCTGGCGAC
CCCATCTTCC AAAAGCCCGC TGACGCTGAG AAAGACGGCG CAACGCTCCT CCGTGAGGCC
CTCCTAGGCG AGGAGAAGGC CGAGGTTGTG GATTTAGGAG TTAAGGATAT TGTGAGGCCG
GGGGCAAGGC GGGGGATCAA GCTAGCTCTG GAGTTGAGAA TGAGGAAGAG ACAGCTCTCC
GAGGCTGAGC GATACATCTA CGAGACAGCT AGGATCTTCG CACCCGACGC CGATATCATA
ACCGCCGTCG AGTTTAGGGC AGACAAAGAC TCCTTAGGTA TAAGAGGCGA CAATGTCCCT
GATGCCATCA TCATGGTTAA GGAGGGCCAG CTGGGCAGAG TAGTTGGCCG CGGCGGAGAG
CGTATTAAGA CCATAGAGGG GGAGGCCAGC GCGAGGCTTA GGTTGTTGGA GATGTCTCTT
GATTTTAAGC AGTGGGTCAG AGCAATCCAC CCAGTAGGCT GGATTTCTAA ACACATCGTC
GACGCCGACT TTGCAGGCCC CGAGCTACAG ATCCAGGTCA GGAGAAGCGA GTTCGGCGCG
TTTATAGGCC AGAGGGGGGC GTACATTAGG TTGATAGACC GCGTCTTTAG GAAACTACTG
GGAATTGGGG TCCGCGCTGT CGAAGCTGAG GAATAG
 
Protein sequence
MFDKIKPMTV GQERATNVLK DPENELIGLF GPTGTGKSLL SIAYGIWAVE NGKAKRFIIA 
RPIVDVATGE VLTPERLGEM YYKIAAAYLE DILGPYAERG YIEKLIKEEK VIVTDVSYLR
GRTFDDSVIF LDDAQNVKPE SAAEILIRLG RGSRLIVAGD PIFQKPADAE KDGATLLREA
LLGEEKAEVV DLGVKDIVRP GARRGIKLAL ELRMRKRQLS EAERYIYETA RIFAPDADII
TAVEFRADKD SLGIRGDNVP DAIIMVKEGQ LGRVVGRGGE RIKTIEGEAS ARLRLLEMSL
DFKQWVRAIH PVGWISKHIV DADFAGPELQ IQVRRSEFGA FIGQRGAYIR LIDRVFRKLL
GIGVRAVEAE E
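
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code (the two tables differ only in the permitted start codons). The helper names clean, translate and gc_fraction are hypothetical, and only the first 60 and last 36 nucleotides of the gene sequence are used here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate whole codons from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last display lines of the gene sequence for Pars_2115.
first_60 = "ATGTTTGACA AGATTAAGCC AATGACGGTA GGACAGGAGA GGGCTACGAA CGTCTTAAAA"
last_36 = "GGAATTGGGG TCCGCGCTGT CGAAGCTGAG GAATAG"

print(translate(first_60))  # MFDKIKPMTVGQERATNVLK
print(translate(last_36))   # GIGVRAVEAEE
```

The two printed fragments correspond to the start and end of the protein sequence listed above. Applying gc_fraction to the full gene sequence gives a value that can be compared with the reported GC content of 55%.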