Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2115 |
Symbol | |
ID | 5055229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1890749 |
End bp | 1891864 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640469667 |
Product | PhoH family protein |
Protein accession | YP_001154313 |
Protein GI | 145592311 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGACA AGATTAAGCC AATGACGGTA GGACAGGAGA GGGCTACGAA CGTCTTAAAA GACCCCGAGA ACGAGTTAAT CGGGCTGTTT GGCCCCACGG GCACTGGGAA GTCCCTGCTA AGCATTGCCT ACGGGATCTG GGCTGTGGAG AACGGAAAGG CAAAGAGGTT TATCATAGCG AGGCCTATTG TCGACGTTGC TACTGGTGAG GTTCTAACGC CGGAGAGACT CGGCGAGATG TATTACAAGA TCGCCGCGGC GTATCTCGAG GACATCTTGG GCCCATATGC CGAAAGGGGG TACATCGAGA AGCTGATAAA AGAGGAAAAG GTAATTGTGA CAGACGTCTC CTACCTAAGG GGGCGCACCT TTGACGACAG CGTGATATTC CTAGACGACG CCCAAAACGT CAAGCCGGAG AGCGCCGCGG AGATTTTAAT CCGCCTGGGG CGGGGCAGCC GGCTGATAGT GGCTGGCGAC CCCATCTTCC AAAAGCCCGC TGACGCTGAG AAAGACGGCG CAACGCTCCT CCGTGAGGCC CTCCTAGGCG AGGAGAAGGC CGAGGTTGTG GATTTAGGAG TTAAGGATAT TGTGAGGCCG GGGGCAAGGC GGGGGATCAA GCTAGCTCTG GAGTTGAGAA TGAGGAAGAG ACAGCTCTCC GAGGCTGAGC GATACATCTA CGAGACAGCT AGGATCTTCG CACCCGACGC CGATATCATA ACCGCCGTCG AGTTTAGGGC AGACAAAGAC TCCTTAGGTA TAAGAGGCGA CAATGTCCCT GATGCCATCA TCATGGTTAA GGAGGGCCAG CTGGGCAGAG TAGTTGGCCG CGGCGGAGAG CGTATTAAGA CCATAGAGGG GGAGGCCAGC GCGAGGCTTA GGTTGTTGGA GATGTCTCTT GATTTTAAGC AGTGGGTCAG AGCAATCCAC CCAGTAGGCT GGATTTCTAA ACACATCGTC GACGCCGACT TTGCAGGCCC CGAGCTACAG ATCCAGGTCA GGAGAAGCGA GTTCGGCGCG TTTATAGGCC AGAGGGGGGC GTACATTAGG TTGATAGACC GCGTCTTTAG GAAACTACTG GGAATTGGGG TCCGCGCTGT CGAAGCTGAG GAATAG
|
Protein sequence | MFDKIKPMTV GQERATNVLK DPENELIGLF GPTGTGKSLL SIAYGIWAVE NGKAKRFIIA RPIVDVATGE VLTPERLGEM YYKIAAAYLE DILGPYAERG YIEKLIKEEK VIVTDVSYLR GRTFDDSVIF LDDAQNVKPE SAAEILIRLG RGSRLIVAGD PIFQKPADAE KDGATLLREA LLGEEKAEVV DLGVKDIVRP GARRGIKLAL ELRMRKRQLS EAERYIYETA RIFAPDADII TAVEFRADKD SLGIRGDNVP DAIIMVKEGQ LGRVVGRGGE RIKTIEGEAS ARLRLLEMSL DFKQWVRAIH PVGWISKHIV DADFAGPELQ IQVRRSEFGA FIGQRGAYIR LIDRVFRKLL GIGVRAVEAE E
|
| |