Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3832 |
Symbol | |
ID | 5704856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4363869 |
End bp | 4364858 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641273254 |
Product | PhoH family protein |
Protein accession | YP_001538616 |
Protein GI | 159039363 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00992918 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTCAATC TCCTCGGTGC GGGTGACGAG ATCCTGCGAC TGGTGGAGCG CTCCGTGAGC AGCGACGTGC ACGTTCGGGG CAACGAGATC ACGATCACCG GCGCACCCGC GGACAACGCT CTCGCCGAGC GGCTCTTCGG CGAACTGATC GAACTCATCG AGAAAGGTGA GACGCTGACC ACCGACGCCG TCCGGCGCAC CGTCGGCATG CTCGAGCAGG GCAGCGCCGA GCGGCCCGCC GAAGTCCTCA CGCTCAACAT CCTCTCCCGG CGCGGTCGCA CCATTCGCCC CAAGACACTC GGGCAGAAGC GCTACGTCGA TGCGATCGAC GCGCACACCA TTGTCTTCGG CATCGGTCCG GCTGGCACCG GCAAGACCTA CCTGGCGATG GCGAAAGCAG TCCAGACGCT TCAGGCCAAG CAGGTCAACC GGATCATCCT CACCCGGCCG GCGGTCGAGG CGGGCGAGCG GCTGGGCTTC CTGCCCGGCA CGCTGAACGA GAAGATCGAT CCCTATCTGC GACCGCTCTA CGACGCGCTG CACGACATGC TCGACCCAGA GTCGATCCCG AAGCTGATGG CGGCGGGCAC GATCGAGGTG GCACCGCTGG CATACATGCG GGGTCGGACG CTCAACGACG CGTTCATCAT CCTGGACGAG GCGCAGAACA CGACCCCCGA GCAGATGAAG ATGTTTCTCA CTCGGCTCGG CTTCGGTTCC AAGATTGTCG TCACCGGTGA TGTCACCCAG GTGGACCTTC CCGGCGGAAC GACCAGTGGC CTGCGGGTCG TCCGGGAGAT CCTCACCGAT GTGGAGGACG TGCACTTCGC CCAGCTCTCC AGCTCGGACG TGGTGCGGCA CCGGTTGGTC GGCGAGATCG TCGACGCGTA CGCCCGCTGG GACGTCGAAC GGGAGAACCA GCAGGCGAAG AGCGTGCACG CGGTGCCCGG ACGGGCCGCC CAGGGCGGCC GTGCCGGTCG GCGCCGCTAA
|
Protein sequence | MVNLLGAGDE ILRLVERSVS SDVHVRGNEI TITGAPADNA LAERLFGELI ELIEKGETLT TDAVRRTVGM LEQGSAERPA EVLTLNILSR RGRTIRPKTL GQKRYVDAID AHTIVFGIGP AGTGKTYLAM AKAVQTLQAK QVNRIILTRP AVEAGERLGF LPGTLNEKID PYLRPLYDAL HDMLDPESIP KLMAAGTIEV APLAYMRGRT LNDAFIILDE AQNTTPEQMK MFLTRLGFGS KIVVTGDVTQ VDLPGGTTSG LRVVREILTD VEDVHFAQLS SSDVVRHRLV GEIVDAYARW DVERENQQAK SVHAVPGRAA QGGRAGRRR
|
| |