Gene Sare_3832 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3832 
Symbol 
ID5704856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4363869 
End bp4364858 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content67% 
IMG OID641273254 
ProductPhoH family protein 
Protein accessionYP_001538616 
Protein GI159039363 
COG category[T] Signal transduction mechanisms 
COG ID[COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00992918 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTCAATC TCCTCGGTGC GGGTGACGAG ATCCTGCGAC TGGTGGAGCG CTCCGTGAGC 
AGCGACGTGC ACGTTCGGGG CAACGAGATC ACGATCACCG GCGCACCCGC GGACAACGCT
CTCGCCGAGC GGCTCTTCGG CGAACTGATC GAACTCATCG AGAAAGGTGA GACGCTGACC
ACCGACGCCG TCCGGCGCAC CGTCGGCATG CTCGAGCAGG GCAGCGCCGA GCGGCCCGCC
GAAGTCCTCA CGCTCAACAT CCTCTCCCGG CGCGGTCGCA CCATTCGCCC CAAGACACTC
GGGCAGAAGC GCTACGTCGA TGCGATCGAC GCGCACACCA TTGTCTTCGG CATCGGTCCG
GCTGGCACCG GCAAGACCTA CCTGGCGATG GCGAAAGCAG TCCAGACGCT TCAGGCCAAG
CAGGTCAACC GGATCATCCT CACCCGGCCG GCGGTCGAGG CGGGCGAGCG GCTGGGCTTC
CTGCCCGGCA CGCTGAACGA GAAGATCGAT CCCTATCTGC GACCGCTCTA CGACGCGCTG
CACGACATGC TCGACCCAGA GTCGATCCCG AAGCTGATGG CGGCGGGCAC GATCGAGGTG
GCACCGCTGG CATACATGCG GGGTCGGACG CTCAACGACG CGTTCATCAT CCTGGACGAG
GCGCAGAACA CGACCCCCGA GCAGATGAAG ATGTTTCTCA CTCGGCTCGG CTTCGGTTCC
AAGATTGTCG TCACCGGTGA TGTCACCCAG GTGGACCTTC CCGGCGGAAC GACCAGTGGC
CTGCGGGTCG TCCGGGAGAT CCTCACCGAT GTGGAGGACG TGCACTTCGC CCAGCTCTCC
AGCTCGGACG TGGTGCGGCA CCGGTTGGTC GGCGAGATCG TCGACGCGTA CGCCCGCTGG
GACGTCGAAC GGGAGAACCA GCAGGCGAAG AGCGTGCACG CGGTGCCCGG ACGGGCCGCC
CAGGGCGGCC GTGCCGGTCG GCGCCGCTAA
 
Protein sequence
MVNLLGAGDE ILRLVERSVS SDVHVRGNEI TITGAPADNA LAERLFGELI ELIEKGETLT 
TDAVRRTVGM LEQGSAERPA EVLTLNILSR RGRTIRPKTL GQKRYVDAID AHTIVFGIGP
AGTGKTYLAM AKAVQTLQAK QVNRIILTRP AVEAGERLGF LPGTLNEKID PYLRPLYDAL
HDMLDPESIP KLMAAGTIEV APLAYMRGRT LNDAFIILDE AQNTTPEQMK MFLTRLGFGS
KIVVTGDVTQ VDLPGGTTSG LRVVREILTD VEDVHFAQLS SSDVVRHRLV GEIVDAYARW
DVERENQQAK SVHAVPGRAA QGGRAGRRR