Gene Arth_2232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2232 
Symbol 
ID4445154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2510857 
End bp2511945 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content62% 
IMG OID639690041 
ProductPhoH family protein 
Protein accessionYP_831712 
Protein GI116670779 
COG category[T] Signal transduction mechanisms 
COG ID[COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00260346 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAAG CAACAAACGG AAGGCCCCGG TTCAGCGCCG GAGAGCGCAC CACAGGTGAA 
TTTCCCCATA GTCTTCCCGG CGTCCGGACG GAGGTGGTCA TCTTCGACAA CTCCGAACAA
ATGGTTCAGT CGCTCGGCAG CCACGATGAG GCGCTGCGTT TCATCGAGGA ACAGTTTCCG
GCCGTCGACT TCCACGTCCG CGGCAATGAA CTGGCCATCA GCGGCCCCGC CGCCGACGTT
CCCAGGATCA TGCGGTTGCT CCATGAAGTG CGCGGGCTCG TTGACCGGGG AACTGTCATT
ACGCCGGCGG TACTCCAGCA GCTCGCAGCC CTGCTCCGGA GCCAGTCACT CCAAAACCCG
GTTGAGGTAC TGACCTACAA CATCCTCTCG AGCCGCGGCA AGACGATCCG GCCCAAGACG
CTGAACCAGA AGAACTACGT GGATGCCATT GACGCCCACA CGGTGATCTT CGGAATAGGA
CCTGCCGGTA CCGGCAAGAC GTTCCTCGCC ATGGCGAAGG CAGTCCAGGC ACTGCAGCAG
AAGGAAGTCA GCCGCATCAT CCTTACCAGG CCAGCGGTGG AGGCCGGCGA ACGGCTGGGC
TTCCTGCCCG GGACGCTGAG CGACAAGATC GATCCTTACC TGCGTCCCCT TTACGATGCA
CTGCACGACA TGATGGACCC GGAGTCCATT CCCCGCTTGA TGGCGGCAGG CACCATCGAA
GTGGCTCCGC TGGCCTATAT GCGCGGCCGG ACCCTCAACG ATGCCTTCAT CATCCTCGAC
GAGGCCCAGA ACACCACACC AGAGCAGATG AAGATGTTCC TGACCCGGCT CGGCTTCGGC
TCCAAAATGG TGGTCACCGG GGATGTCACC CAAGTGGACC TGCCATTCGG CACCCGTTCC
GGCCTCCGGA TCGTGGAAGA GATCCTGCAG GGAATTGAGG ACGTGAATTT CAGCGTGCTG
GATGCCTCGG ACGTGGTCCG CCACCGGCTG GTGGGCGACA TCGTCAATGC GTACGGCGTG
TGGGACGAGG CACAGAGGAA CCGGGTGAAG CATTCCGTGA CCCGGGAGAA GCGGGGAGAG
CACGCATGA
 
Protein sequence
MTEATNGRPR FSAGERTTGE FPHSLPGVRT EVVIFDNSEQ MVQSLGSHDE ALRFIEEQFP 
AVDFHVRGNE LAISGPAADV PRIMRLLHEV RGLVDRGTVI TPAVLQQLAA LLRSQSLQNP
VEVLTYNILS SRGKTIRPKT LNQKNYVDAI DAHTVIFGIG PAGTGKTFLA MAKAVQALQQ
KEVSRIILTR PAVEAGERLG FLPGTLSDKI DPYLRPLYDA LHDMMDPESI PRLMAAGTIE
VAPLAYMRGR TLNDAFIILD EAQNTTPEQM KMFLTRLGFG SKMVVTGDVT QVDLPFGTRS
GLRIVEEILQ GIEDVNFSVL DASDVVRHRL VGDIVNAYGV WDEAQRNRVK HSVTREKRGE
HA