Gene Bind_1056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_1056 
Symbol 
ID6201139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp1212596 
End bp1213621 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content61% 
IMG OID641705049 
ProductPhoH family protein 
Protein accessionYP_001832188 
Protein GI182678042 
COG category[T] Signal transduction mechanisms 
COG ID[COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.563658 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.821458 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGCCC TTCCTTCAGG AGAGCCGACG GCCGAGATGA CCCTCGCCTT TGACGATAAT 
CGTTATGCCT TATTGGTCTT TGGCCAATAT GACCAGAACA TTGCCAAGGT TGAACGCCGC
CTGGGTGTCA GCGCTCATGC CAATGGCAAT CATATTACCA TCAAGGGGCC GCAAGGCGCT
TGTGAGCAGG CCCGCCGCGT GTTCGAATCG CTTTATGCGC GGGTCAAGCT CGGCCATCCG
ATCAGCCTTG GTGATGTCGA TGGCGCGATC GAGGAAGGCG TGGTGCAAGG CAGCTTGTTT
CCAGGCGAGA GCGAGGTCGG TCGCCCGGTT TTTGAACAGA TCGCGACGCG GCGACGTGGC
CCGGTGCGCG CCCGCACGGC CGCCCAGGAT TATTACTTGC GAACTTTGAA ACAATCGGAG
CTGGTTTTCG CCGAGGGACC GGCCGGCACG GGCAAGACGT GGCTCGCCGT GGGTTTTGCT
GTGTCTCTCC TCGAACAAGG CCGTGTCGAT CGGCTGATCC TGTCGCGGCC TGCCGTCGAA
GCGGGTGAGC GTCTGGGCTT TCTGCCGGGC GATATGCGCG ACAAGGTCGA TCCTTATCTG
CGGCCGATCT TCGATGCCTT GAATGATTTC ATGGACCCTC GCCTCCTGGA GCGGGGCATG
CAGACTGGTA TGATCGAGGT GGCGCCGCTT GCCTTCATGC GCGGGCGCAC TTTGAGCAAT
GCCTGCGTCT TGCTCGACGA GGCGCAGAAC GCGACCTCGA TCCAGATGAA GATGTTTCTG
ACGAGACTGG GTGAAAATTC GCGCATGATC GTCACCGGCG ATCCGACCCA GACCGATCTG
CCGTCCACGC AGAAATCCGG CCTGAGCGAG GCGATCAACC TTTTGTCGGA GCTTGAAGGC
GTGGGGCACG TCGTTTTTCG CGAAGGCGAT GTCGTGCGGC ATGATCTGGT GCGCCGTATC
GTCGGCGCTT ATGAAGCCGC GTCGCGCGGC GACAACGAGT CGGCAAGACC CATCGGGAGA
GCATGA
 
Protein sequence
MPALPSGEPT AEMTLAFDDN RYALLVFGQY DQNIAKVERR LGVSAHANGN HITIKGPQGA 
CEQARRVFES LYARVKLGHP ISLGDVDGAI EEGVVQGSLF PGESEVGRPV FEQIATRRRG
PVRARTAAQD YYLRTLKQSE LVFAEGPAGT GKTWLAVGFA VSLLEQGRVD RLILSRPAVE
AGERLGFLPG DMRDKVDPYL RPIFDALNDF MDPRLLERGM QTGMIEVAPL AFMRGRTLSN
ACVLLDEAQN ATSIQMKMFL TRLGENSRMI VTGDPTQTDL PSTQKSGLSE AINLLSELEG
VGHVVFREGD VVRHDLVRRI VGAYEAASRG DNESARPIGR A