Gene Caci_7049 details


Gene Information

Locus tag: Caci_7049
Symbol:
ID: 8338416
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 8192448
End bp: 8193629
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 69%
IMG OID: 644960130
Product: PhoH family protein
Protein accession: YP_003117720
Protein GI: 256396156
COG category: [T] Signal transduction mechanisms
COG ID: [COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.971543
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGACA ACACCGCACC GCGCTCGGGC AAGGGCCGCC AGTCCTCCGC CGCGGACCGC 
ACCGGGAGGG CGGCCGGAGC AGGCCGCGCG GGGGCGAACG GGGCAGCGCC GTCCTCCGGA
ACCTCAGGGA CCCCCAGGAC CGCCGGCGGC GCAGCGCGTC CCGGCCGCGC CGAGGAGGTC
GCCCCGGGCG TCGTCCAGGC CAAGATCGAA ATACCGTCCG GACACCCCCT GGTCTCCGTC
TTCGGCGCCG GCGACGGCCT GCTGAGGGTG ATCGAGAAGG CGTTCCCCGG CGTCGACATC
CACGCCCGCG GCAACCAGGT CACCGTCGCC GGGAGCCAGG GCGAGGTCGG ACTCGTCGAG
CGGCTCTTCG AAGAGATGCT CCTGATGCTC CGCACCGGCG CGCCGCTCAC CGAGGACGGT
GTCGAGCGCT CCATCCAGAT GCTGCGCAGC GGCGAGGCCG ACGTCGACGG GCAGCGCCCC
GCCGAGGTGC TGACGCAGAA CATCCTGAGC AACCGCGGCC GCACCATCCG TCCCAAGACG
CTGAACCAGA AGAACTACGT CGACGCCATC GACCGCAACA CGATCGTGTT CGGCATCGGC
CCGGCCGGTT CCGGCAAGAC CTACCTGGCC ATGGCCAAGG CGGTGCAGGC CCTGCAGAGC
AAGCAGGTCA ACCGCATCAT CCTGACCCGC CCGGCGGTCG AGGCCGGGGA GAAGCTCGGG
TTCCTGCCCG GGACGCTGTA CGAGAAGATC GACCCCTATC TGCGGCCGTT GTACGACGCG
CTGCACGACA TGATCGATCC GGACAGCATC CCGCGGCTGA TGGCGGCCGG GGTGATCGAG
GTGGCTCCGC TGGCATATAT GCGCGGCCGC ACGCTCAATG ACGCGTTCAT CATTCTCGAC
GAGGCGCAGA ACACCTCGCC CGAGCAGATG AAGATGTTCC TCACGCGCCT GGGCTTCGGC
TCGAAGATCG TGGTCACCGG CGACATCACC CAGGTCGACC TGCCCGGGGG GACCGAGTCC
GGGCTGCGGG TCGTGCGCAA TATCCTGACC GGCGTGGAAG ACATCCATTT CGCCGAGCTG
ACCAGCGCCG ACGTGGTACG GCACCGGCTG GTGGGGGACA TCGTCGACGC GTACGGGCGC
TTTGACGCCC GGGGTGGGGA CGCCAAGAGG AAGAGGCACT GA
 
Protein sequence
MADNTAPRSG KGRQSSAADR TGRAAGAGRA GANGAAPSSG TSGTPRTAGG AARPGRAEEV 
APGVVQAKIE IPSGHPLVSV FGAGDGLLRV IEKAFPGVDI HARGNQVTVA GSQGEVGLVE
RLFEEMLLML RTGAPLTEDG VERSIQMLRS GEADVDGQRP AEVLTQNILS NRGRTIRPKT
LNQKNYVDAI DRNTIVFGIG PAGSGKTYLA MAKAVQALQS KQVNRIILTR PAVEAGEKLG
FLPGTLYEKI DPYLRPLYDA LHDMIDPDSI PRLMAAGVIE VAPLAYMRGR TLNDAFIILD
EAQNTSPEQM KMFLTRLGFG SKIVVTGDIT QVDLPGGTES GLRVVRNILT GVEDIHFAEL
TSADVVRHRL VGDIVDAYGR FDARGGDAKR KRH
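
The coordinates and lengths listed in this record can be cross-checked with a few lines of Python. The following is only a minimal sketch: the helper names (`clean`, `gc_percent`, `expected_lengths`) are hypothetical and not part of any database tool, and the protein-length rule assumes an unspliced CDS that ends in a single stop codon.

```python
from typing import Tuple


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-residue display grouping."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (whitespace is ignored)."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def expected_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Gene length in bp (inclusive coordinates) and protein length in aa.

    Assumption: unspliced CDS with one terminal stop codon, so the
    protein has one residue fewer than the number of codons.
    """
    gene_len = end_bp - start_bp + 1
    return gene_len, gene_len // 3 - 1


# Start bp and End bp as given in the Gene Information section.
gene_len, protein_len = expected_lengths(8192448, 8193629)
print(gene_len, protein_len)  # 1182 393

# First line of the gene sequence as displayed above.
first_line = "ATGGCAGACA ACACCGCACC GCGCTCGGGC AAGGGCCGCC AGTCCTCCGC CGCGGACCGC"
print(len(clean(first_line)))  # 60
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content listed in the record.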