Gene Acel_0197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0197 
Symbol 
ID4485283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp208420 
End bp209907 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content69% 
IMG OID639728960 
ProductPhoH family protein 
Protein accessionYP_871957 
Protein GI117927406 
COG category[T] Signal transduction mechanisms 
COG ID[COG1875] Predicted ATPase related to phosphate starvation-inducible protein PhoH 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.213754 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACGA CGTATGAACC GCCTGCCTCA CCGAGCCGGC CGGCGGGCGG AAATCCCGAC 
ACCGTAACTC CCGCCCCGGG TACTGAGCAT CCCGTCCCTG GCGCTGGGCC GCTGACGCGG
AGTGCGGCGC AGACCGGCCA CGTCGTCGTA CTCGACACCT CGGCGCTGAT GGCCGACCCC
GAGGGCATTT TCGACGCGTA TCCCGGCGCT GACCTCGTCA TCCCGCTTGC CGTCATTCAG
GAGCTTGACG GCCTGAAGAA ACGCTTGGAT CCTGCCGGCG CGGCGGCCCG CGACGTGCTG
CGCCGGGTGG AAGCACTCCG GCTCGCCGCC GGCGGGGACC TGCGCGAGCC GTTGCAGCGC
CGGGTCGGGG GGACCGTCCG GATGCAGGTG AACGGCGTGC ACGCCGCCTT GCTCCGGCGC
TATGGCCTGC ATGCCGACAG TCCGGACAAC CGGATCGTCG GCACAGCTCT TGCCCTGGCC
GCCGCGCAAC CCGACGGTCC GCAGGTGCTC GTGGTCTCCA ACGACACCGC CCTGCGGCTG
ACCGCGGCCG CCGTCGGCCT GGCTGCGGCG GAGCACAACC CGCTGGACGC CGCCCCCGCC
CACGCCCGGC CGGACGGCTG GATCGCCCTC GAGGTTTCGC CGGAGGTCCT CGACCAGCTC
TTCGAGAACC GGCTGATTGA CGCCGCGAGT GTCGGCGCGT CGGACGTGGC GGAGAACACG
TTCGTCGTCC TGCGCGCCGG AACGAGTTCG GCGCTGGCGC GCTACCGCAA CGGCCTGCTC
CGGGTGCTCG ACCAATTGCC GCCGATCTGG GGTCTGCGGC CTGCGAACAA GGAGCAACGC
TTCGCGCTGG ACCTGCTGCT GGATGACGAC GTGCGGGTCA TCGTCCTGGA CGGGCCGGCT
GGGACAGGCA AGACGCTCTG CGCGGTGGCG GCAGGCCTGC ACATGGTGGT CGAGCAGCAC
CGCTTCGAGC GGATGTCGGT GTACCGGCCG GTCATACCGG TAGGCCAGGC GGATCTCGGC
TACCTCCCCG GCACCTTGGA CGAGAAGATC GATCCGTGGA TGGCTGCCAT CACGGATGCC
GTGGCGGCGC TCTCCGGGGA CGGGCTGGAT CGACGCCGCC GCCAGGAGGG GACGCGCGGC
AAGGTCGCGC AAGATTCGCT CGATTACATC AAGGCGCAGG GGTTGCTCAC CATGGAGTCG
GTGACCCACC TGCGTGGACG CACGCTGCAC TCGACTTTCG TCCTCGTGGA CGAGGCGATG
AACCTCTCCC CGCAGGTCGG CAAGACCCTG CTCACCCGGA TCGGCGCTGA CTCTAAGATC
GTTCTGACTG GGGACACGTC ACAAATTGAC GCACCGTTTC TGTCTGAGCG GACGAACGCG
CTGACCGCCG TCGTGTCTGC CTTTGCCGGT CAGCCGTGCT TCGGCCACGT GCGGCTCACC
CGCGGTGAGC GATCACCCGT TGCGGAGCTC GCAGCCCGTC TCATGTGA
 
Protein sequence
MATTYEPPAS PSRPAGGNPD TVTPAPGTEH PVPGAGPLTR SAAQTGHVVV LDTSALMADP 
EGIFDAYPGA DLVIPLAVIQ ELDGLKKRLD PAGAAARDVL RRVEALRLAA GGDLREPLQR
RVGGTVRMQV NGVHAALLRR YGLHADSPDN RIVGTALALA AAQPDGPQVL VVSNDTALRL
TAAAVGLAAA EHNPLDAAPA HARPDGWIAL EVSPEVLDQL FENRLIDAAS VGASDVAENT
FVVLRAGTSS ALARYRNGLL RVLDQLPPIW GLRPANKEQR FALDLLLDDD VRVIVLDGPA
GTGKTLCAVA AGLHMVVEQH RFERMSVYRP VIPVGQADLG YLPGTLDEKI DPWMAAITDA
VAALSGDGLD RRRRQEGTRG KVAQDSLDYI KAQGLLTMES VTHLRGRTLH STFVLVDEAM
NLSPQVGKTL LTRIGADSKI VLTGDTSQID APFLSERTNA LTAVVSAFAG QPCFGHVRLT
RGERSPVAEL AARLM