Gene Acel_1594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1594 
Symbol 
ID4484647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1793309 
End bp1794880 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content68% 
IMG OID639730378 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_873352 
Protein GI117928801 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.437577 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.697312 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTCCCC CTCCACCGAA CGTCGACGTC CCGGCGCACG GACGGTCGGT CGGCGGCCGA 
GCGGTTCGCG TCGTCCGGCG GATATTTGTG GGACCCCGCA CCCATCTCAC CCTCCGCGCG
CGGGTGAGTC TGCTTGTCGC CGTCGTCGTC GGCCTGGCCG TGGCGGTCAT CTCGTCGGTC
GCCTATGTGA CGGTGCGCGC GGAATTATTG CACCAACTGG ACGCGTCGCT GCGGTCCCGA
GCCGTCGCCG CAGCGGCCGT TCAGCTCGAT ACGGCCTCGA TTCTCGACCA GTCCACGTTG
GTTGTTCTCA CCGCGGAGAA CAAGGTATTC CTCGTGTCGG CGGACGGTGA TGTCATCGGC
AACATTCGCG ACCGGGATTT TGCCGTTCCG CTGGCCAACG GCCCGGAACT TGCGGTCGCA
CAAGGAAAAT TGGCCTGGTC TTTGCGCACG GTCACGGTGG ACGGCGTCGC GTACCGCATG
GTCGCCGTGC CGTCGGCGTC CGGGCAGGCG TTGGTTCTCG CCCGATCGAT GACAGAGACC
TCTGAAACCC TTGCCCGGCT CGGTTTCGTC CTGCTCGCCG TCGGGGTCGC CGGTATCGCC
GTCGCGGCCT TCAGCGGCTT GACCATTGCG CGGGCCGGTT TGCGCCCGGT GGAACGGTTG
ACGGCCGCCG CCGAACGTGT CGCGCGTACC GGCGACCTTA CGCCCATCGA CATCGACCGG
GACGACGAAA TCGGCCGGCT GGCGCAGAGT TTCAACGCCA TGCTGGTCGC GCTCGACCGG
TCCCAGCAGC TGCAGCGTCA ACTGGTGGCC GATGCCGGCC ATGAGCTCCG CACGCCGCTC
ACCAGCCTGC GGACCAATCT CGACTTGCTG GCACAGAGCG AAGCCGCCGG CGACCGCGGC
CTATCCCGAG AAGACCGCCT CGCCTTGTTG GCCGATGTGC GGGCGCAGGT CGAAGAACTC
TCCGGCTTGG TCGCCGATCT CGTCGAGCTG GCCCGGGACG ATGTGCCCGA CCAGCACCTG
GAGGAGATCG ATCTGGCGAC TGTCGCCGAG CGTGCCGTTG AGCGGGTGCG GCGTCGCGCG
TCCGGACTGC GGTTCGACAT TTCCCTGCAA CCGTGGCTGG TTTACGGTGA TCCGACGATG
CTCGAACGTG CGGTGACGAA CCTGCTCGAC AATGCGGTGA AGTGGAGCCC GCCCGGCGGC
CGGGTCGAAT TACGGTTGGA GAACGGCCGG CTCACCGTCA CCGACGAAGG GCCGGGCATC
TCCGACGTCG ACCTGCCGCA CATCTTCGAC CGGTTTTATC GCTCCGCCGA TGCCCGCAAG
ATGCCCGGCT CCGGGCTGGG TCTTGCGATC GTCCGGCACG CCGCACTCCG GCATGGCGGC
ACGATTCAGG CCGGCAAGGC GCCGTCGGGT GGCGCGTTGT TCGTGATGGA ATTACCCGGC
CGCCCGGTGT CGGCCGGTGA GGAGTACGCC GCGGACGCCG AGGAGTTCGC GGCCGAGCAG
CCGCCAGCGG AGCCGCCGCC GAAGCCGTCC CCAGCGATTT CCCAGGAAGA TAGGGGAGTG
TTACGACGAT GA
 
Protein sequence
MIPPPPNVDV PAHGRSVGGR AVRVVRRIFV GPRTHLTLRA RVSLLVAVVV GLAVAVISSV 
AYVTVRAELL HQLDASLRSR AVAAAAVQLD TASILDQSTL VVLTAENKVF LVSADGDVIG
NIRDRDFAVP LANGPELAVA QGKLAWSLRT VTVDGVAYRM VAVPSASGQA LVLARSMTET
SETLARLGFV LLAVGVAGIA VAAFSGLTIA RAGLRPVERL TAAAERVART GDLTPIDIDR
DDEIGRLAQS FNAMLVALDR SQQLQRQLVA DAGHELRTPL TSLRTNLDLL AQSEAAGDRG
LSREDRLALL ADVRAQVEEL SGLVADLVEL ARDDVPDQHL EEIDLATVAE RAVERVRRRA
SGLRFDISLQ PWLVYGDPTM LERAVTNLLD NAVKWSPPGG RVELRLENGR LTVTDEGPGI
SDVDLPHIFD RFYRSADARK MPGSGLGLAI VRHAALRHGG TIQAGKAPSG GALFVMELPG
RPVSAGEEYA ADAEEFAAEQ PPAEPPPKPS PAISQEDRGV LRR