Gene Acel_2100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_2100 
Symbol 
ID4485691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp2376405 
End bp2378318 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content66% 
IMG OID639730901 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_873858 
Protein GI117929307 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.269626 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCGGC ACCTGCTCGA TGCGATCACT TGCTCGGGCC CGGTCGTCGT CCTGCGTCTG 
GACAACACCG GCATCACCTA CGCCAGCCCA AACGCCGCCA CCCTGCTCGG CATCTCCCCC
GAGCAGGCGC GGCGCCCGGA GACGTGGCTG GCAGCAGCGC CGGCCGATCT GTTCGAGACG
CTTCGCACCG CGGTCCGCGG ACTCCTCGCC GGCGAGACGA CGGGGCCGAT CGAACTTGAG
GGTGAGGTCA ACTTTGGCGT GCGCCCTCGA GTCCTGCAGA TTGTTCTCAA CCCTGAGCCG
ACCGCCGGCC GGCGGATCCG GGCGGTTTTT GCTTTCCTTC AGGACGTTAC CGAGCGGCGA
GCCGCCGAGC GCACCGTTGC TGATCGTGAA CGCCAGCTCC GCGCGATCAC CTCGGCTTCT
CCCGATGTGA TTGCGGTCGT TGGCCCTGAC CTGCGGGTCA GGTTCGTCAG CGATGCATCG
ACCGGCCTCA CCGGTATCCG GGCCAGTGAC CGGATTGGCG CGCGGATCGG GGAGACCGTA
CATCCCGACG ACCGCCCCGC CCTGCTTGAC GGCATTCGTG CCGTTCTCAC CGGTGCGGCC
GAAGATTTTG TCGTACGGGT TCGGACCCGG CATGCCAGCG GTCGATACAT CGTGTTGGAA
GGCCACGGCC GGCCGGCGCT CGGGCCCGAC GGCGCACCAA CGGCAGCGGT CATCGTCTTC
CGGGATATCA GCGAGCGGCT CGCGTTGGAG GCGGAGCTGG TGAGGGCCAA GGAAGCGGCG
GACGCCGCGT CGGCCGCGAA GAGCGATTTT CTCTCCCGCA TGAGTCATGA ATTGCGCACT
CCGCTCAACG TTATCCTCGG ATTTTCCCAG CTCCTGCAGA TGGAGCGGCT GACCGATGAG
CAGCAGGAGT GGGTCAACCA GATTTTCAAG GCCGGCCGGC ATTTGTTGGA CCTCATCAAT
GAGGTGCTCG ACATCACGCG CATCGAGAGC GGCCGGTTGG CTCTCTCCCT GGAGGCCGTA
TCACTGCGCG ACGTCATCGG AGAAACGATG GTTGCACTCG CGCCATTGGC AGCCGAGCGG
GAGGTCACTG CGGATTGGGT TATCGAGGAC GACGATGTCA CGGTCCGCGC CGACCGGCAG
CGGCTCCGCC AGGTGATGCT GAATCTGGTC GTCAATGCCA TTAAGTACAA CCGTCGCGGC
GGCACCGTAT TGGTGAGCGC ACGGCGGTGC GCCGACCGGG CACTGATCCG CGTGGCCGAC
ACCGGTATCG GCATTGCGCC TGAGCACATC GAGCGGCTTT TCGTCCCCTT CGACCGCCTC
GGTGCGGACG CGATCGACGC CGAGGGCACC GGCGTCGGAC TTCCGCTTAC GCTCCGGCTC
GTGCAGGTCA TGGACGGTGA GCTCACCGTT GACTCCACGC CCGGAGACGG CAGTGTCTTC
ACCGTGGCCC TTCCGCTCGC CGCACCGATG CCGCTCGACC CGGAGAGCCC TGCGGAGAGT
GCGCACGACG GCGCGCCTGC CGCGACGGGA ACGGTCTTGT ACATCGACGA TAACGACCAT
GACGACGGCG TGCTCCGGCA CATTGTCGCG TTGCGGCCGG GAATACGGTG GCTGCGCGCG
GAATGCGGAT CGAAGGGCAT GGAACTCCTG CGCCGCGAGT CGGTCGACCT GGTTTTTCTT
GACGTGCACC TGCCGGACAT GAGCGGCTTC GATGTGCTTC GTGAAGTCCG CAGCGACCCC
CTGACTGCTG CCCTGCCTGT CTATATGGTG AGCGCGGACG CAACCTCGGG CCAGGCTCAA
CGCATGAGTA GCCTGGACGC CACCGGTTTC GTCCCGAAGC CGGTCGATGT CCCCCGCCTT
CTCGGGATCG TCGATTCGGT TCTCCGCGCA AAGTCCGGCG GGCGGGATGG GTAG
 
Protein sequence
MSRHLLDAIT CSGPVVVLRL DNTGITYASP NAATLLGISP EQARRPETWL AAAPADLFET 
LRTAVRGLLA GETTGPIELE GEVNFGVRPR VLQIVLNPEP TAGRRIRAVF AFLQDVTERR
AAERTVADRE RQLRAITSAS PDVIAVVGPD LRVRFVSDAS TGLTGIRASD RIGARIGETV
HPDDRPALLD GIRAVLTGAA EDFVVRVRTR HASGRYIVLE GHGRPALGPD GAPTAAVIVF
RDISERLALE AELVRAKEAA DAASAAKSDF LSRMSHELRT PLNVILGFSQ LLQMERLTDE
QQEWVNQIFK AGRHLLDLIN EVLDITRIES GRLALSLEAV SLRDVIGETM VALAPLAAER
EVTADWVIED DDVTVRADRQ RLRQVMLNLV VNAIKYNRRG GTVLVSARRC ADRALIRVAD
TGIGIAPEHI ERLFVPFDRL GADAIDAEGT GVGLPLTLRL VQVMDGELTV DSTPGDGSVF
TVALPLAAPM PLDPESPAES AHDGAPAATG TVLYIDDNDH DDGVLRHIVA LRPGIRWLRA
ECGSKGMELL RRESVDLVFL DVHLPDMSGF DVLREVRSDP LTAALPVYMV SADATSGQAQ
RMSSLDATGF VPKPVDVPRL LGIVDSVLRA KSGGRDG