Gene Cthe_2287 details


Gene Information

Locus tag: Cthe_2287
Symbol:
ID: 4809876
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2722448
End bp: 2723380
Gene Length: 933 bp
Protein Length: 310 aa
Translation table: 11
GC content: 36%
IMG OID: 640107693
Product: AraC family transcriptional regulator
Protein accession: YP_001038682
Protein GI: 125974772
COG category: [K] Transcription
COG ID: [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain
TIGRFAM ID:
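
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 2722448, End bp 2723380.
length = gene_length(2722448, 2723380)
print(length, protein_length(length))  # prints: 933 310
```

Under these assumptions the computed values agree with the listed Gene Length (933 bp) and Protein Length (310 aa).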


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000984991
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACTTAT TTACAAGAAG TTACGATAAA GACTGGAACC GCATTGTATA CTTAAAAGTA 
TTGGAAAATC AAGAGGCATG CCTTCATCTG AAGGATGATA AGACTTGTAA ATTAATCCTT
TTGCTCGAAG GAAATCCCAC CATAAGATAT AATAATCAAA CAGCTTTTAT TATCGCACCC
TCTGTTCTTT GCCTGAATCA TTTGGACACC GTAGAATTTG ACTACAACCC CAACTGCAAG
ATGACAGTTC TCTTCTTTAG ACCAAGTGCC TTAAATGATC AGCTGGAATA CAGTGCATTT
CATTCAAATC TTTATGAAAA AATGGTTGGA ACAACGCTGT TTCAAGATAT GGTATTATTG
AGTTCATTTT ATGATATTAA CGATTCCAAA CGTGAGCTCA TCTTATTAGA CACTGTCTCA
GCTGTTAGTC TGATGCAATT ACTCAACAAA ATAAACCAAG AGCTGACAGA GCAGAAAGAT
GGCTATTGGC CATGCAGAAG CAGATCTTAT TTTATCGAAC TGCTCTTTTT CCTGGAAGGA
TTACGCTGTA ACGAGTCATT GCAGAAAATG CGCATCATAT TGGGCAAGAA TAAAAATAGT
ATTGTACATA ACATCATTCA GTATCTCAAT CAAAACATCG ACAAGAAAAT TACCCTTGAA
CATTTGGAAA AAACATTTGC ATGCAACCGA AATCAGATTA ATAAAGAATT TCAAAAGGAA
TTAAATACGA CAGTAATGAA ATATTTTACT CAAATGCGAA TGCAGCTGGC CAGCATCTTG
TTAAGAGACA CCGAAATACC GATACTCGAA GTTGCGCTAA GAGTAGGGTA TTCCGATGCC
GGTTATTTTT CCAAAACATT TAAATTATAT AGCGGTATAT CCCCCAGTGA ATATCGTAAT
TCCTTTTATT CTACGATATC ACCGGCTTTA TAG
 
Protein sequence
MDLFTRSYDK DWNRIVYLKV LENQEACLHL KDDKTCKLIL LLEGNPTIRY NNQTAFIIAP 
SVLCLNHLDT VEFDYNPNCK MTVLFFRPSA LNDQLEYSAF HSNLYEKMVG TTLFQDMVLL
SSFYDINDSK RELILLDTVS AVSLMQLLNK INQELTEQKD GYWPCRSRSY FIELLFFLEG
LRCNESLQKM RIILGKNKNS IVHNIIQYLN QNIDKKITLE HLEKTFACNR NQINKEFQKE
LNTTVMKYFT QMRMQLASIL LRDTEIPILE VALRVGYSDA GYFSKTFKLY SGISPSEYRN
SFYSTISPAL
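
The gene and protein sequences above are related by translation table 11, whose codon-to-amino-acid assignments match the standard genetic code. The sketch below shows one way to translate the space-separated 10-base blocks used in this record and to compute a GC fraction; the helper names `translate` and `gc_fraction` are hypothetical, and the code assumes the input contains only A, C, G, T and whitespace.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino-acid mapping (it differs only in permitted start codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence from this record.
print(translate("ATGGACTTAT TTACAAGAAG TTACGATAAA"))  # prints: MDLFTRSYDK
```

Passing the full gene sequence to `translate` should give the listed protein sequence once its display spaces are removed, and `gc_fraction` on the same input can be compared with the GC content field.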