Gene Cthe_1498 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1498 
Symbol 
ID4810648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1817744 
End bp1819552 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content40% 
IMG OID640106918 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_001037919 
Protein GI125974009 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTAAGT CATTTAATAA TTTAAGAATT GGAGTCAGGA TAATTATAGG TTTTTTCATT 
ATTGTGGCGA TAGCGTGTAT TATAGGAGTT GTTGGAATTC AAAATCTTAA GAACGTTCAG
GATTCATATG CACTTGATTA TGAAAGTACT GTCGATGCAT TGGAGTATGT TGAAAGAATC
AGTTCTCATT TTCAGCAAAT AAGGGTAAAT GTCTTTGGTT TTGCATTGTC TTACGACTCA
GATGAAAAAA GGGAGTATTA TACTGAAAGG ATTGCAGAAC ATAAAAATAC AATCGATGAG
AGTATCAATG GCTATCGTGA GATATTGAGT AAGTATGATG CATCTGAAGT TGAAACGGAT
ATGAAGCTGC TTGATAATAT TCAAGCGGCA TTAAATGAAT TTGGAGTCCT AAGGAACAAG
ATGATGAACG ACCTTCAGAC CGGTTTGATA AGCAGGGAGG AATTTGTCTC TTCATTTTCA
AAAGGCGGAG AAGCTCACAA TTTGGCAAGT AATGTGGACA ATGCCATCCT GGAACTGATT
GACTATAATA TTGATTATGC GGCAAATCAG ATTTCAAAAA ACAAAAAACA GGCGGACAAC
TCAATTGGAT TAATGGTTAT GGTAATAGCT GTCGGCGCGG TATTTGCGCT TGTATTAGGT
CTTATTATCT CCAATGGTAT ATCCAAGCCT ATTACCAAAG TGGTTGCTGC TGCCGGCAAG
CTTGCCGAAG GAGATATGGA TATAACTTTT GATATTAATT CCAAAGATGA AACAGGAAAA
CTTGTGGATG CTTTCAGAAA TCTGGTCGAA AGTACGAAAA AGCAGGCATT TATAGTTGAA
AAAATCGCTG ACGGGGATCT TACGGTTGAT GTACCCATTC GTTCCCAAAA GGACTTGCTG
GGACAGAAGC TGTCTGAAAT GGTGCACAAT ATTAACAATT TGATAATGAA TATTGCTTCC
GCGGCCGAAC AGGTTTCAGC AGGAGCCAGA CAAATATCTG ATTCCAGCAT GGCACTTTCG
CAGGGTGCCA CGGAGCAGGC AAGCTCGATC GAAGAGCTGT CCGCTTCCAT AGAAGAAGTA
GCATCAAAAA CAAAGATAAA TGCCGATAAT GCAAATCAGG CCAATGACTT GGCTGAGAAA
GCAAAGACTT TTGCACTTAC CGGAAATGAT CATATGCAGG AAATGTTGAA AGCAATGGAT
GAGATTAATG AATCATCCAA TAATATAAAT AAAATTATCA AAGTAATAGA TGATATTGCT
TTTCAGACCA ACATACTGGC ACTAAATGCC GCAGTTGAGG CAGCCAGGGC GGGACAGCAC
GGCAAAGGTT TTGCTGTTGT TGCCGAAGAG GTCAGAACTC TTGCAGGACG TTCCGCCAAC
GCGGCAAAAG AAACGACGGC TTTGATTGAG GATTCAATAA AGAAAGTGGA AGTCGGAGCT
AAAATTGCGA AAGAAACTGC TGAAGCGTTG GAGAAGATTG TCAGTGGCGT AGAATCTGTG
TCAAATCTGG TAAGTGATAT AAATGAAGCT TCAAATGAGC AAGCCACTGC GATTGCTCAT
ATTAATCAGG GTATTACGCA GGTATCACAG GTAGTTCAGA AAAATTCAGC CACATCAGAG
GAAAGCGCAG CCGCAAGCGA GGAACTTTCA AGCCAGGCTG AAAGGCTCAA ACAGTTGGTG
GAAAAATTCA GGCTGAAGAA AACTTCTGTC ACCATGGATT CTTATGGAGA GCTTAATCCG
GAAATTATAG ATATTCTTGG ACAAATGAGC AAAAATAAGG AAAAAGAAGC GGAAATAGTT
TTGAATTGA
 
Protein sequence
MFKSFNNLRI GVRIIIGFFI IVAIACIIGV VGIQNLKNVQ DSYALDYEST VDALEYVERI 
SSHFQQIRVN VFGFALSYDS DEKREYYTER IAEHKNTIDE SINGYREILS KYDASEVETD
MKLLDNIQAA LNEFGVLRNK MMNDLQTGLI SREEFVSSFS KGGEAHNLAS NVDNAILELI
DYNIDYAANQ ISKNKKQADN SIGLMVMVIA VGAVFALVLG LIISNGISKP ITKVVAAAGK
LAEGDMDITF DINSKDETGK LVDAFRNLVE STKKQAFIVE KIADGDLTVD VPIRSQKDLL
GQKLSEMVHN INNLIMNIAS AAEQVSAGAR QISDSSMALS QGATEQASSI EELSASIEEV
ASKTKINADN ANQANDLAEK AKTFALTGND HMQEMLKAMD EINESSNNIN KIIKVIDDIA
FQTNILALNA AVEAARAGQH GKGFAVVAEE VRTLAGRSAN AAKETTALIE DSIKKVEVGA
KIAKETAEAL EKIVSGVESV SNLVSDINEA SNEQATAIAH INQGITQVSQ VVQKNSATSE
ESAAASEELS SQAERLKQLV EKFRLKKTSV TMDSYGELNP EIIDILGQMS KNKEKEAEIV
LN