Gene Cthe_2663 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2663 
Symbol 
ID4808831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3141839 
End bp3143095 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content42% 
IMG OID640108078 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_001039055 
Protein GI125975145 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.457369 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAAGA TTAGAAGTAC CAACAGGCCT TTAAGGATAG CCTATTTGAT TTCTGCCGTA 
CTTGCAGTGG GCCACGGTAT ATTTATTTAT ATAATGTCAG GCGGAAGAAC GGCAGCTTTA
ATTACCAATG TATTGGTGTG CGCACTTGTG GCATTTTCAA TAAAAGTGGC AACTGATATT
ACATTCAGAA GAATTGTCAA CAGAATCAAT ACGGATATGG AGAAAATAAA CCAGGGAGAT
CTGTCCCACC TGATTGAAAC TAAAGACACC GGCGAAGTGA AAAAAATATC TGTGGCTGTC
AATTCCATGT TGCAGGATAT TTGTACATTG ATTGAAAGCT TCCTCTCTCT TTCGTCCCTT
ATTATGGAAT CCACGGAAAA AGTGAGTGCT GCTGCCGAGT CGGCATCGCA GGCCATGGAG
GAAATATCGA GAACTGTGGA GCAAATAGCA ACAGGAGCGT CATCCCAGGC AAATGAGGCA
CAACAAGGTG TACAGGTTAT GGATAAGCTG TCAGAGCAGA TCACACTGGT ATATCAAAAT
TATAACAGCA TCATAGATGA TACAAGGAAA ATCAGTGAAT TGAACAACAT TGGACTGCAG
TCGGTCAAAG TGTTGAGGGA CAAGTCCAAA GAGAACTATG AAACGACGGA AAAAATATTT
TCGGTTGTGG AAAAGCTTGC AGATGGGATA AAGGATATAG GAAACTTTGT TGAGTCCATT
GAAAATATAG CGGAACAAAC AAACCTGCTG GCACTGAATG CAGCGATAGA GGCGGCAAGG
GCCGGAGACG CGGGAAAAGG ATTTGCAGTG GTCGCCGATG AAGTCAGAAA GCTTGCGGAT
CAAAGCAGGA AATCCACGGA AGAGATAAAT TTGTTGGTGA ACAGTATACA GGAAGAATCC
GTATTGGCGA TAGAGTCCAT GGAAATAATG AGAAAAGTGT CGGCAGAGCA GAGTGAGGCC
GTCAATCAGA CGGACAATGC TTTCAGTGAT ATTGCAAATG CAATAGATTC CATAGTTTTA
AGAATTGAAA ATGTAAATCA GGCGGTTGAG AAAATGCAGA ATGACAAGGG AGAAGTAATT
GCCACGATTG AAAACATTTC AGCGGTTTGT GAGGAAACGG CGGCGTTCAG TAAAGAAGTG
GCGATGACAA CAGAGCATCA ATTGAAGTAT ATTGATGAGA TGAAGGAGGC TTCAAGCAGC
CTCAGCGGAC TTGTGAAGGA GCTTGATGCG AAATTGGCAA AGTATAAGAT AAAATAG
 
Protein sequence
MIKIRSTNRP LRIAYLISAV LAVGHGIFIY IMSGGRTAAL ITNVLVCALV AFSIKVATDI 
TFRRIVNRIN TDMEKINQGD LSHLIETKDT GEVKKISVAV NSMLQDICTL IESFLSLSSL
IMESTEKVSA AAESASQAME EISRTVEQIA TGASSQANEA QQGVQVMDKL SEQITLVYQN
YNSIIDDTRK ISELNNIGLQ SVKVLRDKSK ENYETTEKIF SVVEKLADGI KDIGNFVESI
ENIAEQTNLL ALNAAIEAAR AGDAGKGFAV VADEVRKLAD QSRKSTEEIN LLVNSIQEES
VLAIESMEIM RKVSAEQSEA VNQTDNAFSD IANAIDSIVL RIENVNQAVE KMQNDKGEVI
ATIENISAVC EETAAFSKEV AMTTEHQLKY IDEMKEASSS LSGLVKELDA KLAKYKIK