Gene Cthe_2819 details


Gene Information

Locus tag: Cthe_2819
Symbol:
ID: 4809656
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3330830
End bp: 3335254
Gene Length: 4425 bp
Protein Length: 1474 aa
Translation table: 11
GC content: 46%
IMG OID: 640108239
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_001039211
Protein GI: 125975301
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
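
The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; neither is stated in the record. The function names are hypothetical and chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3330830, 3335254)
print(length)                     # 4425
print(protein_length_aa(length))  # 1474
```

Both values match the Gene Length and Protein Length fields in the record.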


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGATTTA GAAGACATAA GGCTGTAAAA TTGAAAAATT CAGAGCTTTC TCAGAAAATC 
TACGAAACAG TCAGCTCAGC TAAAAAACTC GATGAAAAAT TAAAGGATTT GTCCGGTATC
ACCGAGCAGG TTGCAAAAGG TGCAAGTGAA CAGTCTTCAG CCGTTGATTC AACAAATTCA
ATGGCCGAAA GGCTTTCCCA GTCAATTAAC GGTGTTGTTA AAAATACCGA AAAACTGAAG
GAGCTTGCCG GAGAGTCCAA AAAATCTCTT GAAAGTATGA TTGGCACAAT AAAACTGGTT
GCAGGCAACA GTGAAAGTAC TGCAAAATCG GTAGAAGAAA TATCTTCTTC CATAGAGCAG
ATGGGAAAAT CCATAAAAGG CGTTGCAGGC AATGCTGAAA GCCTCACAGG CTCGGCTGAA
GAAATTTCGT CGGCTATACA GAAGATAGTG GGTTCCATAG AGCAGGTGGC CGGTAATGCC
GAAAGCACTG CAAGTTCTGT AGAGCAGATT TCATCTTCTA TTGAGCAGAT GAGTAAGTCC
ATAAAAGGTG TGGCAGGGAA CGCCGAGAGC CTTACCGGTT CAGCCAGGGA GGCTACAGCG
GCAGTTCAGG AAATGGCCGC ATCCATTCAG CAGGTAGCGG GAAACAGCGA AAATACTGCA
AGTTCCATAG AACGTATATC TTTTTCCATA GAGCAGATGG GAAGATCTAT CAATGGAGTT
GCCGTAAATG CGGAGAGTCT CTCAAATTCG GCGGAGGAGG CAACAAATGC CATTCATTCA
ATGGTTTCAT CAATTCAACA GGTGGCAGGA AATGCCGAAA GTACTGCAAG CTCTATAGAG
CAGATATCAT CTTCCATAGA GCAAATGGGT AAATCCATCA AAGGAGTGGC AGGCAATGCT
GAAAGTACGT CAGGATCGGC TCAGGAAGCC TCGGCGGCAA TACAGGAGAT TGTAGCGTCA
ATCCAGCAGG TGGCGGGAAA TGCAGAGAGT ACGGCAAGTT CCATAGAGCA AATATCATCT
TCCATAGAGC AAATGGGTAA ATCAGTAAAA GGAGTGGCAG GAAACGCCGA GAGCCTAAAA
GGTTCTGCAG ATGAGGCCTC GGCTGCGATA CAGGAGATAG TTGCGTCCAT CCAGCAGGTG
GCAGGAAATG CAGAGAGTAC TGCAAGCTCT GTAGAGCAGA TTTCATCTTC CATTGAACAA
ATGAGCAAGT CAATAAAAAG TGTTGCAGGA AACGCCGAAA GTCTGTCGGG TTCGGCCGAA
GAAGCGTCAG CTGCCGTACA TGAAATAATA TCTTCCATAC AGCAGGTAGC GGGAAATAGC
GAAAGTACAG CCAAATCGGT TGAGGAAATA GCCGCATCCA TAGAGAAAAT GGGTAGGTCC
ATACAGGGCG TTGCAGGCAA TGCGGAGAGC CTAAAGGGAT CGGCAGACGA ATCCTCGGCT
GCGATACAGG AGATAGTTGC GTCCATCCAG CAGGTGGCAG GCAATGCGGA AAGTACAGCA
AGTTCCGTGG AACAAATATC TTCCTCTATA GAGCAAATGG GCAAATCCAT AAAAGGTGTT
GCAGGCAATG CCGAGAGTCT AAAGGGATCC GCAGACGAAT CTGCTGCTGC AATACAGGAG
ATAGTTGCTT CCATACAGCA GGTGGCAGGC AATGCGGAGA GCACAGCAAG TTCTGTGGAA
CAAATATCTT CTTCAATAGA GCAAATGGGT AGGTCGATAG AAGGTGTTGC AGGAAATGCC
GAAAGTTTAA AGGCTTCGGC TGACGAGGCC TCGGCTGCGA TACAGGAGAT AGTTGCGTCC
ATCCAGCAGG TGGCAGGTAA CGCAGAAAGT ACAGCCAAAT CTGTTGAAGA AATATCTTCT
TCGATAGAAG AAATGGGCAA ATCAATCCAA GGTGTTGCCG GGAATGCTGA ACAGCTTCAA
AAATCAGCCA ATGAAACATC CAAGGTAGTT GAAAACATGG CAGCCTCTAT AAGCCAGGTT
GCCCAGAATG CCCAGAATGT AAATGAATTG AGCGAAAAAG TGAGAAGTGA TGCTATAGAT
GGCCAGAAAG CAGTGGCTGA TACTTTAGTG GCTATCAAGG ACATATCTGA AGTTATTCAT
CGTGCGGAGG ATGTTATAAA CGGCCTCGGG AAGAGCTCCG AAAAAATAGG CAGTATTATA
GAGGTGATTG ATGATATTGC CGAACAGACA AATCTTCTTG CACTGAACGC GGCCATTGAG
GCTGCAAGAG CTGGTGAACA CGGCAAGGGA TTTGCAGTTG TTGCCGACGA GGTAAGAAAA
TTGGCCGAAC GGACAGCCAC AGCAACAAAG GAAATATCCG AACTGATTAA GGGCATTCAG
GGAGAGACAA GTCAGGCCAT CAAGGCCATA GAAGTCGGCA CCCAAAAGGT TGAACACGGT
TCAAAGCTGA GTGATGAGGC CGGAAAAGCT ATTGAGAAAA TAGTTGAGGG AATAGAGAAT
GTAAACGTGG AAATCCGCCA GATAACTGCC GCCACCGAAG AGCAGAACAA GGGCAGCATG
AAGATTATTG ATGCTGTGAA TATGGTGACA AATCAGGCAG CCCAGGTAAC CCAGGCTACC
AAAGAGCAGG CGGCAAGTGT GGAAAATATA GTAAGAGGTG TGGCCAATGC AAGGGAGCAG
GTAAGACAGG TCACTGTGGC AGTAAAGGAA CAGGCAAAAC AGGGTCAAAA TATCATAACT
GCCGTGGAAA ATGTGACAAA TCAGGCAGCC CAAGTGACCC AGGCCACTAA AGAACAGACA
AAAGGTGTTG AGGAAATAAT AAAAGGTGTG GCCAATGCGA GAGAACAGGT AAGGCAGGTT
ACTCTTGCAG TAAAAGAACA GGCAAAGCAA GGTCAAAATA TTGTCCAGTC CATTGAAAAC
GTAACTCAGC AAACTGCCCA GGTTGCAGCT GCCGTAAAAG AACAGACAGC CGGTGTCGAG
GAAATAATAA AAGGTGTGGC CAATGCCCGT GAGCAAATAA GACAGGTTAC GGCAGCAATG
AAAGAACAGG CCAAACAGAG CCAGAATATA GTGGTTGCCG TGGAAAATGT GACCAAACAG
GCTGCAGAGG TTACTCAGGC AACCAAAGAG CAGGCAAAAG ACATTGAAGA CATAATAAGA
GGTATTGAAA ATGCAAGAGA ACAGATGCGT CAAGTCACTC TTGCCGTGAA AGAACAGGCA
AATCATGGCC AGAACATAGT GGTTGCTGTG GAAAATGTGA CAAAACAGGC GGCACAGGTA
ACTGATGTAA CCAAAGAACA GGCAAAGGGC GTTGAGGATA TTGTAAAAGG CATAGAGAAC
TCCAGGGAAC AGGTAAGACA AATTACGGAA GCAGTGAAAG AACAGGCAAA ACAAGGTCAG
AATATCGTGG TCGCAGTAGA GAATGTAAAC AGACAGGCAG CCCAGGTGGC CCAGGCAGCA
AAAGAGCAGA CTCAGGGCGT TGAAGAAATC ATCAAGGGTG TGTTGAATGC AAGAGAACAA
GTAAGGCAGA TTACTGCTGC GGTGAAGGAA CAGGCACAAC AGGGACAAAA TATTGTGACA
GCCGTGGAAA ATGTTACCAA TCAGGTATCG CAGGTAACCC AGGCTGCCAA AGAGCAGGCT
CAAGGTGTTG AACAAATTAT CAAAGGTGTA ATAAATGCCC GGGAGCAGGT AAAACAGATT
ACCACTGCGG TAAAAGAAGA GGCTTTGCAG GGTCAGATTG TTATTAGGGC TGTAGAAAAT
GTTACCAACC AGGCTGCCCA GGTTACACAG GCTGTAAAAG AACAGGCAGC GGGAGTTGAG
GAAATTATCA AGAGCGTTGC CGATGCAAGA GAACAGGTAC GCCAGATTAC CGCAGCTGTC
AAGGAACAGG CAAAACAGGG ACAGGATATT ACCGCATCTG TTCAAAATGT CAGCGAACAG
GCGGCTCTTG TGACAAATGC AGTGAAAGAA CAGACTCAAG GTATTGAAGA CATAATAAGA
GGTATTGAGA ATGCAAGAGA GCAGATGCGT CAGGTTACCA CCACAGTAAA AGAACAGGCA
AAGCAGGGAC AGAATATAGT TACTGCGGTT GAAAACGTTG CAAACCAGGC TGCTCAAATT
ACCAATGTGA CAAAAGAACA GGCGCATGGC GTTGAGGATA TAATTAAAAG TGTTGCAAAT
GCCCGTGAAC AGGTAAAACA GATTTCAGCA GCCATGAAGG AACAGGCAGT AAATGCTGAC
AAGGTTATTG GCAACGTTGA GGGCGTAACC GTTCAGGCAA ATGAAGTGGC CGATGCTGCC
AAAGTTCAGG CAAATGAGGT AGAACAACTG GCGAAATATA CATCTGAAAT AGATCGTGTT
ATTAACCTCA ATATCAAGGA TGTTGAGCGT ACTTGCGCGG TAGCGAAAGA ATTGGCGGAA
TACTCGGAAG AGATTATAGG CGCACTGCAG GAACTGTCTA AATAG
 
Protein sequence
MGFRRHKAVK LKNSELSQKI YETVSSAKKL DEKLKDLSGI TEQVAKGASE QSSAVDSTNS 
MAERLSQSIN GVVKNTEKLK ELAGESKKSL ESMIGTIKLV AGNSESTAKS VEEISSSIEQ
MGKSIKGVAG NAESLTGSAE EISSAIQKIV GSIEQVAGNA ESTASSVEQI SSSIEQMSKS
IKGVAGNAES LTGSAREATA AVQEMAASIQ QVAGNSENTA SSIERISFSI EQMGRSINGV
AVNAESLSNS AEEATNAIHS MVSSIQQVAG NAESTASSIE QISSSIEQMG KSIKGVAGNA
ESTSGSAQEA SAAIQEIVAS IQQVAGNAES TASSIEQISS SIEQMGKSVK GVAGNAESLK
GSADEASAAI QEIVASIQQV AGNAESTASS VEQISSSIEQ MSKSIKSVAG NAESLSGSAE
EASAAVHEII SSIQQVAGNS ESTAKSVEEI AASIEKMGRS IQGVAGNAES LKGSADESSA
AIQEIVASIQ QVAGNAESTA SSVEQISSSI EQMGKSIKGV AGNAESLKGS ADESAAAIQE
IVASIQQVAG NAESTASSVE QISSSIEQMG RSIEGVAGNA ESLKASADEA SAAIQEIVAS
IQQVAGNAES TAKSVEEISS SIEEMGKSIQ GVAGNAEQLQ KSANETSKVV ENMAASISQV
AQNAQNVNEL SEKVRSDAID GQKAVADTLV AIKDISEVIH RAEDVINGLG KSSEKIGSII
EVIDDIAEQT NLLALNAAIE AARAGEHGKG FAVVADEVRK LAERTATATK EISELIKGIQ
GETSQAIKAI EVGTQKVEHG SKLSDEAGKA IEKIVEGIEN VNVEIRQITA ATEEQNKGSM
KIIDAVNMVT NQAAQVTQAT KEQAASVENI VRGVANAREQ VRQVTVAVKE QAKQGQNIIT
AVENVTNQAA QVTQATKEQT KGVEEIIKGV ANAREQVRQV TLAVKEQAKQ GQNIVQSIEN
VTQQTAQVAA AVKEQTAGVE EIIKGVANAR EQIRQVTAAM KEQAKQSQNI VVAVENVTKQ
AAEVTQATKE QAKDIEDIIR GIENAREQMR QVTLAVKEQA NHGQNIVVAV ENVTKQAAQV
TDVTKEQAKG VEDIVKGIEN SREQVRQITE AVKEQAKQGQ NIVVAVENVN RQAAQVAQAA
KEQTQGVEEI IKGVLNAREQ VRQITAAVKE QAQQGQNIVT AVENVTNQVS QVTQAAKEQA
QGVEQIIKGV INAREQVKQI TTAVKEEALQ GQIVIRAVEN VTNQAAQVTQ AVKEQAAGVE
EIIKSVADAR EQVRQITAAV KEQAKQGQDI TASVQNVSEQ AALVTNAVKE QTQGIEDIIR
GIENAREQMR QVTTTVKEQA KQGQNIVTAV ENVANQAAQI TNVTKEQAHG VEDIIKSVAN
AREQVKQISA AMKEQAVNAD KVIGNVEGVT VQANEVADAA KVQANEVEQL AKYTSEIDRV
INLNIKDVER TCAVAKELAE YSEEIIGALQ ELSK
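
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that step and of a GC-content calculation like the one behind the GC content field. The helper names `join_blocks` and `gc_percent` are hypothetical, and the record does not say how its 46% figure was computed or rounded.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces and newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGGATTTA GAAGACATAA GGCTGTAAAA TTGAAAAATT CAGAGCTTTC TCAGAAAATC"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(seq[:3])   # ATG
```

Passing the full gene sequence block through `join_blocks` and then `gc_percent` gives a value that can be compared with the GC content field.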