Gene Cthe_2218 details

Gene Information

Locus tag: Cthe_2218
Symbol:
ID: 4811083
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2646253
End bp: 2648769
Gene Length: 2517 bp
Protein Length: 838 aa
Translation table: 11
GC content: 41%
IMG OID: 640107624
Product: flagellar hook-associated 2-like protein
Protein accession: YP_001038613
Protein GI: 125974703
COG category: [N] Cell motility
COG ID: [COG1345] Flagellar capping protein
TIGRFAM ID:
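
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one trailing stop codon. Both are assumptions about this record's conventions, not something the page states, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2646253, 2648769)
print(length, protein_length(length))  # prints: 2517 838
```

Under those assumptions the result agrees with the listed Gene Length (2517 bp) and Protein Length (838 aa).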


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCGGTCA ATAACATATC AAATTTGATC AACAGCAGAA TAAGATTGAC AGGTATATCC 
TCGGGTCTTG ATACCGATGC TATTATAGAA CAGCTCATGA GCGTTGAGAG GGCAAAGGTT
GACAAGATAA AACAGGAGAA GCAGATACTG GAATGGAAGC GGGATATATA CAGGGATATA
ATAAACAAAT TGAGAAGTAT TACAGATGAG TATTTCAACG TTTTGAAGCC CAAAACAAAT
TTTACTTCTC AAAGTGCTTT TACATCCTTT AAAATAAGCT CAAGCAATGA GTCAGTTGTT
ACGGTAACCG CCAATGCATC GGCGGCTTCC AAGGTTCACA GCATAACGGT GCACTCCCTT
GCGTCTGCGG CAAAGATTGT AGGTACCTCA GGATTGGTTG ACGGTATTAA AGGGAGTAAT
GCGGTAAACA CTTTGTCGCT TCAGGGCAAA GAAATAAATG TTACCCTGGA CGGAGTTACA
AAGACAATAG CGCTGGAGGA TTATACCAGC CTCAGTGACC TTGAAACAAA ACTTGAGTCT
GCCCTGGCAA AAGCCTTTGG AACGGGAAAG ATAGATGTTG TCACAACAGG CGGCTCGATA
GAGTTTAAAT GTCTTTTAAA CGGCAGTACA TTAAGTATAA GCGATACAGC AAACAACTAT
ATTTCATCTT TAGGTTTTTC CAATGGACAG AAAAATTTCA TTACAGGAAA TTCGGATGTA
AACTCCGATT TTTCATTATA TACCGACGGC AGTTTTAAAA TAACAGTTGG AAACGGCACG
GCGCAAACCA TAAATATTTC AGATGCAACG AGTATAGATG ACCTTGTCGC AAAAATTCAG
CAAGCCATTG ACAGTAATTC AGAGCTGAGC GGTAAAGTGC ATGTGAGCAA TGACGGAAGC
AAATTAACCT TTATTTCTGT TTCGGGAGAA ACAGTGAAGC TGACTTCCGG AGATTCCAAC
AATGTGCTGG ACAAGCTGGG ATTTTCCGAC GGAGCCACTA TAACTGCAAC AAGCTCGACA
GTTATTGATT TGAGCGGAAA TGAAAAGGGT AAAACTTTTA TTATTAATAT AAATGGCGTT
GACAAAATCA TTGAAATAGA CAAGGACTAT AATGATTTGG ATGAGCTGGC ATCGTACATT
CAGAACCAGC TGGGAGGCAC TGTAAATGTA ACAAAAGATG CTTCCGGCAG CAGACTTGTT
TTTTCAACCG GAGGGGCGGA CAGACTGATA TTTAAGAAGG GTCCCGAGGA TGGACTGGAA
AAGCTGGGAT TTACCGCAAA TGACAACAGG AGCAACAGGA TATCTTTAAC GACAAAGCTT
GATTCATTAA GCACAATTTT CAAAAATGAT TTGAATATTG CAGATCCTGA TGCCAATGTT
GTTTTCACCA TAAACGGTCA AACCATTGAT GTGGGCAAGA CTTATGCAAA TGCAACATTA
AGTGATGTAA TGAATGCCAT TAATTCCAGC AGTGCAGGGG TCAAAATAAC CTATGACTCC
CTCAACGACA GGTTTATTAT GGAATCGAAA ACTATGGGAG CGACTTCGGA AATAGAATTA
ACCGATACAG ACCCTGCCAA TGGTTTGTTA AAAGCCATGG GACTTATCGG AGGAACCTAT
ACTGCCGGTA CGGATGCCGA GTTTGACTTG GACGGGGTTA CCGGCATGAA GCGAAGCACC
AATGAATTTA CCATAGAAGG AGTAACCTAC TCACTTAAGG GAGTCTCTTC CGAACCGGTA
AAGATTGATG TTAAGGCGGA CATAGATGCT GTTGTTGAAA ATATAAAGAA TTTTGTGAAC
AGCTATAATG AAATGCTTGC CAAAATCAAC TCTGTGCTTA CGGAAGAAAG ATACAGGGAT
TACCTGCCCC TCACGGACGA CCAGAAGAAA GCAATGAGTG AAGACGATAT AAAGTTATGG
GAGCAAAAAG CAAAGTCAGG TTTGCTAAGA AGCGACAGCA TATTGGAGAA TATTGTGACA
AACTTGAGGA GAGCTTTATA TGACAAGGTG GAAGGATGTT CCCTGAGCCT TTATCAGATA
GGAATTACAA CCGGATCATA CCAGGATAAA GGAAAACTTG TCATAGACGA AGAAAAGCTC
AGGGCGGCAC TTACTGATAA TTATGACGCA GTGGTCCAGC TCTTTACCCA GGGCTCACAA
TATACATACA GCGAGGCTTT AAACGACCCG AACAAAAGGG CTGTAAGATA CAAGGAAGCC
GGAATAGCCC AAAGGATTTA TGACATACTC CAGGACAACA TAAGGATAAC AAGAAATGCC
AACGGAAAGA AAGGTATCTT GCTTGAAAAA GCGGGAATTG CAGGAGATTT GACGGAATAT
GACAACTTAA TAGTGAATGA AATTAAGGCA AAAGAGACCT TGATTGACGA AATGCTTGTA
AAAATCTATA AAAAAGAAGA ATATTATTAC AGCAAGTTCG CGGCAATGGA AAAAATGCTC
GATGCAATGA ACAGCCAGTC AATGTGGTTG ACGCAGCAAT TTTCAAATTA TTATTAA
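
A GC content figure such as the one reported in the Gene Information section can be recomputed from a sequence in this layout. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the full gene sequence; how the source database computes or rounds its value is not stated, and `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Percent G+C in a nucleotide sequence, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Usage: paste the whole gene sequence (all lines) into one string.
first_line = "GTGGCGGTCA ATAACATATC AAATTTGATC AACAGCAGAA TAAGATTGAC AGGTATATCC"
print(round(gc_content(first_line), 1))
```

The usage line covers only the first 60 bases, so its result is not comparable to the whole-gene figure.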
 
Protein sequence
MAVNNISNLI NSRIRLTGIS SGLDTDAIIE QLMSVERAKV DKIKQEKQIL EWKRDIYRDI 
INKLRSITDE YFNVLKPKTN FTSQSAFTSF KISSSNESVV TVTANASAAS KVHSITVHSL
ASAAKIVGTS GLVDGIKGSN AVNTLSLQGK EINVTLDGVT KTIALEDYTS LSDLETKLES
ALAKAFGTGK IDVVTTGGSI EFKCLLNGST LSISDTANNY ISSLGFSNGQ KNFITGNSDV
NSDFSLYTDG SFKITVGNGT AQTINISDAT SIDDLVAKIQ QAIDSNSELS GKVHVSNDGS
KLTFISVSGE TVKLTSGDSN NVLDKLGFSD GATITATSST VIDLSGNEKG KTFIININGV
DKIIEIDKDY NDLDELASYI QNQLGGTVNV TKDASGSRLV FSTGGADRLI FKKGPEDGLE
KLGFTANDNR SNRISLTTKL DSLSTIFKND LNIADPDANV VFTINGQTID VGKTYANATL
SDVMNAINSS SAGVKITYDS LNDRFIMESK TMGATSEIEL TDTDPANGLL KAMGLIGGTY
TAGTDAEFDL DGVTGMKRST NEFTIEGVTY SLKGVSSEPV KIDVKADIDA VVENIKNFVN
SYNEMLAKIN SVLTEERYRD YLPLTDDQKK AMSEDDIKLW EQKAKSGLLR SDSILENIVT
NLRRALYDKV EGCSLSLYQI GITTGSYQDK GKLVIDEEKL RAALTDNYDA VVQLFTQGSQ
YTYSEALNDP NKRAVRYKEA GIAQRIYDIL QDNIRITRNA NGKKGILLEK AGIAGDLTEY
DNLIVNEIKA KETLIDEMLV KIYKKEEYYY SKFAAMEKML DAMNSQSMWL TQQFSNYY
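
The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with translation table 11 allowing GTG as an initiation codon. The relationship between the two sequences can be sketched as follows. This is an illustrative, assumption-laden version: the codon-to-amino-acid string is the standard genetic code (which table 11 shares), the set of start codons is my reading of table 11, and `translate` is a hypothetical helper that reads the sequence as given, without strand handling.

```python
BASES = "TCAG"
# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ... (standard code).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumed initiation codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(seq: str) -> str:
    """Translate a CDS; a recognized start codon at position 0 is read as M."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


first_line = "GTGGCGGTCA ATAACATATC AAATTTGATC AACAGCAGAA TAAGATTGAC AGGTATATCC"
print(translate(first_line))  # prints: MAVNNISNLINSRIRLTGIS
```

Applied to the first 60 bases of the gene sequence, this reproduces the first 20 residues of the listed protein sequence.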