Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2218 |
Symbol | |
ID | 4811083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2646253 |
End bp | 2648769 |
Gene Length | 2517 bp |
Protein Length | 838 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640107624 |
Product | flagellar hook-associated 2-like protein |
Protein accession | YP_001038613 |
Protein GI | 125974703 |
COG category | [N] Cell motility |
COG ID | [COG1345] Flagellar capping protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGGTCA ATAACATATC AAATTTGATC AACAGCAGAA TAAGATTGAC AGGTATATCC TCGGGTCTTG ATACCGATGC TATTATAGAA CAGCTCATGA GCGTTGAGAG GGCAAAGGTT GACAAGATAA AACAGGAGAA GCAGATACTG GAATGGAAGC GGGATATATA CAGGGATATA ATAAACAAAT TGAGAAGTAT TACAGATGAG TATTTCAACG TTTTGAAGCC CAAAACAAAT TTTACTTCTC AAAGTGCTTT TACATCCTTT AAAATAAGCT CAAGCAATGA GTCAGTTGTT ACGGTAACCG CCAATGCATC GGCGGCTTCC AAGGTTCACA GCATAACGGT GCACTCCCTT GCGTCTGCGG CAAAGATTGT AGGTACCTCA GGATTGGTTG ACGGTATTAA AGGGAGTAAT GCGGTAAACA CTTTGTCGCT TCAGGGCAAA GAAATAAATG TTACCCTGGA CGGAGTTACA AAGACAATAG CGCTGGAGGA TTATACCAGC CTCAGTGACC TTGAAACAAA ACTTGAGTCT GCCCTGGCAA AAGCCTTTGG AACGGGAAAG ATAGATGTTG TCACAACAGG CGGCTCGATA GAGTTTAAAT GTCTTTTAAA CGGCAGTACA TTAAGTATAA GCGATACAGC AAACAACTAT ATTTCATCTT TAGGTTTTTC CAATGGACAG AAAAATTTCA TTACAGGAAA TTCGGATGTA AACTCCGATT TTTCATTATA TACCGACGGC AGTTTTAAAA TAACAGTTGG AAACGGCACG GCGCAAACCA TAAATATTTC AGATGCAACG AGTATAGATG ACCTTGTCGC AAAAATTCAG CAAGCCATTG ACAGTAATTC AGAGCTGAGC GGTAAAGTGC ATGTGAGCAA TGACGGAAGC AAATTAACCT TTATTTCTGT TTCGGGAGAA ACAGTGAAGC TGACTTCCGG AGATTCCAAC AATGTGCTGG ACAAGCTGGG ATTTTCCGAC GGAGCCACTA TAACTGCAAC AAGCTCGACA GTTATTGATT TGAGCGGAAA TGAAAAGGGT AAAACTTTTA TTATTAATAT AAATGGCGTT GACAAAATCA TTGAAATAGA CAAGGACTAT AATGATTTGG ATGAGCTGGC ATCGTACATT CAGAACCAGC TGGGAGGCAC TGTAAATGTA ACAAAAGATG CTTCCGGCAG CAGACTTGTT TTTTCAACCG GAGGGGCGGA CAGACTGATA TTTAAGAAGG GTCCCGAGGA TGGACTGGAA AAGCTGGGAT TTACCGCAAA TGACAACAGG AGCAACAGGA TATCTTTAAC GACAAAGCTT GATTCATTAA GCACAATTTT CAAAAATGAT TTGAATATTG CAGATCCTGA TGCCAATGTT GTTTTCACCA TAAACGGTCA AACCATTGAT GTGGGCAAGA CTTATGCAAA TGCAACATTA AGTGATGTAA TGAATGCCAT TAATTCCAGC AGTGCAGGGG TCAAAATAAC CTATGACTCC CTCAACGACA GGTTTATTAT GGAATCGAAA ACTATGGGAG CGACTTCGGA AATAGAATTA ACCGATACAG ACCCTGCCAA TGGTTTGTTA AAAGCCATGG GACTTATCGG AGGAACCTAT ACTGCCGGTA CGGATGCCGA GTTTGACTTG GACGGGGTTA CCGGCATGAA GCGAAGCACC AATGAATTTA CCATAGAAGG AGTAACCTAC TCACTTAAGG GAGTCTCTTC CGAACCGGTA AAGATTGATG TTAAGGCGGA CATAGATGCT GTTGTTGAAA ATATAAAGAA TTTTGTGAAC AGCTATAATG AAATGCTTGC CAAAATCAAC TCTGTGCTTA CGGAAGAAAG ATACAGGGAT TACCTGCCCC TCACGGACGA CCAGAAGAAA GCAATGAGTG AAGACGATAT AAAGTTATGG GAGCAAAAAG CAAAGTCAGG TTTGCTAAGA AGCGACAGCA TATTGGAGAA TATTGTGACA AACTTGAGGA GAGCTTTATA TGACAAGGTG GAAGGATGTT CCCTGAGCCT TTATCAGATA GGAATTACAA CCGGATCATA CCAGGATAAA GGAAAACTTG TCATAGACGA AGAAAAGCTC AGGGCGGCAC TTACTGATAA TTATGACGCA GTGGTCCAGC TCTTTACCCA GGGCTCACAA TATACATACA GCGAGGCTTT AAACGACCCG AACAAAAGGG CTGTAAGATA CAAGGAAGCC GGAATAGCCC AAAGGATTTA TGACATACTC CAGGACAACA TAAGGATAAC AAGAAATGCC AACGGAAAGA AAGGTATCTT GCTTGAAAAA GCGGGAATTG CAGGAGATTT GACGGAATAT GACAACTTAA TAGTGAATGA AATTAAGGCA AAAGAGACCT TGATTGACGA AATGCTTGTA AAAATCTATA AAAAAGAAGA ATATTATTAC AGCAAGTTCG CGGCAATGGA AAAAATGCTC GATGCAATGA ACAGCCAGTC AATGTGGTTG ACGCAGCAAT TTTCAAATTA TTATTAA
|
Protein sequence | MAVNNISNLI NSRIRLTGIS SGLDTDAIIE QLMSVERAKV DKIKQEKQIL EWKRDIYRDI INKLRSITDE YFNVLKPKTN FTSQSAFTSF KISSSNESVV TVTANASAAS KVHSITVHSL ASAAKIVGTS GLVDGIKGSN AVNTLSLQGK EINVTLDGVT KTIALEDYTS LSDLETKLES ALAKAFGTGK IDVVTTGGSI EFKCLLNGST LSISDTANNY ISSLGFSNGQ KNFITGNSDV NSDFSLYTDG SFKITVGNGT AQTINISDAT SIDDLVAKIQ QAIDSNSELS GKVHVSNDGS KLTFISVSGE TVKLTSGDSN NVLDKLGFSD GATITATSST VIDLSGNEKG KTFIININGV DKIIEIDKDY NDLDELASYI QNQLGGTVNV TKDASGSRLV FSTGGADRLI FKKGPEDGLE KLGFTANDNR SNRISLTTKL DSLSTIFKND LNIADPDANV VFTINGQTID VGKTYANATL SDVMNAINSS SAGVKITYDS LNDRFIMESK TMGATSEIEL TDTDPANGLL KAMGLIGGTY TAGTDAEFDL DGVTGMKRST NEFTIEGVTY SLKGVSSEPV KIDVKADIDA VVENIKNFVN SYNEMLAKIN SVLTEERYRD YLPLTDDQKK AMSEDDIKLW EQKAKSGLLR SDSILENIVT NLRRALYDKV EGCSLSLYQI GITTGSYQDK GKLVIDEEKL RAALTDNYDA VVQLFTQGSQ YTYSEALNDP NKRAVRYKEA GIAQRIYDIL QDNIRITRNA NGKKGILLEK AGIAGDLTEY DNLIVNEIKA KETLIDEMLV KIYKKEEYYY SKFAAMEKML DAMNSQSMWL TQQFSNYY
|
| |