Gene Cthe_0484 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0484 
Symbol 
ID4808337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp596191 
End bp597369 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content37% 
IMG OID640105898 
Productflagellar biosynthetic protein FlhB 
Protein accessionYP_001036915 
Protein GI125973005 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1377] Flagellar biosynthesis pathway, component FlhB 
TIGRFAM ID[TIGR00328] flagellar biosynthetic protein FlhB 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0127914 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTGA AATATTTTCT GAAAGAAGTT TTATCGAAGT TTGGAAAAAA TAAAGAATTT 
AAAATCCATC AAACAAGTAC CGGAAGAATA AAGGCAAACT TACAGCTTTT TGCGGATAAC
GGCGAGAAAA CTGAAAAAGC CACTCCGAGA AAAAGATCAA AGGCAAGAGA AGAAGGTCAG
GTACTTCAGA GCAGGGAACT GAATTCGGCA ATAGTGCTTC TAAGTGCATT TGTAACACTA
AGGATTTTCG GACAATACAT GTATGAAGAA ATTTTAAAAT GCCTGAAAGT TGCTATAACC
GTTTATCCTT TAAATGACGG CTTGTTTACA ATTGATGGTT TGTTCGAACT TTATGGTGAA
ACAGTGATAA CTTTTTTGAA AATTGCCACT CCGGTGCTTA TGGTGGTGCT GATTGCAGGC
ATTGGTTCTG CATATGCCCA GGTTGGATTT TTGTTCACTA CCAAGACACT GGGAATAAAG
CTAAGCAGGA TTAATCCTTT AAAAGGATTT AAAAGGATTT TTTCACTAAA ATCTTTAGTT
GAGCTTTTAA AGTCGATTAT AAAAATTGCA CTAGTTGGTT ATATAGGCTA TGGATACATC
AAAGGTGAAA TGACAAATGT ATTGGGCATG ATGGACGTGG ATGTGGTAAG TTCTGCATCG
TTTCTTGGTT CTACAATTTT AAACGCGGCG ATAAGAATGT GTATTGCCTT TGTTGTTATA
GGTGTTGCCG ATTACGGTTA CCAGTGGTGG GAATATGAGA AGAGTCTTAG AATGTCAAAA
CAGGAAATCA AGGAAGAAAA TAAAGAAGTT GAAGGAAATC CGGAGATAAA ATCAAGAATA
AGGCAGAAGC AAAGACAGAT GTCAATGAGG AGAATGCTTC AGGATATCCC GAAAGCAGAT
GTGGTTATAA CAAACCCGAC GCATTATGCG GTAGCTATTA AATATGACCC TAAGGAAGCA
GATGCTCCTG TTGTTGTTGC AAAGGGACAG GATTATATGG CCTTGAGAAT AAAGGAAATA
GCAAAAGAGC ATAAAGTGGA AATTGTAGAA AACAAACCTC TTGCACAAAC ACTTTACAAA
ACCGTGGAAA TAGGCGGCAA AATACCTCCT GAGCTATACC AGGCGGTAGC CGAGGTTCTT
GCCTTTGTAT ATAGCTTAAA AGAAAAAATG AAGAAATAG
 
Protein sequence
MKLKYFLKEV LSKFGKNKEF KIHQTSTGRI KANLQLFADN GEKTEKATPR KRSKAREEGQ 
VLQSRELNSA IVLLSAFVTL RIFGQYMYEE ILKCLKVAIT VYPLNDGLFT IDGLFELYGE
TVITFLKIAT PVLMVVLIAG IGSAYAQVGF LFTTKTLGIK LSRINPLKGF KRIFSLKSLV
ELLKSIIKIA LVGYIGYGYI KGEMTNVLGM MDVDVVSSAS FLGSTILNAA IRMCIAFVVI
GVADYGYQWW EYEKSLRMSK QEIKEENKEV EGNPEIKSRI RQKQRQMSMR RMLQDIPKAD
VVITNPTHYA VAIKYDPKEA DAPVVVAKGQ DYMALRIKEI AKEHKVEIVE NKPLAQTLYK
TVEIGGKIPP ELYQAVAEVL AFVYSLKEKM KK