Gene Cthe_2243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2243 
Symbol 
ID4809981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2670009 
End bp2671568 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content42% 
IMG OID640107649 
Productflagellar hook-associated protein FlgK 
Protein accessionYP_001038638 
Protein GI125974728 
COG category[N] Cell motility 
COG ID[COG1256] Flagellar hook-associated protein 
TIGRFAM ID[TIGR02492] flagellar hook-associated protein FlgK 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAGTT TCTTCGGATT TAACGTGGCA GTAAAGGGAT TGTTCACGGC TCAAAGAAAT 
ATGGATATAA TAAACCACAA TATTAACAAC GTAAACACAC CGGGATATTC CAGGCAGGTA
GCGATTCAGT CGGCATCAAA CCCCATATCG CTCCTTAACG GAACGGGAAT GCTGGGTACA
GGCTCGGAAG TGCTTGCCAT TGAAAGAATA AGGGATGAAT ACCTTGATTA CAAGTACTGG
AGTGAAAACA TTTCTTACGG GGAATGGAAT GCCAAAAGGA CTCTTCTTGC CGATATGGAA
GTTACATTCA ATGAACCGTC GGACAGCGGA TTTAATGCTG TGATAAACAG TTTTTACAAT
TCTTTGCAGG AACTTGCAAA AGACCCCAGC AGCGATGCGA TCAGGGCTTT GGTAAAAGAA
CAGGGCGTAA CCTTTGCGCG CTACTTTAAC AACATTGCAT CCCATTTTGA AGAGCTTCAG
TTTGATATAA ACAATCAGGT AAAGACTGTT GTTACCGAGA TAAACTCATT AGGCACTCAA
ATAGCCCAGT TGAACAAGCA AATATACACT GCCGAGCTGG ACGGAAATAC TGCAAATGAC
TTAAGAGACA AAAGGACACT TTTGATTGAT GAGCTTTCAA AGCTTGTAAA CATAGATGTC
AACGAAGTAG TGGTGGGCAA ACTCCCCAAC GGCAAAGACG ACAAACGAAT GATAATTACG
ATCAGCGGAA AGGCTTTTGT AGACAACTTT GATGTCAACA AGCTCACCAT TAAGCAAAGA
GAGAATAAAC TGAATGAGGA AGAGGATATT CCAAATCTGT ACGAAGTTTT GTGGGAAGAC
GGCAACAGCC TTAGTGTCAG GGGCGGAGAG CTGAAAGGAC TGCTTGATGT TAGGGATGGA
AATGACGGAG AAAACGGAAG TCCCAACTAT AAAGGCATAC CCTATTACAT AAGGAAACTT
AATGAATTTG TCAGGACTTT TGCAATAGCA TTTAACGAAG GAATAGTGGG AAACGACAAG
GTGGCGGGTC ATGTGGACGG ATATGGCACC AACGGAAATA CAGGAATAAG ATTTTTCACC
ATATTGGGAG AGGAAAACAA ACCTATATCC AGTGCTGATT TCATGTCCGC CGGAGATATT
GACGCCTGCT ACGGGAAAAT GACCGCCAAA AACTTTACCG TAAGTAGCGA CATTCTGGAT
GATCCAAGAA ACATTGCAAC AGCTGATACA AAGGATCAGG TCGGGAACAT AGGAAATATA
AACAGTATCC TGGCTATGAG AAATAACGTC CATATGTTCA AGGAAGGTGC TCCGGAGGAC
TTTGTAAAAT CTGTTATTAC AACTCTGGCG ATAGATTCTC AGCAAACTAT AAGGCTTTCG
TCAATCCATA AAAATATGAT TGAACAGGTT GAAAATCAGA GGATGTCTGT GTCAGGGGTG
TCTTTGGACG AAGAAGTGGC AAATCTGGTA AAGCACCATC AGGCTTATGC GGCGGCCGCA
CAAATGATTA ATACCATGGC GGAAGTTTAC GACATACTGA TTAACAGAGT AGGTCTTTGA
 
Protein sequence
MSSFFGFNVA VKGLFTAQRN MDIINHNINN VNTPGYSRQV AIQSASNPIS LLNGTGMLGT 
GSEVLAIERI RDEYLDYKYW SENISYGEWN AKRTLLADME VTFNEPSDSG FNAVINSFYN
SLQELAKDPS SDAIRALVKE QGVTFARYFN NIASHFEELQ FDINNQVKTV VTEINSLGTQ
IAQLNKQIYT AELDGNTAND LRDKRTLLID ELSKLVNIDV NEVVVGKLPN GKDDKRMIIT
ISGKAFVDNF DVNKLTIKQR ENKLNEEEDI PNLYEVLWED GNSLSVRGGE LKGLLDVRDG
NDGENGSPNY KGIPYYIRKL NEFVRTFAIA FNEGIVGNDK VAGHVDGYGT NGNTGIRFFT
ILGEENKPIS SADFMSAGDI DACYGKMTAK NFTVSSDILD DPRNIATADT KDQVGNIGNI
NSILAMRNNV HMFKEGAPED FVKSVITTLA IDSQQTIRLS SIHKNMIEQV ENQRMSVSGV
SLDEEVANLV KHHQAYAAAA QMINTMAEVY DILINRVGL