Gene Cthe_1106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1106 
Symbol 
ID4811404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1315691 
End bp1316746 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content44% 
IMG OID640106528 
Producttwitching motility protein 
Protein accessionYP_001037531 
Protein GI125973621 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2805] Tfp pilus assembly protein, pilus retraction ATPase PilT 
TIGRFAM ID[TIGR01420] pilus retraction protein PilT 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.321082 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATATAA ATGATTTACT GAGGAAGACA GTCGAAAATG ATGCGTCCGA CTTGCATATT 
TGTGCGGGGG TGCCTCCGAT TATAAGGGTC TACGGCGATC TTATAAATTT AAACGAGCCG
GTGTTAACTC CGCATGATTG TGTGCAGCTT GCCAAGCAAT GTTTAAGCAG CAGAATGTAT
GAACGCTTTC TTGAAGTCGG AGAAGTCGAT GCATCGTACT CGGTTCCGGG AGTGGCACGA
TTCAGGGTAA ATGTTTTCAA GCAAAGAGGT ACTTGTGCAA TTGCATTCAG AGCAATACCT
ACAGAAGTTC CGTCAATTGA GCAGCTGGGG CTTCCCGGCT TGGTATATGA CCTTGCCATG
AAGCAGAGAG GTCTTGTTTT GATAACGGGA CCTACAGGTC ATGGAAAGTC CACTACACTG
GCTGCCATGA TTAATTACAT AAACAGCAAG AGAAGATGTC ACATAGTTAC GATTGAAGAC
CCTATAGAGT ATTTGCACAG ACATAACAAA AGTATTATAA ATCAGAGGGA AATAGGTTCT
GATACGAAAT CCTACGCAAA TGCGTTGAGA GCTGTATTAA GAGAAGACCC TGACGTTATA
CTTATCGGAG AGATGCGTGA CCAGGAGACC ATTGCAACGG CGCTTACGGC TGCTGAAACA
GGTCACTTGG TTTTGTCTAC TTTGCATACC GTCGGAGCAG CAAAGACCAT CGACAGAATT
ATCGACGTTT TCCCGCCTCA CCAGCAGTCA CAGGTAAGGG TTCAGCTTTC CACCGTGCTT
ATGGGTATCA TATCCCAGCA GCTTATAAAA AGAGCAAATG GCAAGGGACG TGTGCTGGCT
ACTGAGACCA TGGTTTGGAC GCCCGCTATT TCAAATCTTA TTCGTGAAAG CCGTACACCT
CAGATAAATA CTTGTATTCA GACGGGTTCT CAGTTTGGAA TGTATACCAT GGACAGTTGC
CTTGCAGAAC TCTATAAAGC CGGTGAAATT GACTATGAAA GTGCATGCCA GTATTCTGTT
GATATGGATA ATTTGAGAAA ATTAATCTCA AATTAA
 
Protein sequence
MDINDLLRKT VENDASDLHI CAGVPPIIRV YGDLINLNEP VLTPHDCVQL AKQCLSSRMY 
ERFLEVGEVD ASYSVPGVAR FRVNVFKQRG TCAIAFRAIP TEVPSIEQLG LPGLVYDLAM
KQRGLVLITG PTGHGKSTTL AAMINYINSK RRCHIVTIED PIEYLHRHNK SIINQREIGS
DTKSYANALR AVLREDPDVI LIGEMRDQET IATALTAAET GHLVLSTLHT VGAAKTIDRI
IDVFPPHQQS QVRVQLSTVL MGIISQQLIK RANGKGRVLA TETMVWTPAI SNLIRESRTP
QINTCIQTGS QFGMYTMDSC LAELYKAGEI DYESACQYSV DMDNLRKLIS N