Gene Cthe_0468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0468 
Symbol 
ID4808321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp582774 
End bp584087 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content44% 
IMG OID640105882 
Productflagellar protein export ATPase FliI 
Protein accessionYP_001036899 
Protein GI125972989 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1157] Flagellar biosynthesis/type III secretory pathway ATPase 
TIGRFAM ID[TIGR01026] ATPase FliI/YscN family
[TIGR03496] flagellar protein export ATPase FliI
[TIGR03497] flagellar protein export ATPase FliI 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGACCA TTGATTTTTC AAAATATTAT GATGTCTTGG ATAACAGGGA CTTTATCGAG 
TATACCGGAA AGGTTTCCAA AGTTGTGGGA CTTACAATCG AATCCAACGG TCCCGAGGTT
AACATTGGTG AAATCTGCAA GATAAACGCC TTAAGGGAAA ACAAGGTTAT ATCTGCGGAA
GCAGTGGGGT TTCGGGACAA TAAAGTGTTA CTTATGCCTT TAGGTGACAT GAACGGAATA
GGTCCCGGAA GCAAAGTGGT GGCAACCAGG GATTATCTGT CCGTTGGTGT CGGCAATGCA
CTTATAGGAA GAGTAATTGA CGGAATGGGA AGGCCCATTG ACGGCAAAGG TGAGATTGTT
ACCGAGACTA CATATCCCGT TGAAAATAAA CCTCCGCATC CTTTAAAAAG GAACAGAATC
AAGGAACCAT TGCCTTTGGG AGTGAAGACT ATAGACGGTC TTTTGACTGT GGGCAAGGGA
CAAAGAGTAG GTATTTTTGC GGGAAGCGGC GTGGGAAAGA GTACTTTAAT CGGAATGATT
GCCCGAAATA CAAAGGCTGA TGTCAACGTA ATCGCCCTTA TTGGTGAAAG AGGTAGGGAA
GTAAGAGAAT TTATTGAAAA AGACCTGAAA GAGGAAGGAT TGAAAAGATC TGTTGTGGTG
GTTGCAACTT CCGACCAGCC TGCACTTATC AGACTTAAAG GCGCGCTTAT GGCAACGGCC
ATAGCAGAGT ATTTCAGAGA TCAGGGAAAA GACGTACTTT TGCTTATGGA TTCTCTTACG
AGGTTTGCCA TGGCCCAGAG GGAAATTGGG CTGTCAATTG GCGAACCACC GGTTTCAAGA
GGTTACACCC CTTCAGTGTT TTCGATAATG CCAAAGCTGC TGGAACGGGC CGGAAATTCT
CAATCAGGGT CGATTACCGG ACTGTATACG GTGTTGGTTG ACGGTGACGA CCTGACTGAA
CCCGTGACCG ATACTGCCAG GGGAATTCTT GACGGACATA TTGTTTTGTC CAGAAACCTG
GCAAATAAAA ACCAGTACCC TGCCATTGAC GTTCTGGCGA GTGTGAGCAG GGTTATGCCG
GATATAGTGG ACGATGAACA TCAAAAGATT GCAAACGATA TCAAAAAGAC CATGGCGATA
TACAGAGAAG CTGAGGATTT GATTAATGTC GGTGCTTATG CAAAAGGAAG CAATGAAAAA
ATTGACTATG CCATTGAGGT AATTGACAAG ATACAAGAAT TTATTAAACA GGGTGTCCAT
GAACGATACT CTTATGAAGA GACTATAAAT CTGATGAAAA ATGTTTTGAT TTGA
 
Protein sequence
MATIDFSKYY DVLDNRDFIE YTGKVSKVVG LTIESNGPEV NIGEICKINA LRENKVISAE 
AVGFRDNKVL LMPLGDMNGI GPGSKVVATR DYLSVGVGNA LIGRVIDGMG RPIDGKGEIV
TETTYPVENK PPHPLKRNRI KEPLPLGVKT IDGLLTVGKG QRVGIFAGSG VGKSTLIGMI
ARNTKADVNV IALIGERGRE VREFIEKDLK EEGLKRSVVV VATSDQPALI RLKGALMATA
IAEYFRDQGK DVLLLMDSLT RFAMAQREIG LSIGEPPVSR GYTPSVFSIM PKLLERAGNS
QSGSITGLYT VLVDGDDLTE PVTDTARGIL DGHIVLSRNL ANKNQYPAID VLASVSRVMP
DIVDDEHQKI ANDIKKTMAI YREAEDLINV GAYAKGSNEK IDYAIEVIDK IQEFIKQGVH
ERYSYEETIN LMKNVLI