Gene Cthe_0465 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0465 
Symbol 
ID4808393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp579318 
End bp580871 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content41% 
IMG OID640105879 
Productflagellar M-ring protein FliF 
Protein accessionYP_001036896 
Protein GI125972986 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1766] Flagellar biosynthesis/type III secretory pathway lipoprotein 
TIGRFAM ID[TIGR00206] flagellar basal-body M-ring protein/flagellar hook-basal body protein (fliF) 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGAGG TACTATCAAA AATGCAGCAG CAGGTCACAG ATTTCTGGAA AAACCTGGAC 
AAGTCCCAAA AAACGAGGAT TCTTGTGACT TCCGGTATTT TGGTAGTTGT ACTTACAATA
GCCATAGTAA TGCTTACAAG GACTACATAT GTTCCTTTGA TTACAGTTCA GGATCCTGAT
AGCATTTCTG CTATTGAGGA GGCTCTGAAG GAAAGAAACA TAAAATATAA GCACGGCGAG
GGAAGGCGTA TTCTTGTCGA CTCAAAGGAC AAGAATGAGG CTGAATTTGC TCTTGCATCC
GCAGGATTGA CTGAGCCGGG AATGACATTT GAGGATGCAT GGAGCCTTCT TAAAGTAAGT
TCTTCTGAGA GTGACAAAAA GCAGTTGTGG CAGAATTTCA AGAAAAACAG TCTTATTGCG
AAACTTAAAA TGTTTGACAA TGTAAAAGAC GCCGACATTG AGCTTACAAT ACCGGAAGAT
ACCATGTTTT TCACCGATTC AAAGAGTGAA GCCAAAGCAG CTGTCAGAAT AACTCCCAAG
GGAGAATTAA CTCCCGAGCA GGTTGAAGGA ATAGTTATGG TAGTTGCCTC GTCTATAGAA
GGACTGGATC CTAAAAACGT AACGGTTGTA GACAATAACT TTAATATATT GAATCAAGAT
TTGTCAGACG GCATGAATAT ACCCTCGAGC CATTACAAGC TTAAACTTCG TATAAAGGAA
GAGCTTGAAA AGAATATAAA AAACCTGTAT TCCGGTCGTT CGGACAGCTA TGACTTTATA
AGTGTTGCCG TAAATCCGGT TTTGGATCTG GACAAAGTTA CAAAGAACAG AAAAGAAATT
GAAAAGCCTA CCGGATTGGA TGAGGCTGTA GTCAGTGAGG AAAGAAAAAC CGAAGAGCTG
ATAAACGGAA ATCAAGGCGG CGCTCCGGGA ATGGATGCAA ATCCGGGAAC CGGGGATGTT
CCGACTTATC CTATAGATGC AGGACAAAAC TCTTCTTATG AAAGCAAATC GGAGATAATA
AACAGAATAT TTACCGAGAC ATTGACAGCG GAAGAAAAAG CCATAGGGAC AATGAATTTT
CAAGAATCCT CAATGACGGT TGCCCTGTGG TATGGGAATA GAGTGCCTGA TGACAGCAAA
CTGACAGATG AATTTATAGA AGAGTTTAAA CAGGGGTTGA GCAATGCTAC AGGAATTCCT
GTTGGAAAAA TAACTGTTAA CAAGCAGAAA TTGGCACCTC AGGAGGAAGA GATAGTGCCA
ATGTCCGAAA GGATAAAACA ATTTATAGAT GATTATGGTT TCTTTGCACT GTTGATTATA
TTGATAATTG CGTTAATGCT TTCAGTAATG CCGAGGAAGA AAAAATCACC GCAGCTGGCG
CCTGAACTTG CAACGGCGGG AGGTCCTAAT GTTGATGAAG CTGAAGAAGA ATTGCCTCCT
ATAAACTTTG AAGAACACTC TGAAATCAAG AAACAGATTG AAAACTTTGT AAAACAAAAG
CCGGAGTCAG TTGCGCAGCT TCTTAGAAAT TGGTTGTCCG AAGACTGGGA TTAA
 
Protein sequence
MPEVLSKMQQ QVTDFWKNLD KSQKTRILVT SGILVVVLTI AIVMLTRTTY VPLITVQDPD 
SISAIEEALK ERNIKYKHGE GRRILVDSKD KNEAEFALAS AGLTEPGMTF EDAWSLLKVS
SSESDKKQLW QNFKKNSLIA KLKMFDNVKD ADIELTIPED TMFFTDSKSE AKAAVRITPK
GELTPEQVEG IVMVVASSIE GLDPKNVTVV DNNFNILNQD LSDGMNIPSS HYKLKLRIKE
ELEKNIKNLY SGRSDSYDFI SVAVNPVLDL DKVTKNRKEI EKPTGLDEAV VSEERKTEEL
INGNQGGAPG MDANPGTGDV PTYPIDAGQN SSYESKSEII NRIFTETLTA EEKAIGTMNF
QESSMTVALW YGNRVPDDSK LTDEFIEEFK QGLSNATGIP VGKITVNKQK LAPQEEEIVP
MSERIKQFID DYGFFALLII LIIALMLSVM PRKKKSPQLA PELATAGGPN VDEAEEELPP
INFEEHSEIK KQIENFVKQK PESVAQLLRN WLSEDWD