Gene Cthe_1105 details


Gene Information

Locus tag: Cthe_1105
Symbol:
ID: 4811403
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1314018
End bp: 1315232
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 41%
IMG OID: 640106527
Product: type II secretion system protein
Protein accession: YP_001037530
Protein GI: 125973620
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1459] Type II secretory pathway, component PulF
TIGRFAM ID:
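
The start, end, gene length, and protein length values above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 1314018
end_bp = 1315232
gene_length_bp = 1215
protein_length_aa = 404


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one stop codon.

    Assumes an unspliced, complete CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks should agree with the values listed in the record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```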


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0187302
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCACTTT ACAGCTACAA GGTTAAAAAT GAGGCCGGCA AGTTGTTTAC CGGAGAGGCC 
AAGGTGGACA GCGAGGAGGA ACTTCGGCGA CTGCTTCTGG ACAAGGGATA CACCCCGGTA
GAAATTGTTG AGAAAAATGT AATAAACGAT ATAAGTCAGA TTCGTTTGTT CAAGCCGAGA
GTAAAAGTAA AGGATTTGGC TGTATTCTGC AGGCAGTTTT CCATAGTGTT GGAAGCCGGA
GTACCTATAG CAAATGCATT GGACGTGTTG AGGGAACAGA CTACAAACAG AACTTTGAGA
GAGTGTCTTG ATGATGTTTA TGACAACATA CAAAAAGGTA TTGCCCTTTC CAACGCCATG
CGGCAGCATC CGAGAATTTT TCCGGAGATG CTGATTAACA TGGTTGAAGC TGGAGAAATA
AGCGGACAGC TGGATCTGGT TTTTAAGAGA ATGGCAATTC AGTTTGAAAA GGAAAACAAA
TTGAACCAGA AAATAAGAGG TGCACTTACA TATCCGATTA TAGTAACGGT TGTGGCAATA
GCCGTTATAA TGATATTGAT GGTGGCTGTT GTGCCGACGT TTGTCAAAGT TCTTGCGGAT
TTTGATGTTG AGATGCCCAT TTATACAAGA ATATTGATTG CGGTAAGCGG TTTCTTTAAA
TCTTTTTGGT TTATTATACT TGGCGCTTTG ATTGTTATTG GTGCGGGAAT AGCATATTTT
TCACGAACCT ATGAAGGAAA GATATTTTTT GGCACACTCG CTATCAAGCT TCCTGTGATA
AGAGGAGTTA CGAGGAATAT AATGACGGCA AGGCTTACAA GAACATTGGG AACGCTGATG
TCCAGCGGCG TCTTGTTGAT TCAGGCGATG GAAGTTGTCC AGAAAGTATT GGGAAATCAG
GTTATAAGGG AGAAAATTGA CGGTGTTATT GAGGAAATAA AAAAAGGAAG AGGTCTTACA
GCACCGCTTG CAGCATTGAA TTATTTTCCG CCGATGGTCA TTTCGATGAT CCGAATCGGA
GAAGAATCAG GTAATCTGGA TTTCGCTCTT GATAAATCGG CAGATTTTTA CGACGAGGAA
GTTGAGGCTT CCCTTGCAAT GCTGACAAGT TTTATTGAAC CTGCAATTAT AATTGTGTTG
GCTTTGGTTG TTGGTTTTAT AGTATTGAGT GTGTTGACAC CAATGTTCAC TATTTACAAC
GAAATGTCTT TTTAG
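
The GC content field in the record can in principle be recomputed from the gene sequence above. The sketch below assumes GC content means the percentage of G and C bases over all bases, and that the spaces and line breaks in the listing are formatting only; `gc_percent` is a hypothetical helper name.

```python
def gc_percent(seq):
    """Percentage of G and C bases in a DNA string.

    Whitespace (the 10-base grouping used in the listing) is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# Example on the first line of the gene sequence only; paste the full
# sequence block to compare against the GC content field of the record.
first_line = "ATGCCACTTT ACAGCTACAA GGTTAAAAAT GAGGCCGGCA AGTTGTTTAC CGGAGAGGCC"
print(round(gc_percent(first_line), 1))
```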
 
Protein sequence
MPLYSYKVKN EAGKLFTGEA KVDSEEELRR LLLDKGYTPV EIVEKNVIND ISQIRLFKPR 
VKVKDLAVFC RQFSIVLEAG VPIANALDVL REQTTNRTLR ECLDDVYDNI QKGIALSNAM
RQHPRIFPEM LINMVEAGEI SGQLDLVFKR MAIQFEKENK LNQKIRGALT YPIIVTVVAI
AVIMILMVAV VPTFVKVLAD FDVEMPIYTR ILIAVSGFFK SFWFIILGAL IVIGAGIAYF
SRTYEGKIFF GTLAIKLPVI RGVTRNIMTA RLTRTLGTLM SSGVLLIQAM EVVQKVLGNQ
VIREKIDGVI EEIKKGRGLT APLAALNYFP PMVISMIRIG EESGNLDFAL DKSADFYDEE
VEASLAMLTS FIEPAIIIVL ALVVGFIVLS VLTPMFTIYN EMSF
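
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, assuming the amino acid assignments of table 11 match the standard code (the tables differ in permitted start codons, which this sketch does not handle) and that translation stops at the first stop codon. The `translate` function is a hypothetical helper, not a library call.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) and the matching
# amino acid letters; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon until the first stop codon.

    Whitespace is ignored. Codons with ambiguous bases (e.g. N) raise KeyError.
    """
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGCCACTTT ACAGCTACAA GGTTAAAAAT"))  # MPLYSYKVKN
print(translate("GAAATGTCTT TTTAG"))                  # EMSF
```

The two fragments reproduce the first ten and last four residues of the listed protein sequence.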