Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1105 |
Symbol | |
ID | 4811403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1314018 |
End bp | 1315232 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106527 |
Product | type II secretion system protein |
Protein accession | YP_001037530 |
Protein GI | 125973620 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1459] Type II secretory pathway, component PulF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0187302 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACTTT ACAGCTACAA GGTTAAAAAT GAGGCCGGCA AGTTGTTTAC CGGAGAGGCC AAGGTGGACA GCGAGGAGGA ACTTCGGCGA CTGCTTCTGG ACAAGGGATA CACCCCGGTA GAAATTGTTG AGAAAAATGT AATAAACGAT ATAAGTCAGA TTCGTTTGTT CAAGCCGAGA GTAAAAGTAA AGGATTTGGC TGTATTCTGC AGGCAGTTTT CCATAGTGTT GGAAGCCGGA GTACCTATAG CAAATGCATT GGACGTGTTG AGGGAACAGA CTACAAACAG AACTTTGAGA GAGTGTCTTG ATGATGTTTA TGACAACATA CAAAAAGGTA TTGCCCTTTC CAACGCCATG CGGCAGCATC CGAGAATTTT TCCGGAGATG CTGATTAACA TGGTTGAAGC TGGAGAAATA AGCGGACAGC TGGATCTGGT TTTTAAGAGA ATGGCAATTC AGTTTGAAAA GGAAAACAAA TTGAACCAGA AAATAAGAGG TGCACTTACA TATCCGATTA TAGTAACGGT TGTGGCAATA GCCGTTATAA TGATATTGAT GGTGGCTGTT GTGCCGACGT TTGTCAAAGT TCTTGCGGAT TTTGATGTTG AGATGCCCAT TTATACAAGA ATATTGATTG CGGTAAGCGG TTTCTTTAAA TCTTTTTGGT TTATTATACT TGGCGCTTTG ATTGTTATTG GTGCGGGAAT AGCATATTTT TCACGAACCT ATGAAGGAAA GATATTTTTT GGCACACTCG CTATCAAGCT TCCTGTGATA AGAGGAGTTA CGAGGAATAT AATGACGGCA AGGCTTACAA GAACATTGGG AACGCTGATG TCCAGCGGCG TCTTGTTGAT TCAGGCGATG GAAGTTGTCC AGAAAGTATT GGGAAATCAG GTTATAAGGG AGAAAATTGA CGGTGTTATT GAGGAAATAA AAAAAGGAAG AGGTCTTACA GCACCGCTTG CAGCATTGAA TTATTTTCCG CCGATGGTCA TTTCGATGAT CCGAATCGGA GAAGAATCAG GTAATCTGGA TTTCGCTCTT GATAAATCGG CAGATTTTTA CGACGAGGAA GTTGAGGCTT CCCTTGCAAT GCTGACAAGT TTTATTGAAC CTGCAATTAT AATTGTGTTG GCTTTGGTTG TTGGTTTTAT AGTATTGAGT GTGTTGACAC CAATGTTCAC TATTTACAAC GAAATGTCTT TTTAG
|
Protein sequence | MPLYSYKVKN EAGKLFTGEA KVDSEEELRR LLLDKGYTPV EIVEKNVIND ISQIRLFKPR VKVKDLAVFC RQFSIVLEAG VPIANALDVL REQTTNRTLR ECLDDVYDNI QKGIALSNAM RQHPRIFPEM LINMVEAGEI SGQLDLVFKR MAIQFEKENK LNQKIRGALT YPIIVTVVAI AVIMILMVAV VPTFVKVLAD FDVEMPIYTR ILIAVSGFFK SFWFIILGAL IVIGAGIAYF SRTYEGKIFF GTLAIKLPVI RGVTRNIMTA RLTRTLGTLM SSGVLLIQAM EVVQKVLGNQ VIREKIDGVI EEIKKGRGLT APLAALNYFP PMVISMIRIG EESGNLDFAL DKSADFYDEE VEASLAMLTS FIEPAIIIVL ALVVGFIVLS VLTPMFTIYN EMSF
|
| |