Gene Information |
Locus tag | Cthe_1368 |
Symbol | |
ID | 4809363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1661035 |
End bp | 1663167 |
Gene Length | 2133 bp |
Protein Length | 710 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106792 |
Product | S-layer-like domain-containing protein |
Protein accession | YP_001037793 |
Protein GI | 125973883 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
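The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon (assumed present) does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Cthe_1368.
length_bp = gene_length(1661035, 1663167)
print(length_bp)                  # 2133
print(protein_length(length_bp))  # 710
```

Under these assumptions the computed values agree with the Gene Length (2133 bp) and Protein Length (710 aa) fields.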
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000486639 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT TCCATAACAA ACAACTTCAA AATATCATCT GTAGTATTAT TGCCATTGTT ATGATTTTTG GTGTTATTGC GACGGTGGCA TATGCAGAAG TGGTACAGCC GGATGCGTTT CAAAAGGTAG TATTGCAGGT GATTAGTCTT ACAGAGCAAC AGAGGATAAA TTTTGTGAAC AATGTTCTTG ATCAGATTAC TACTGAGAAC TACAAAGACT ATGTGGATGA TGCAAAAGCA ATTCTTGGTT TGTCAATGTC TGACTCGGAC ATGGAAAAGG CATTGAAGAG TTATGCTTAC TATCCGGACA CGCACAAAGG CACTTTGAAG GAATTGATAA AAATCTTCGA ATTGCCTTCA AACCAGTACG ATACGTCAGC TTTTTCAGAC ATTGAGAAGA GGATTAATTT CAACATAACA GGAGATTCCA ATGACAGCAG AGGTTTCAAA CTGTTTGTTG AAATTTTCAA AGCCATCAGG AAATTTAATA ATGCACCTGT ATTCTATGAT GGTTCTGATG ATATTTACAA ACTTGATATT ACAGTTCAAG GAAACAATAC GTTGAAGAAT GAAATTAATG CTCTTATATC TTCCGTAGAG TCTCTTACCA AGAAAGATAT TAATGATTTT GATTCGTTTA CAGAGTATAT CGAAAATGAA ATTAATTCGA ATTCTTATGG ACAGATATAT GGATTGAAAG TATTCTTGAA AGATGAACTG GGATCCAGTA CATATGCCGG TACTTTGCCG GAGCCTTCTG AGATAGATCC TTTACAAAAA GTATTCCTGG CGATTATCAG TCTCCCTCAG AAGGATAGAA AAGCATTTGT AGAGAATGTG CTTGTTAAAC TTACACCGGA CAATTATGCC GATTATGTTT ATGAAGCAAA GAAAATCCTG GGCGTGACAA TTACTGACGG ACAAATGAAA GCTGCGCTCA AAGTATATGC GAGCTATTCG GAAACTTATC GGGCAAAAAT TGAAGGCATG ATATTGGCAT TTGATTTGTC GGCAATTAAA ATTGATACTT CCTTATTTGC TGATCTGGCA GCCGATATCA ATTTTGAAGT TACAGGAGAT GCTAATGATG ACAGGGGAAT AAAATTTGTT GTAAATACAT TGGATTGGCT GTCACAATTT ACTGGACCGA TAGTATTTGA CGGAACTGAG GTTCCTTACA AGGTGGATTT CAGAATTCAG GGCAATGCGA CAATGAAACG TCATCTTGAC GGATTGATAA AACTGATTAA GTCTTTGGAA AAGAGAGGAG TTACGAATTT TGATTCTTTC CTCGTTTTGG CTGAGGATAT AGTTAATGCT AATGACAATA TTCAGATTTA CAACTTTAAG AAGCTGTTGC GCGATTTGTA TGGAAGAAGT GTTTATGATG GAGAACTTCC ATATCCGCCA GTGACACCTA CACCTGCACC GACATCCGGT GGCGGTTCGG GTGGCTCCGG TGGTTCCGGT GGAGGATCAA CTGCTACACC GGCTCCTACA CCTACACCGA CATCCACTTC CATAGAAGAA CCGACACCGA GTGATGTGCC TGCAGCTCCA TTTAACGACA TTGCAGGTCA CTGGGCAGAA GAATTCATTG CAAAACTGGC AGCGAGGAAC GTTGTAAGCG GTTATCCTGA CGGAAGCGTT AAACCTGATA TTGAGATAAC AAGAGCTGAA ATGGCTGTAA TTGTTGTTAA GTCTGCAGGA CTTGAACCGG TTGAAAATGT ATCATTGAAG TTTAAAGATG CCGATCAGAT ACCTGCGTGG GCAGCGGGTT ATGTACAGGC AGGAGTTGAG 
GCAGGTATCA TCGCAGGATA TGAAGACAAT ACATTCAGAC CGTCAAGAAA TCTGACCAGA GAAGAAATGG TTGTATTAAT AATGAAGGCT TATGAATTTG GAGCTGTAGA AAATCCTGAG TTTAGCTTCA TAGATGCAAG TGAGATTGGA GATTGGTCCA AGCCGTTTGT TGGAAAGGCT GTTGAATTGG GATTTGTTGT AGGATATCCG GATAATACAT TTAAGCCTAA GAAGAGTGTA ACCAGAGCTG AAGCATTTAC AGTTCTCTGG AAAGCAATAG AGGCTAAAGA AGCGGCAAAT GCCCCTGCTG AAGAGGCTGA AGTTGCTGAA TAA
|
Protein sequence | MKKFHNKQLQ NIICSIIAIV MIFGVIATVA YAEVVQPDAF QKVVLQVISL TEQQRINFVN NVLDQITTEN YKDYVDDAKA ILGLSMSDSD MEKALKSYAY YPDTHKGTLK ELIKIFELPS NQYDTSAFSD IEKRINFNIT GDSNDSRGFK LFVEIFKAIR KFNNAPVFYD GSDDIYKLDI TVQGNNTLKN EINALISSVE SLTKKDINDF DSFTEYIENE INSNSYGQIY GLKVFLKDEL GSSTYAGTLP EPSEIDPLQK VFLAIISLPQ KDRKAFVENV LVKLTPDNYA DYVYEAKKIL GVTITDGQMK AALKVYASYS ETYRAKIEGM ILAFDLSAIK IDTSLFADLA ADINFEVTGD ANDDRGIKFV VNTLDWLSQF TGPIVFDGTE VPYKVDFRIQ GNATMKRHLD GLIKLIKSLE KRGVTNFDSF LVLAEDIVNA NDNIQIYNFK KLLRDLYGRS VYDGELPYPP VTPTPAPTSG GGSGGSGGSG GGSTATPAPT PTPTSTSIEE PTPSDVPAAP FNDIAGHWAE EFIAKLAARN VVSGYPDGSV KPDIEITRAE MAVIVVKSAG LEPVENVSLK FKDADQIPAW AAGYVQAGVE AGIIAGYEDN TFRPSRNLTR EEMVVLIMKA YEFGAVENPE FSFIDASEIG DWSKPFVGKA VELGFVVGYP DNTFKPKKSV TRAEAFTVLW KAIEAKEAAN APAEEAEVAE
|
| |
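The sequence fields are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing, compute the GC fraction, and translate the gene sequence. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand) and uses the amino acid assignments of translation table 11, which match those of the standard code. The helper names are hypothetical and not part of any database API.

```python
BASES = "TCAG"
# Amino acid assignments in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record's display."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, or T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGAAAAAAT TCCATAACAA ACAACTTCAA"
print(translate(first_blocks))  # MKKFHNKQLQ
```

The translated prefix matches the first block of the protein sequence in the record. Applying `gc_fraction` and `translate` to the full gene sequence would allow the GC content and protein sequence fields to be checked in the same way.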