Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2506 |
Symbol | |
ID | 4809445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2973047 |
End bp | 2976088 |
Gene Length | 3042 bp |
Protein Length | 1013 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107922 |
Product | S-layer-like domain-containing protein |
Protein accession | YP_001038901 |
Protein GI | 125974991 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0810] Periplasmic protein TonB, links inner and outer membranes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAAA AGATTGGAAA AGTTTTGAGG CAAAACAAGT CTATAATTTC CGTTGTAGTT ATAACAGCGA TTTTGTTTGT TTATAATACC GGAATGCTTT TTACGGGAAT ATGGGAAGGA GCACTGTACA ATGTGAAGGC TGAGGAAGTT CCTTTAAAGC TCGAGTTTTT CAACAATGTG AAGGATGACA ATGTAACCCT TATAAGTCCG TATTTTCGTG TTATCAATAA TAGTTCATCT GATGAAATTT ATTTGCAGCA TGTGAAGATT AGGTATTATT TTACACTGGA CAGCTCGGAC AGCGAAGAAA CCATGAATTA TGAGATTTAT TATGCGGGTA AAAGTAATAT AGACGGTACT GGAGCAGTGG AAGATATAAA GCCGAATACA ATCGTTAAAA TTGCCAAAAT GGATATACCG ACTGATATGG CAGACCATTA CCTTGAGATT GGATTTGACG AAAGTTGCGG AACCATAGGG CCGGATAAAA AAGTTGAAGT TATGGTAAGT ATTTCGAAGG AGAAATACAA GAAGTTTATA CAGACAAATG ACTACTCCTA TAATGATTCC GCCGAAAATT ACGTATCGTG GGAAAAAGTA ACTCTATACC TGGATGGTGA GCTGATTTCA GGAATTGAGC CTAATATGTA TGCAAGCAGG GAAACCGGTG CATGGTATAT GTTTGATGAA GCTGTCGAAG GTTCAACGAA CGAATTTAAG GACTATAAAG GTAATCACGG CAATGCGGTA CTGTATTCGG CAAACGGTGT TGTGCCGGGA TTGAACGGAA ACAGTGTGTC CCTGGACGGA GTTGATGATT ATGTTGCTTT GCCTGACGGA ATAGCCGGTA CTTTCTACAA CTTTACTATA GCTTTCTGGG TGAGGCTGGA CACCATAGGT GAACAGCCGA TATTTGATTT TTTCGATAGT GGTTCCAACA ACAAATATAT GCGTTTAACT GCCGAAAGTG ATGGAAAAAT TAAGTTTGCA ATGACGCAGT CAGGTTATTA TGGTGAGAAG ACCATTACTT CAGGCTCGGC TTTGACTGAA GGTGTCTGGA AGCATGTGGC AGTTACCTTG TCCGGAGACA CCGGTACTTT GTATATTAAT GGAGAGAATG TTGGGGAAAA TAACACGCTT TCTTTAAGAC CTTTAACATT TCTGGGAGAA ACTTCAAAAG GCTATATTGG AAAATCCCAT CAAACAGATT CATCGGAAGA TCCATACTAT AATTCATATC TTCATGGAAT GATAGATGAT TTCCGGATTT TCGACAGGGC TTTAAGTGCT GATGAAATTA AGACACTTGC AAGTGTTGCA ACAAGGGTAA ATGATTCGGA TCCGGGAATA CATTACAGCA GTGGCTGGAG CCATAGCCAG GAAAGAGATA AAGGTGATTA CTTAAATGAC GTTCATGAAC TTGATTCACC GGACGGAGAG AACTGTTTTG AGTATACATT TACCGGAACC GGAGTGAATG TCATAGCTCC CCAATGCAGT GATAACGGTG ACGCAGAGAT ATATATTGAC GGAAAACTTA TGAAGTCGGT GGCTATGAGC GTTTATTCGG GCTACAATTC CCAGGCAGTG GTTTACAGTA AACTGGGACT GTCACTCGGA ACCCATACAA TAAAAGTTGT ATTTAAAAAT GGTATTGGAA TTATAGACGC TTTAGACATA ATGACAGGTG AAATAGTAAG TCCGAGTCCG ACGCCGACGC CAAGCCCAAC ACCGAGTCCG ACACCGACGC CGAGCCCAAC ACCGAGTCCG ACACCGACGC CGACACCAAC ACCGAGCTCG ACACCGACAT CGACACCGAC ACCGGAACCG AGTCCGACAT CGACGCCGAC ACCGGAACCG AGTCCGACAT CGACGCCGAC ACCGGAACCA GAACCGGAGC CGACATCGAC GCCGACACCG GAACTGAGCC CAAGCTCGAC TCCAGTACCG ACGCCGACAC CGACTCCAAC GCCAGCGCCA AACCCTGCAC CGGAACCGGT ACCAATATCA ACACCGGTAC CGGAACCAAT ACTGATTCCG ACTCCAACGC CAACCATGAC ACCGATGCCG ACACCGACCC CAACTCTTGA GGTCAAAAGC GATCCATACC TTTCCGACCT TGTTGTGACA GGAGCAAAGC TTAAACCTGC GTTTGTTCCG GATATACTGA ATTACGAGGC AGTGGCGGAG GAAGATGTAA GGTTTGTGTG TATTGTTGCT TATGCACGGG ATGACGGAGC TGAGATTACT TTAAACGGTG TTCCTGTTAA AAGCGGAAGT ATATCCCATG CTGTTGAGCT TAAGGAAGGA AAAAATGAGC TGATTGTTAA AGTTGTGGCA GAAGATGGAA TAACATCAAG AACGTACCGA ATAAGTGTTT TACTTGAGGC ATTGCAATTG CCTACACCAA CTCCTGATAA AAGCGGCAAT CCATTTTTCT CGTCATTGGA GGATTTGCTT AAGGAAAATG AAGTGAGTCC AGACGGGACA AAGGGTGGGA TATTTGATGA TGTGCCCAGG GGATATTGGG CGGAAGAGTA TATTCAAAAG CTCTATGAAA AGGGAATTAT AAGCGGTATT GATGAAAAAA CGTTCATGCC CGGAAGACCT ATTACAAGGG CCGAGTTCAC ACAGATAATT GTTAATTCCC TGAAAATCCC TTACAGAGAA GCCGGACTGC ATTTCAACGA CGTGACTGAA AAAGACTGGT ATTACAAGAG CGTGTCTTCC GCCGCAGCCT TTGGAATAGT TGTCGGAAGA CCCGACGGAA GTTTTGCTCC AAATGAATTT ATAACCAGAC AGGATATGGC AGTGGTTATT GCCAAGTTTT TGGAGAAAAA ACATGACGGA AACCTGGAAG GAATGGGAAA AGGTCTGGTT TTTGCCGACA GCGGCAATAT CTCGGAATAT GCGCGGGATT CGGTGGCGGC TGTGGTATCT CAAGGGTTGA TGGTCGGAAA ACCCGGCAAC ATGTTTGATC CCAAAGGGCT TACCACAAGA GCGGAAGCGG TAACGGTCAT TTGCAAGCTT ATGAAGTATT AG
|
Protein sequence | MKEKIGKVLR QNKSIISVVV ITAILFVYNT GMLFTGIWEG ALYNVKAEEV PLKLEFFNNV KDDNVTLISP YFRVINNSSS DEIYLQHVKI RYYFTLDSSD SEETMNYEIY YAGKSNIDGT GAVEDIKPNT IVKIAKMDIP TDMADHYLEI GFDESCGTIG PDKKVEVMVS ISKEKYKKFI QTNDYSYNDS AENYVSWEKV TLYLDGELIS GIEPNMYASR ETGAWYMFDE AVEGSTNEFK DYKGNHGNAV LYSANGVVPG LNGNSVSLDG VDDYVALPDG IAGTFYNFTI AFWVRLDTIG EQPIFDFFDS GSNNKYMRLT AESDGKIKFA MTQSGYYGEK TITSGSALTE GVWKHVAVTL SGDTGTLYIN GENVGENNTL SLRPLTFLGE TSKGYIGKSH QTDSSEDPYY NSYLHGMIDD FRIFDRALSA DEIKTLASVA TRVNDSDPGI HYSSGWSHSQ ERDKGDYLND VHELDSPDGE NCFEYTFTGT GVNVIAPQCS DNGDAEIYID GKLMKSVAMS VYSGYNSQAV VYSKLGLSLG THTIKVVFKN GIGIIDALDI MTGEIVSPSP TPTPSPTPSP TPTPSPTPSP TPTPTPTPSS TPTSTPTPEP SPTSTPTPEP SPTSTPTPEP EPEPTSTPTP ELSPSSTPVP TPTPTPTPAP NPAPEPVPIS TPVPEPILIP TPTPTMTPMP TPTPTLEVKS DPYLSDLVVT GAKLKPAFVP DILNYEAVAE EDVRFVCIVA YARDDGAEIT LNGVPVKSGS ISHAVELKEG KNELIVKVVA EDGITSRTYR ISVLLEALQL PTPTPDKSGN PFFSSLEDLL KENEVSPDGT KGGIFDDVPR GYWAEEYIQK LYEKGIISGI DEKTFMPGRP ITRAEFTQII VNSLKIPYRE AGLHFNDVTE KDWYYKSVSS AAAFGIVVGR PDGSFAPNEF ITRQDMAVVI AKFLEKKHDG NLEGMGKGLV FADSGNISEY ARDSVAAVVS QGLMVGKPGN MFDPKGLTTR AEAVTVICKL MKY
|
| |