Gene Information |
Locus tag | Cthe_3050 |
Symbol | |
ID | 4811122 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3577303 |
End bp | 3579306 |
Gene Length | 2004 bp |
Protein Length | 667 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640108471 |
Product | fibronectin, type III |
Protein accession | YP_001039439 |
Protein GI | 125975529 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0810] Periplasmic protein TonB, links inner and outer membranes |
TIGRFAM ID | |
| |
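The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Neither convention is stated in the record. The function names `span_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 3577303
END_BP = 3579306
GENE_LENGTH_BP = 2004
PROTEIN_LENGTH_AA = 667


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)  # prints: 2004 667
```

Under these assumptions the computed values agree with the reported Gene Length (2004 bp) and Protein Length (667 aa).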
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000157497 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAATAAGT GGGAAATTAT CAAGAAAACA GTCTCAGTTT GTCTTTTATT ATCCATAACG TTTTCAATTT TTATTAATTC GGATGTTGTT TTGGCATTAA GTCAAAAAGT AGAGAATATA AATTTTGGAG AGAATATAAA TTTCTATGGA GTAGTACAAG ATTCAGCAAT TAGATTGGTT ACACCTACCG TGGGTGTTGA ACCGACCCCT ATTGGGAGCT TGACGCCGGC AGTCACAGAA ACACCTTTGG CAACACCTAC TCAAGAGACG TTGCCGTCAC CGTCTCCATC AGAGTTTTCT ACTCCTACGC CATCATTTAC TCCTGATGCA TCACCGGAAT CTACTTCTAC GCCATTTCCG TCGCCGTTAC CATTCCCTAT GCCGGATTCA ACTTCTACTC CAACACCGGA TCCGGATTCA ACTTCGACGC CAACGCCAAC TCCGACCCCA ATTCCGACTC CGTCAGTTTC ACCTTCACCA TCGGATACGG AGGCACCGTC AAGACCGGAA TGTTTAGTTA CTACGGACAG AACTGACACA ATGATTTCTT TGTCGTGGAG TGCTTCAACT GACAATGTTG GAGTAAAGGG TTACTATATA TACAGAGACG GAGTAAAACT GGATGTTAGT GTGACAGAAC CATGTTTTAC TGATGAAGGG TTAACAGAGA ACACCACATA CAGATATTAT GTAACAGCTT ACGATGAAGC GGGCAATGAA TCGGAAAGAA GTACGGAACT TGTAGTTATG ACCCTTGCTA ATAATATTAC GGGGCTGAAT GCTGTTGTAA ATGTGGATGG AAGTATATTG GTTTCGTGGA ATCAAGTTGC AAGAGCCGCG GCATATGAAT TAATGATAGA TGAATATGAA TCCGTATGTA TCTATGATAC AAGTTATTTG CATACAGGTC TTCTGCCGAA TACGCGTCAT ACATACAGAG TCAGGGTTAG ATATGCCGGT AACGACTATG GGGCATGGAG CGAGAAAAAG GTAGTTTTCA GCTTTCCGGG CAAACCCTTG GATGTGGGTG CTGATATAAT TGATGATAAT TCTGTGAGGA TATTCTGGAA CCAAGTTCAG GGGATAAGTA AGTATAACGT GTATAGAGAC GGTGAGTTGA TAGCTGCTGA AGTCAAAGCG GTTGAGTTTA CGGATACGGG TTTGACTGCA GGTAGAGATT ATGAATATGA GGTAAGAGCA GTTTCAGGGG ACAGTGAGTC TGTAGAAAAC CAAAGGGTAA TTGTTAATAC CGGGACAGGA AGTATATCTG CAAATACGGT GTTGAATGAA AATAGGGTAT ATAAGAGTTT TAATTTGAAA AGCAGGATTA TTAATTTGAA TGGTTATAGG TTTAAAGTTG AGGGGGATCT TGTACAGTCC GGAGGGACAT TGGATGTAAA CGGAGGAAGG TTGGAGGTAA CAGGAAACTA TACAATAAGT GGGTCCTCAT ATTTGGAGAT GACGGAAGAG GAAGATTATG TATTAGTAAG AGGGGATTTT GAGACAAGAA GCGATAATAA TCACGAAAAC AAGTTGACAG CAGGTACATT AGAGGTGAAA GGGAATTTTA CGAGGAAGGC TGGTGTGAGT GCTAATTTTA AAGCAAGTGG AACCCATAGG GTTGTATTAA GTGGAGAAAA GCAGCAGACA ATAGACTTTT CGAGCACGAA TATACAACAA TTTAACATAT TAGAGAATAA AAATACATCA GGGAAAGAGT TAATATTTAA AAATTCATAT AATGCGAAGA TATTTATAAA CAACACATCA GGTTTATCAC CGATGACAGT AGAGTATCAT 
GATTGGAATT TGACAGGTAA CGAAGTGATA AATGGAGACT TGTATATAAA GGGTAAAACA TTAAATCTTG CAGGAAAAAC ATTAAAAGTA AATGGGAACT TAATCCAGAC CGGGGGGTAC ATTGGATGTA AATGGAGGAA GATTAGAGGT AGAGGGAAAC TACACAATAA GCGGGTCTTC ATATTTGAAG ATGACGGAAG ATGA
|
Protein sequence | MNKWEIIKKT VSVCLLLSIT FSIFINSDVV LALSQKVENI NFGENINFYG VVQDSAIRLV TPTVGVEPTP IGSLTPAVTE TPLATPTQET LPSPSPSEFS TPTPSFTPDA SPESTSTPFP SPLPFPMPDS TSTPTPDPDS TSTPTPTPTP IPTPSVSPSP SDTEAPSRPE CLVTTDRTDT MISLSWSAST DNVGVKGYYI YRDGVKLDVS VTEPCFTDEG LTENTTYRYY VTAYDEAGNE SERSTELVVM TLANNITGLN AVVNVDGSIL VSWNQVARAA AYELMIDEYE SVCIYDTSYL HTGLLPNTRH TYRVRVRYAG NDYGAWSEKK VVFSFPGKPL DVGADIIDDN SVRIFWNQVQ GISKYNVYRD GELIAAEVKA VEFTDTGLTA GRDYEYEVRA VSGDSESVEN QRVIVNTGTG SISANTVLNE NRVYKSFNLK SRIINLNGYR FKVEGDLVQS GGTLDVNGGR LEVTGNYTIS GSSYLEMTEE EDYVLVRGDF ETRSDNNHEN KLTAGTLEVK GNFTRKAGVS ANFKASGTHR VVLSGEKQQT IDFSSTNIQQ FNILENKNTS GKELIFKNSY NAKIFINNTS GLSPMTVEYH DWNLTGNEVI NGDLYIKGKT LNLAGKTLKV NGNLIQTGGY IGCKWRKIRG RGKLHNKRVF IFEDDGR
|
| |
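The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, in which TTG is one of the permitted start codons and an initiating codon is read as methionine. The following is a minimal sketch of that translation step using only the Python standard library. The function name `translate_cds` is a hypothetical helper, not part of the source database, and only the first 30 bases of the gene sequence are used as input.

```python
from itertools import product

# NCBI genetic code 11 (bacterial, archaeal and plant plastid).
# Amino acids are listed in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at '*'."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    usable = len(dna) - len(dna) % 3    # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")         # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break                       # stop codon ends translation
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the Gene sequence field.
first_30_bases = "TTGAATAAGT GGGAAATTAT CAAGAAAACA"
print(translate_cds(first_30_bases))  # prints: MNKWEIIKKT
```

The output matches the first ten residues of the Protein sequence field.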