Gene Cthe_3050 details


Gene Information

Locus tag: Cthe_3050
Symbol:
ID: 4811122
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3577303
End bp: 3579306
Gene Length: 2004 bp
Protein Length: 667 aa
Translation table: 11
GC content: 40%
IMG OID: 640108471
Product: fibronectin, type III
Protein accession: YP_001039439
Protein GI: 125975529
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0810] Periplasmic protein TonB, links inner and outer membranes
TIGRFAM ID:
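
The start, end, gene length and protein length fields above are related by simple arithmetic. The following Python sketch checks that relationship. It assumes that the coordinates are 1-based and inclusive and that the gene length counts the stop codon. Neither convention is stated in the record, although both agree with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3577303
end_bp = 3579306

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS is a whole number of codons and the final
# codon is the stop codon, which contributes no amino acid.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # 2004 0 667
```

Under these assumptions the computed values match the Gene Length (2004 bp) and Protein Length (667 aa) fields.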


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000157497
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAATAAGT GGGAAATTAT CAAGAAAACA GTCTCAGTTT GTCTTTTATT ATCCATAACG 
TTTTCAATTT TTATTAATTC GGATGTTGTT TTGGCATTAA GTCAAAAAGT AGAGAATATA
AATTTTGGAG AGAATATAAA TTTCTATGGA GTAGTACAAG ATTCAGCAAT TAGATTGGTT
ACACCTACCG TGGGTGTTGA ACCGACCCCT ATTGGGAGCT TGACGCCGGC AGTCACAGAA
ACACCTTTGG CAACACCTAC TCAAGAGACG TTGCCGTCAC CGTCTCCATC AGAGTTTTCT
ACTCCTACGC CATCATTTAC TCCTGATGCA TCACCGGAAT CTACTTCTAC GCCATTTCCG
TCGCCGTTAC CATTCCCTAT GCCGGATTCA ACTTCTACTC CAACACCGGA TCCGGATTCA
ACTTCGACGC CAACGCCAAC TCCGACCCCA ATTCCGACTC CGTCAGTTTC ACCTTCACCA
TCGGATACGG AGGCACCGTC AAGACCGGAA TGTTTAGTTA CTACGGACAG AACTGACACA
ATGATTTCTT TGTCGTGGAG TGCTTCAACT GACAATGTTG GAGTAAAGGG TTACTATATA
TACAGAGACG GAGTAAAACT GGATGTTAGT GTGACAGAAC CATGTTTTAC TGATGAAGGG
TTAACAGAGA ACACCACATA CAGATATTAT GTAACAGCTT ACGATGAAGC GGGCAATGAA
TCGGAAAGAA GTACGGAACT TGTAGTTATG ACCCTTGCTA ATAATATTAC GGGGCTGAAT
GCTGTTGTAA ATGTGGATGG AAGTATATTG GTTTCGTGGA ATCAAGTTGC AAGAGCCGCG
GCATATGAAT TAATGATAGA TGAATATGAA TCCGTATGTA TCTATGATAC AAGTTATTTG
CATACAGGTC TTCTGCCGAA TACGCGTCAT ACATACAGAG TCAGGGTTAG ATATGCCGGT
AACGACTATG GGGCATGGAG CGAGAAAAAG GTAGTTTTCA GCTTTCCGGG CAAACCCTTG
GATGTGGGTG CTGATATAAT TGATGATAAT TCTGTGAGGA TATTCTGGAA CCAAGTTCAG
GGGATAAGTA AGTATAACGT GTATAGAGAC GGTGAGTTGA TAGCTGCTGA AGTCAAAGCG
GTTGAGTTTA CGGATACGGG TTTGACTGCA GGTAGAGATT ATGAATATGA GGTAAGAGCA
GTTTCAGGGG ACAGTGAGTC TGTAGAAAAC CAAAGGGTAA TTGTTAATAC CGGGACAGGA
AGTATATCTG CAAATACGGT GTTGAATGAA AATAGGGTAT ATAAGAGTTT TAATTTGAAA
AGCAGGATTA TTAATTTGAA TGGTTATAGG TTTAAAGTTG AGGGGGATCT TGTACAGTCC
GGAGGGACAT TGGATGTAAA CGGAGGAAGG TTGGAGGTAA CAGGAAACTA TACAATAAGT
GGGTCCTCAT ATTTGGAGAT GACGGAAGAG GAAGATTATG TATTAGTAAG AGGGGATTTT
GAGACAAGAA GCGATAATAA TCACGAAAAC AAGTTGACAG CAGGTACATT AGAGGTGAAA
GGGAATTTTA CGAGGAAGGC TGGTGTGAGT GCTAATTTTA AAGCAAGTGG AACCCATAGG
GTTGTATTAA GTGGAGAAAA GCAGCAGACA ATAGACTTTT CGAGCACGAA TATACAACAA
TTTAACATAT TAGAGAATAA AAATACATCA GGGAAAGAGT TAATATTTAA AAATTCATAT
AATGCGAAGA TATTTATAAA CAACACATCA GGTTTATCAC CGATGACAGT AGAGTATCAT
GATTGGAATT TGACAGGTAA CGAAGTGATA AATGGAGACT TGTATATAAA GGGTAAAACA
TTAAATCTTG CAGGAAAAAC ATTAAAAGTA AATGGGAACT TAATCCAGAC CGGGGGGTAC
ATTGGATGTA AATGGAGGAA GATTAGAGGT AGAGGGAAAC TACACAATAA GCGGGTCTTC
ATATTTGAAG ATGACGGAAG ATGA
 
Protein sequence
MNKWEIIKKT VSVCLLLSIT FSIFINSDVV LALSQKVENI NFGENINFYG VVQDSAIRLV 
TPTVGVEPTP IGSLTPAVTE TPLATPTQET LPSPSPSEFS TPTPSFTPDA SPESTSTPFP
SPLPFPMPDS TSTPTPDPDS TSTPTPTPTP IPTPSVSPSP SDTEAPSRPE CLVTTDRTDT
MISLSWSAST DNVGVKGYYI YRDGVKLDVS VTEPCFTDEG LTENTTYRYY VTAYDEAGNE
SERSTELVVM TLANNITGLN AVVNVDGSIL VSWNQVARAA AYELMIDEYE SVCIYDTSYL
HTGLLPNTRH TYRVRVRYAG NDYGAWSEKK VVFSFPGKPL DVGADIIDDN SVRIFWNQVQ
GISKYNVYRD GELIAAEVKA VEFTDTGLTA GRDYEYEVRA VSGDSESVEN QRVIVNTGTG
SISANTVLNE NRVYKSFNLK SRIINLNGYR FKVEGDLVQS GGTLDVNGGR LEVTGNYTIS
GSSYLEMTEE EDYVLVRGDF ETRSDNNHEN KLTAGTLEVK GNFTRKAGVS ANFKASGTHR
VVLSGEKQQT IDFSSTNIQQ FNILENKNTS GKELIFKNSY NAKIFINNTS GLSPMTVEYH
DWNLTGNEVI NGDLYIKGKT LNLAGKTLKV NGNLIQTGGY IGCKWRKIRG RGKLHNKRVF
IFEDDGR
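
The protein sequence can be related to the gene sequence by translating codons with translation table 11, as listed in the Gene Information section. The Python sketch below is a minimal illustration, not the method used to generate this record. The codon assignments and the start-codon set are assumed to follow NCBI translation table 11, and the helper names (`clean`, `translate`, `gc_percent`) are hypothetical. An alternative start codon such as TTG is read as M only when it is the first codon of the CDS.

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering (third base varies fastest).
# Assumption: amino-acid assignments as in NCBI translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: start codons permitted by table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str, first_is_start: bool = True) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence (starts with the alternative start TTG).
print(translate("TTGAATAAGT GGGAAATTAT CAAGAAAACA"))  # MNKWEIIKKT

# Last 24 bases of the gene sequence; this fragment is not a CDS start.
print(translate("ATATTTGAAG ATGACGGAAG ATGA", first_is_start=False))  # IFEDDGR
```

The two fragments reproduce the first ten and last seven residues of the protein sequence above. `gc_percent` could be applied to the full gene sequence for comparison with the GC content field. No result is quoted here because it was not run on the full sequence.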