Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3080 |
Symbol | |
ID | 4809954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3634961 |
End bp | 3636304 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640108504 |
Product | cellulosome anchoring protein, cohesin region |
Protein accession | YP_001039469 |
Protein GI | 125975559 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAGAA TAAAAAGAAT TTTAGCAGTG CTTACAATTT TCGCTTTGCT TGCAACTATT AATGCATTCA CGTTTGTTTC ACTGGCACAA ACAAACACCA TTGAAATAAT TATAGGTAAT GTCAAAGCAC GACCGGGTGA CAGAATTGAG GTGCCGGTAA GCCTGAAAAA TGTTCCTGAC AAAGGAATAG TCAGTTCAGA CTTCGTAATT GAATATGACT CAAAACTCTT TAAAGTAATA GAATTAAAGG CCGGAGACAT TGTGGAAAAT CCTTCAGAAA GCTTTAGTTA CAATGTAGTG GAGAAGGACG AAATTATTGC CGTTTTGTAT TTGGAAGAAA CCGGTTTGGG TATCGAGGCC ATAAGAACCG ACGGAGTATT CTTTACAATA GTGATGGAAG TAAGCAAAGA TGTAAAGCCG GGGATTAGCC CGATAAAATT TGAAAGCTTT GGGGCTACTG CAGATAATGA TATGAACGAA ATGACCCCAA AACTTGTGGA AGGTAAAGTG GAAATTATTG AAGCATCCGC TCCGGAGGCA ACTCCGACAC CGGGTTCAAC GGCCGGATCG GGTGCAGGTG GCGGTACGGG TTCTTCCGGT TCCGGACAGC CGTCAGCAAC GCCAACGCCA ACGGCAACGG AAAAACCGTC AACTACTCCA AAGACAACTG AGCAGCCGCA TGAAGACATA CCTCAGAGCG GTGGTACAGG CGAGCATGCA CCGTTCCTTA AAGGATATCC GGGGGGACTG TTCAAGCCTG AGAACAATAT TACAAGGGCG GAAGCGGCAG TTATCTTTGC CAAACTTTTA GGTGCGGATG AAAACAGCGC AGGCAAAAAT TCATCCATCA CTTTTAAGGA TTTAAAAGAC AGCCACTGGG CGGCATGGGC TATAAAATAT GTTACGGAGC AAAATCTCTT TGGCGGCTAT CCCGACGGAA CTTTTATGCC GGACAAGAGC ATAACAAGGG CTGAATTTGC AACCGTTACT TACAAATTCC TTGAGAAACT TGGAAAAATC GAACAGGGAA CCGATGTCAA GACTCAGTTA AAAGACATAG AAGGACACTG GGCTCAAAAG TATATTGAGA CTTTGGTTGC AAAAGGATAT ATAAAAGGCT ATCCTGATGA AACTTTCAGA CCTCAGGCAA GTATTAAGAG GGCGGAATCT GTAGCTCTCA TTAACAGATC CCTTGAAAGA GGTCCGCTGA ACGGTGCAGT TCTTGAGTTT ACGGATGTTC CTGTAAACTA TTGGGCATAC AAGGATATAG CTGAGGGTGT AATTTATCAC AGTTATAAAA TTGATGAAAA CGGACAGGAA GTAATGGTTG AAAAGCTTGA TTAA
|
Protein sequence | MKRIKRILAV LTIFALLATI NAFTFVSLAQ TNTIEIIIGN VKARPGDRIE VPVSLKNVPD KGIVSSDFVI EYDSKLFKVI ELKAGDIVEN PSESFSYNVV EKDEIIAVLY LEETGLGIEA IRTDGVFFTI VMEVSKDVKP GISPIKFESF GATADNDMNE MTPKLVEGKV EIIEASAPEA TPTPGSTAGS GAGGGTGSSG SGQPSATPTP TATEKPSTTP KTTEQPHEDI PQSGGTGEHA PFLKGYPGGL FKPENNITRA EAAVIFAKLL GADENSAGKN SSITFKDLKD SHWAAWAIKY VTEQNLFGGY PDGTFMPDKS ITRAEFATVT YKFLEKLGKI EQGTDVKTQL KDIEGHWAQK YIETLVAKGY IKGYPDETFR PQASIKRAES VALINRSLER GPLNGAVLEF TDVPVNYWAY KDIAEGVIYH SYKIDENGQE VMVEKLD
|
| |