Gene Cthe_3080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3080 
Symbol 
ID4809954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3634961 
End bp3636304 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content43% 
IMG OID640108504 
Productcellulosome anchoring protein, cohesin region 
Protein accessionYP_001039469 
Protein GI125975559 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAGAA TAAAAAGAAT TTTAGCAGTG CTTACAATTT TCGCTTTGCT TGCAACTATT 
AATGCATTCA CGTTTGTTTC ACTGGCACAA ACAAACACCA TTGAAATAAT TATAGGTAAT
GTCAAAGCAC GACCGGGTGA CAGAATTGAG GTGCCGGTAA GCCTGAAAAA TGTTCCTGAC
AAAGGAATAG TCAGTTCAGA CTTCGTAATT GAATATGACT CAAAACTCTT TAAAGTAATA
GAATTAAAGG CCGGAGACAT TGTGGAAAAT CCTTCAGAAA GCTTTAGTTA CAATGTAGTG
GAGAAGGACG AAATTATTGC CGTTTTGTAT TTGGAAGAAA CCGGTTTGGG TATCGAGGCC
ATAAGAACCG ACGGAGTATT CTTTACAATA GTGATGGAAG TAAGCAAAGA TGTAAAGCCG
GGGATTAGCC CGATAAAATT TGAAAGCTTT GGGGCTACTG CAGATAATGA TATGAACGAA
ATGACCCCAA AACTTGTGGA AGGTAAAGTG GAAATTATTG AAGCATCCGC TCCGGAGGCA
ACTCCGACAC CGGGTTCAAC GGCCGGATCG GGTGCAGGTG GCGGTACGGG TTCTTCCGGT
TCCGGACAGC CGTCAGCAAC GCCAACGCCA ACGGCAACGG AAAAACCGTC AACTACTCCA
AAGACAACTG AGCAGCCGCA TGAAGACATA CCTCAGAGCG GTGGTACAGG CGAGCATGCA
CCGTTCCTTA AAGGATATCC GGGGGGACTG TTCAAGCCTG AGAACAATAT TACAAGGGCG
GAAGCGGCAG TTATCTTTGC CAAACTTTTA GGTGCGGATG AAAACAGCGC AGGCAAAAAT
TCATCCATCA CTTTTAAGGA TTTAAAAGAC AGCCACTGGG CGGCATGGGC TATAAAATAT
GTTACGGAGC AAAATCTCTT TGGCGGCTAT CCCGACGGAA CTTTTATGCC GGACAAGAGC
ATAACAAGGG CTGAATTTGC AACCGTTACT TACAAATTCC TTGAGAAACT TGGAAAAATC
GAACAGGGAA CCGATGTCAA GACTCAGTTA AAAGACATAG AAGGACACTG GGCTCAAAAG
TATATTGAGA CTTTGGTTGC AAAAGGATAT ATAAAAGGCT ATCCTGATGA AACTTTCAGA
CCTCAGGCAA GTATTAAGAG GGCGGAATCT GTAGCTCTCA TTAACAGATC CCTTGAAAGA
GGTCCGCTGA ACGGTGCAGT TCTTGAGTTT ACGGATGTTC CTGTAAACTA TTGGGCATAC
AAGGATATAG CTGAGGGTGT AATTTATCAC AGTTATAAAA TTGATGAAAA CGGACAGGAA
GTAATGGTTG AAAAGCTTGA TTAA
 
Protein sequence
MKRIKRILAV LTIFALLATI NAFTFVSLAQ TNTIEIIIGN VKARPGDRIE VPVSLKNVPD 
KGIVSSDFVI EYDSKLFKVI ELKAGDIVEN PSESFSYNVV EKDEIIAVLY LEETGLGIEA
IRTDGVFFTI VMEVSKDVKP GISPIKFESF GATADNDMNE MTPKLVEGKV EIIEASAPEA
TPTPGSTAGS GAGGGTGSSG SGQPSATPTP TATEKPSTTP KTTEQPHEDI PQSGGTGEHA
PFLKGYPGGL FKPENNITRA EAAVIFAKLL GADENSAGKN SSITFKDLKD SHWAAWAIKY
VTEQNLFGGY PDGTFMPDKS ITRAEFATVT YKFLEKLGKI EQGTDVKTQL KDIEGHWAQK
YIETLVAKGY IKGYPDETFR PQASIKRAES VALINRSLER GPLNGAVLEF TDVPVNYWAY
KDIAEGVIYH SYKIDENGQE VMVEKLD