Gene Cthe_1307 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1307 
Symbol 
ID4809560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1586303 
End bp1588198 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content43% 
IMG OID640106731 
Productcellulosome anchoring protein, cohesin region 
Protein accessionYP_001037732 
Protein GI125973822 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAAGA AAAAAAGATT AATATCATTA CTGCTTGCGG TTTTTATCGC CGTTGCATGT 
CTGCCGGCGG GAATTGCAAG GGCAGATAAA GCCTCGAGCA TTGAGCTTAA GTTTGACCGC
AATAAGGGAG AAGTTGGAGA TATACTTATT GGTACCGTAA GGATAAACAA TATCAAGAAT
TTCGCAGGAT TTCAGGTAAA CATTGTATAT GATCCAAAAG TCTTAATGGC TGTTGACCCT
GAAACGGGGA AAGAATTTAC TTCTTCAACA TTTCCGCCAG GACGCACTGT ACTGAAAAAC
AATGCTTACG GCCCAATACA GATTGCGGAC AATGATCCGG AAAAAGGGAT ACTGAACTTC
GCGCTTGCAT ATTCATATAT TGCGGGATAC AAAGAAACAG GAGTAGCGGA GGAAAGCGGC
ATAATTGCGA AAATTGGATT TAAAATACTC CAGAAAAAGA GCACTGCCGT AAAATTCCAG
GATACATTAA GCATGCCCGG AGCTATTTCG GGAACACAGC TGTTTGACTG GGACGGAGAA
GTTATTACCG GATATGAGGT AATACAGCCG GATGTGCTGA GTTTGGGTGA CGAGCCTTAT
GAGACACCGG GAACGGATAT TCCGATATCC GACAATCCGG CAGCAACTCC GTCATCCACG
CCGTCAGTTA CTCCTTCACC GGAAGTTAAA CCGACTCAGA CGCCTTCGCC TGCAGAAAAT
TCTGCAAAAG TGGAGCTTGA ACCTGTGTTG GATAATGCAA CAGGAGAAGC AAAGGCGGCA
ATAGATGAAG AAAAATTAAA CAAGGCTCTT GATGAAGCGA AAAAATCGGA AGATGACAAA
CTTGTGGAAC TTAACATAAA GAAGGTTGAA AATGCCGATG CTTACATACA ACAGCTTCCG
GCGAAATTCC TGATAAAAAG TGACGCCGAA TATAAGCTGA GAATAGCTAC AGAGCAGGGA
ATTATAGAAG TACCGGCCAA CATGCTGAAT ACTGCGGATA TTTCAAAGCT TGTAAAAAAT
GACTCCGTTG TTGAATTCGT CATAAGAAAA GTAAAAGTCG ATGAACTTGG TGCAGAGCTC
AAAGAGAAGA TAGGCAACAG GCCGGTGATT GACATAAGCG TGGTTGTTGA CGGCAAAAAA
GTTGAATGGA GCAATTACAA AGCCAAGGTT AAAATATCAA TTCCTTACAA GCCTGATGCA
AAAGAGCTGG AGAACCACGA GCATATTGTT GTACTCCATA TTGATGACGC CGGCAAGGCA
GTTTCCGTAC CCAGCGGAAA ATATGAACCT TCTTTGGGCG TCGTTACGTT TGAGACGAAT
CATTTAAGCA AGTATGCGGT TTCATATGTT TACAAGACTT TCGCGGATAT TGGTTCATAT
GCCTGGGCTA AAAAGCAGAT AGAGGTTTTG GCTTCCAAAG GAGTAATTAA CGGTACATCC
GATACCACTT TTACGCCCCA GGCAGACATA ACAAGGGCGG ATTTCATGAT ACTTCTTGTA
AAGGCACTGG GATTGACTGC CGAGGTTACT TCCAATTTTG ATGATGTGTC CGAAAAAGAC
TACTATTATG AATACGTGGG AATTGCAAAA GAGCTTGGAA TTACGACAGG AGTCGGAAAC
AACAAGTTCA ATCCGAAAGC CAAAATTACA AGACAGGATA TGATGGTACT TACAACAAAT
GCTCTCAGGA TTGCAGGAAA AATATCGAGC ACAGGAACCC GCGCTGATGT TGAAAGATTT
TCGGACAAGG ACCAGATAGC TTCATATGCG GTTGAAGGCG TTGCAACCTT GGTAAAAGAA
GGTATTGTAG TGGGAAGCGG CGATATTATA AATCCAAGGG GAAATGCTTC AAGAGCCGAA
CTTGCAGCAA TCATATACAA GATTTACTAC AAGTAA
 
Protein sequence
MRKKKRLISL LLAVFIAVAC LPAGIARADK ASSIELKFDR NKGEVGDILI GTVRINNIKN 
FAGFQVNIVY DPKVLMAVDP ETGKEFTSST FPPGRTVLKN NAYGPIQIAD NDPEKGILNF
ALAYSYIAGY KETGVAEESG IIAKIGFKIL QKKSTAVKFQ DTLSMPGAIS GTQLFDWDGE
VITGYEVIQP DVLSLGDEPY ETPGTDIPIS DNPAATPSST PSVTPSPEVK PTQTPSPAEN
SAKVELEPVL DNATGEAKAA IDEEKLNKAL DEAKKSEDDK LVELNIKKVE NADAYIQQLP
AKFLIKSDAE YKLRIATEQG IIEVPANMLN TADISKLVKN DSVVEFVIRK VKVDELGAEL
KEKIGNRPVI DISVVVDGKK VEWSNYKAKV KISIPYKPDA KELENHEHIV VLHIDDAGKA
VSVPSGKYEP SLGVVTFETN HLSKYAVSYV YKTFADIGSY AWAKKQIEVL ASKGVINGTS
DTTFTPQADI TRADFMILLV KALGLTAEVT SNFDDVSEKD YYYEYVGIAK ELGITTGVGN
NKFNPKAKIT RQDMMVLTTN ALRIAGKISS TGTRADVERF SDKDQIASYA VEGVATLVKE
GIVVGSGDII NPRGNASRAE LAAIIYKIYY K