Gene Cthe_1525 details


Gene Information

Locus tag: Cthe_1525
Symbol:
ID: 4810563
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1850184
End bp: 1852052
Gene Length: 1869 bp
Protein Length: 622 aa
Translation table: 11
GC content: 36%
IMG OID: 640106945
Product: hypothetical protein
Protein accession: YP_001037946
Protein GI: 125974036
COG category:
COG ID:
TIGRFAM ID:
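
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 1850184
END_BP = 1852052
GENE_LENGTH_BP = 1869
PROTEIN_LENGTH_AA = 622


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for this record.
print(computed_length == GENE_LENGTH_BP)       # True
print(computed_protein == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions the reported 1869 bp and 622 aa agree with the listed coordinates.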


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAACTCAT TTAAATTGGC TGTTATGAGT TTTAGAAGAA ATATAAAAGC TTATGGAATG 
TATCTTATGG CAATGATTTT ATCAGTAGCC ACCTATTATA ATTTTGCATC TATGAGATTC
AACCCTCAAT TCCGGGAGGC AAGAGATTTA ACTGTATATG TACAGAGTTC ATCAGTGGTT
GCCTCCCTGC TTATGATATT GTTTCTGATA TTTTTCATTA TGTATTCCGG CAACTTCTTT
CTGAACCAAA GGAAAAAGGA AATAGCAGTA TATGCTTTCA TGGGAATTGA TAACTATAAA
ATTGCCTTTA TGTTTGCATC GGAAGGATTG TTGATGGGGA TAATGTCTTT GGTAATCGGC
CTGTCGCTTG GAATTCTGTT CAGCAAATTG TTTCTGATGT TGCTTGCAAA GGTAGCTTTA
CTGAATATGA GAATTAATTT CTTCATATCA GTAAAGGCTA TTGTAGAGAC TGTAGTTGCA
TATTTGGTCA TTTTATTTAT TACATTCCTG AAAGGATATA TAGATGTTGT CAGGACAAAT
TTGATTGATT TGATAAATAC GTTGAAAAAA TCGGAGGAGC TTCCTAAAAT TAATTATTTA
AAAGGCATTG CCTCATTAAT GGTTATAGGT GCTGCATATT ATATTGCGGT AAATTATGGC
AAGTTCGGGT TTGGAAAAGC CCTCTTATGG ACAGTGATTC TGGTCGTTAT AGGCACTTAC
TGGCTGTTTG GTTCTCTTTT ATCAATGATT ATCAGGTACT TCATAAGCAG AAAAAAGTTT
TTGTATAAAG GCACAAATAT TATAAGCTTT TCAAATATAG CCTTTAGGAT AAAGGGCAAC
TATAGGGCCC TTGCAGCAGT AGCGGTATCG ATAACTGTGT GTATAACATC CTTTGGTACG
GTTAGCTCTC TTAAGTATTT TGTAAATGAG AACCATAAGA TTGAGGTACC ATATACTGTT
ACCTATATTT CCGAAAAACA GGAAGAAATA GAAAGAGTGG ATGAAATAAT AGGAAAATCG
AATCATAACG TTAAGCTGAA AGAAAAGGCC AACTTTTTGT TTGTCCCTGA TTCACAGGTT
GTAGTGGTGA AACTGTCCAC TTTTCAAAGG ATACTGACGG ATCTTAATGT TAAAGGGCGG
GATAAAATTT TATCTAAAAT TGGACAGCTG AAGGAAGAAG CGGTATATGT AGAGAGACCC
GGAGTCTTTA TGAGCCTGTT GGAAAAAAAT GATATAAAAA TAGGTGACAG GGTCTACAGA
ATAAAAGCTC AGACAAAGAT TCCTTTGTTT GGAAGCGGAT TGCCTTTTCC TTGTGTTGTT
GTCGGCGAGG AAGAATATGA AACATTAAAG TCTGAATTTG AAGAGAAACA GTTTAATGGA
ATTATACTTG ACAATCCGGA AGACACAAAG GATTTGACTT TACAGCTGGC TCAAATACTG
CCGGAGAATT CAAGACTATT CACCTATTTT ATAGCTGGCG CTGCAATGTA CGACTTAATT
GGAATAGTAT ATTTTCTTGG AGCTTTCCTG TTTCTAGTGT TTGTATTTGC CACAGGCAGC
ATAATATACT TTAAGATTTT GAGCGAATCT TTCAGAGATA AAGATAAATA CGAAATACTT
AAGAAACTGG GGACAACGGA TGTTGAAATC AAAAAGTCCG TATCAAAACA GGTGGGTGTG
TTTTTCCTGT TGCCGCTGAT AGTGGGGATA ATCCACAGCA CAGTTGCCAT TTCAGTATTA
AGTGACCTTA TGAGTTATAG TTTGACAGTG CCGACAATTA TAAGTATTGG CGTATTTATA
ATTGTATATG CGATATTCTA TGTCTTTACC GGAAGAAAAT ATGTTAATGT TGTAAGAAAT
CAGGCTTGA
 
Protein sequence
MNSFKLAVMS FRRNIKAYGM YLMAMILSVA TYYNFASMRF NPQFREARDL TVYVQSSSVV 
ASLLMILFLI FFIMYSGNFF LNQRKKEIAV YAFMGIDNYK IAFMFASEGL LMGIMSLVIG
LSLGILFSKL FLMLLAKVAL LNMRINFFIS VKAIVETVVA YLVILFITFL KGYIDVVRTN
LIDLINTLKK SEELPKINYL KGIASLMVIG AAYYIAVNYG KFGFGKALLW TVILVVIGTY
WLFGSLLSMI IRYFISRKKF LYKGTNIISF SNIAFRIKGN YRALAAVAVS ITVCITSFGT
VSSLKYFVNE NHKIEVPYTV TYISEKQEEI ERVDEIIGKS NHNVKLKEKA NFLFVPDSQV
VVVKLSTFQR ILTDLNVKGR DKILSKIGQL KEEAVYVERP GVFMSLLEKN DIKIGDRVYR
IKAQTKIPLF GSGLPFPCVV VGEEEYETLK SEFEEKQFNG IILDNPEDTK DLTLQLAQIL
PENSRLFTYF IAGAAMYDLI GIVYFLGAFL FLVFVFATGS IIYFKILSES FRDKDKYEIL
KKLGTTDVEI KKSVSKQVGV FFLLPLIVGI IHSTVAISVL SDLMSYSLTV PTIISIGVFI
IVYAIFYVFT GRKYVNVVRN QA
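
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, where GTG can serve as an alternative start codon and the initiator codon is read as methionine. The sketch below illustrates this on the first block of the gene sequence. It is an illustration rather than the method used to produce this record: `translate_cds` is a hypothetical helper, and it assumes the input is a complete coding sequence whose first codon is a valid start codon.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each codon (e.g. "GTG") to its one-letter amino acid, "*" for stop.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the first codon is a start codon, so it is always
    emitted as M, even when it is GTG or TTG.
    """
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append("M" if pos == 0 else amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_block = "GTGAACTCAT TTAAATTGGC TGTTATGAGT"
print(translate_cds(first_block))  # MNSFKLAVMS
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same function should allow the whole translation to be compared against the 622-residue protein.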