Gene Cthe_0515 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0515 
Symbol 
ID4808317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp630040 
End bp631575 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content46% 
IMG OID640105930 
Producttransposase IS66 
Protein accessionYP_001036945 
Protein GI125973035 
COG category[L] Replication, recombination and repair 
COG ID[COG3436] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.016162 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACAG CAGAGCAGAT AGCAACCCTG GAAAACCGCA TAAACGAGTT GGAGCTGGAA 
AACAAACGGC TTCATGAAAC AGTTGCTTAT CTGACTCGTA AGCTGTACGG CAGAAGTTCT
GAGAAGACAT CAGCCCTTTC TGTGGGGCAG GTGTCTCTTT TTGATGAAGC AGAGGTTTAT
GCTGTTCCGC AGGCACCGGA GCCTGATCTT AAAGAAGTAC AGGGCTACAT TAGAAGGAAG
TACAAGGGCC AGAGGACTGA TCTTTTAAAA GACATCCCTC ATGACAAACG TCTCTGTACA
CTTGCAGAAG AAGACCGCTA TTGTGAGGCT TGCGGAACAG ACCTCGTTTC TGTCGGAAAA
GAATTCATCC GCACTGAGAT CGAATTCATT CCTGCTAAGA TCCGGGTAAT CGACTATTAC
CGTGAAACCT TTGAATGCCG TACCTGTCGC AAAAATGGAG AGCCATATAT GGAAAAGTCG
CCAATGCCAT ATCCTGTGAT TCAGCATTCT ATGGCATCTC CTTCTACTGT AGCATGGATT
ATGCATCAGA AGTTTGTAAA CCATCTCCCT CTTTACCGCC AGGAAAATGA GTGGAAGATG
CTGGGTGTCA ATTTAAAGCG GGAGACTATG TCCAACTGGA TTCTGGCTGC AGCTCGTGAC
TGGCTGATGC CATTGGTGGA TTTGATGCAT AAAAAACTCC TGCAGGAAAA ATACCTGCAT
GCCGACGAAA CCACGGTTCA GGTGCTAAAT GAGGAAGGCC GGAGCAACAC CACGAACTCA
TACATGTGGG TATACAGTAG CGGGAAGTAC TGTAAAAAGC AGATCAGGCT CTTCCAGTAC
CAGCCCGGGC GTAATGGTAA ATATCCTCAG GAATTCCTTA AAGGGTTCAG TGGATTTCTA
CATACAGATG CTTACTCCGG GTATAAGAAA GTTCCGGAGA TTACAAGGTG TATGTGTTGG
ACACATCTTC GGCGATATTT CCGGGATGCA CTTCCGAAAG ATACCCAGAG TCCGGAAGCA
ACCATTCCAA GCCAGGGAAT AAGATTCTGC AACAAGCTGT TTGAAATTGA AGAGACTCTT
GAAAAACTTA CTCAGGAGCA GCGAAGATTG GAGCGTCTGA AACAGGAAAC ACCCGTTTTA
GAGGCCTTTT GGTCGTGGGT TGATTCGGTT AAAGACAAGG TCCTGCCAAA GTCTAAAATA
GGTGAAGCCA TTCAATATGC CCTGAATAAC AAGGAAGACT TCATGAACTA TCTTTTAGAC
GGTAACTGCT CCATATCTAA TAACCTCTCG GAGAACAGCA TTCGTCCCTT TACCCTGGGA
AGAAAAAACT GGCTGTTCAG CGGAAGCCCG AGAGGAGCGG ATGCAAGCGC TGCTGTTTAT
AGCATTGTCG AAAGTGCTAA GGCTAACGAT ATTAACCCAT ATAAATATCT TTATTACATC
TTTAGCGAAC TACCGGGTGT GCAGTTCGGC CAGAATCCTG AATTCCTGGA AGATTATCTC
CCATGGAGTC CCGATGTACA AGCCGCCTGT AAATAG
 
Protein sequence
MSTAEQIATL ENRINELELE NKRLHETVAY LTRKLYGRSS EKTSALSVGQ VSLFDEAEVY 
AVPQAPEPDL KEVQGYIRRK YKGQRTDLLK DIPHDKRLCT LAEEDRYCEA CGTDLVSVGK
EFIRTEIEFI PAKIRVIDYY RETFECRTCR KNGEPYMEKS PMPYPVIQHS MASPSTVAWI
MHQKFVNHLP LYRQENEWKM LGVNLKRETM SNWILAAARD WLMPLVDLMH KKLLQEKYLH
ADETTVQVLN EEGRSNTTNS YMWVYSSGKY CKKQIRLFQY QPGRNGKYPQ EFLKGFSGFL
HTDAYSGYKK VPEITRCMCW THLRRYFRDA LPKDTQSPEA TIPSQGIRFC NKLFEIEETL
EKLTQEQRRL ERLKQETPVL EAFWSWVDSV KDKVLPKSKI GEAIQYALNN KEDFMNYLLD
GNCSISNNLS ENSIRPFTLG RKNWLFSGSP RGADASAAVY SIVESAKAND INPYKYLYYI
FSELPGVQFG QNPEFLEDYL PWSPDVQAAC K