Gene Cthe_2410 details


Gene Information

Locus tag: Cthe_2410
Symbol:
ID: 4808125
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2876249
End bp: 2877841
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 40%
IMG OID: 640107823
Product: hypothetical protein
Protein accession: YP_001038805
Protein GI: 125974895
COG category: [L] Replication, recombination and repair
COG ID: [COG1315] Predicted polymerase, most proteins contain PALM domain, HD hydrolase domain and Zn-ribbon domain
TIGRFAM ID:
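
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length; neither assumption is stated on this page, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in the record.
length_bp = gene_length(2876249, 2877841)
print(length_bp, protein_length(length_bp))  # prints: 1593 530
```

Under these assumptions the computed values agree with the listed Gene Length (1593 bp) and Protein Length (530 aa).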


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCACAGG AAAATATTAT ATATTTTTCT GATTATATCT GCATAACGAA AGGGGCTGAC 
GGCTTTTATA TAGAATCCTA TAAAAAAGGC ATGTCAGTCG ATGAATTTAA TAAAATAATA
GGCCGGCATC CTGAAATAAA GATTACCAGT TTTATGGCAA TTAAAAATGC AATACTCTTT
GCGCCAAAAC CTCCGGTCAA ATTTGGTGAA GTTAAAGACA GAATCAGTGT TGAGCTCTCA
AGTGATGAGT TAAAAGCGTA TATCAGGCTT TGCGTGGAAG AATGGGAGTT TTCCGGGGAT
GCAAGGGTAA AGCTCATGGA GGAGATATCA AAGAGCCTGG AAAAAGCAGG AGTTGTATTT
GGAATTAAAG AGGATGTTTT GCTGGACGGA CTTTGCAACA ACAAGCAAAT TTTGATAGCC
GAAGGCATAC CTCCGGAGCA TGGCGAAGAT GCGGTAATAA GAATGTATGA AATAAAAAAA
GCAAAGCCTG CGATAAAAGA GGACGGCAGA GTGGATCATT ATGAGCTTAA CCTTATAAAC
AAGGTGAAAA CCGGAGACTG GTTGGGAGAA AGAATAGATC CCACTCCGGG AACTGCCGGC
AAATCGGTAA AGGGAAATCC GATACCCGCA AGACCCGGAA GAAATTATCC ATTGCATTAT
GACAAAAACT CAGTCAGAGA AGAACGCAAA GGCGGAGTGA CATATCTTTA TGCGCTGAAA
AGTGGGGCGG TACACTATGA AGGAGACAGG ATAAGTGTAT CCAATCACCT GGAGATAGAC
GGGGATGTGG ACTTTAAAAC GGGAAATATT AATTTTGACG GTTTTGTGAC TATAAAGGGA
ACTGTTGCGG ACGGATTTTC CGTAGTGGCA GTCAAAGATG TGGAAATACT TGGGACCATT
GGTATAGGTA GTGTAAAAGA AGTGGTCAGC AAAGAGGGGA GCATCTATAT CAAGGGTGGA
ATCGCCGGTA AAAACAAGGC GGTAATAAAG GCAAAAAAGG ATGTTTACAC AAAATATATT
TCCGATGCCA CTGTTGCTTG CGAAGGGAGT CTTCATGTGG GGCTTTATTG CATCAACAGC
AATATTACAG CCAGAGAGAT TATAATTGAC TCGCCGAAAG GACAAATATC AGGTGGAAAT
ATACAGTGTG AAACAAAAGT GTTATCCCCG GTTTTAGGTT CACCCAGTGA GAAACGTACG
GTTATATCGG TCAAAGGATT TAACAGAAAC ACCCTGAAAG AAAGGCTTGA GGAAGTGATG
AAAAATATAG AGACTTTGAA AAATGAATTG GTAAAAGTAA AAGCTGAGGT AAATGCCTAT
TTTGAAAATG AACAAAATGG GAAGGTCGGG AGTTTGAAAG CAGAAGACAT CAGACAGAGG
TTTAACCGTA TAAAAAATGA ATTAACGGAG CTTGAGGAAG AGAAAAAAGC GATTTCCGAT
ACATTAAGAA CCAGGGGGGA AGGAGAAATA TCCATATTAA AAAAAGCTTA CCCCGGTGTT
GTTATTGGAA TAAAAAATAT TATTAAGGAA ATAGACAGGC CGATAGTAAA TACCACTTTC
TATATACAGG ACGGATATAT AAAAGAGGTA TAG
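
The GC content field can be recomputed from a gene sequence in this layout. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases; the helper name `gc_percent` is hypothetical, and the rounding used for the listed value is not stated on this page.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base groups is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Example on the first two 10-base groups of the gene sequence.
print(gc_percent("ATGTCACAGG AAAATATTAT"))
```

Passing the full gene sequence (all lines joined) gives the value to compare against the listed GC content.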
 
Protein sequence
MSQENIIYFS DYICITKGAD GFYIESYKKG MSVDEFNKII GRHPEIKITS FMAIKNAILF 
APKPPVKFGE VKDRISVELS SDELKAYIRL CVEEWEFSGD ARVKLMEEIS KSLEKAGVVF
GIKEDVLLDG LCNNKQILIA EGIPPEHGED AVIRMYEIKK AKPAIKEDGR VDHYELNLIN
KVKTGDWLGE RIDPTPGTAG KSVKGNPIPA RPGRNYPLHY DKNSVREERK GGVTYLYALK
SGAVHYEGDR ISVSNHLEID GDVDFKTGNI NFDGFVTIKG TVADGFSVVA VKDVEILGTI
GIGSVKEVVS KEGSIYIKGG IAGKNKAVIK AKKDVYTKYI SDATVACEGS LHVGLYCINS
NITAREIIID SPKGQISGGN IQCETKVLSP VLGSPSEKRT VISVKGFNRN TLKERLEEVM
KNIETLKNEL VKVKAEVNAY FENEQNGKVG SLKAEDIRQR FNRIKNELTE LEEEKKAISD
TLRTRGEGEI SILKKAYPGV VIGIKNIIKE IDRPIVNTTF YIQDGYIKEV
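
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The following is a minimal sketch, assuming the standard NCBI codon ordering (bases in the order T, C, A, G), translation starting at the first base, and stopping at the first stop codon. The amino-acid assignments of table 11 are the same as those of the standard code (the tables differ in permitted start codons), so alternative start codons are not handled here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE.get(bases[i:i + 3], "X")  # "X" for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence.
print(translate("ATGTCACAGG AAAATATTAT ATATTTTTCT"))  # prints: MSQENIIYFS
```

The result matches the first ten residues of the protein sequence above.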