Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2410 |
Symbol | |
ID | 4808125 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2876249 |
End bp | 2877841 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640107823 |
Product | hypothetical protein |
Protein accession | YP_001038805 |
Protein GI | 125974895 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1315] Predicted polymerase, most proteins contain PALM domain, HD hydrolase domain and Zn-ribbon domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACAGG AAAATATTAT ATATTTTTCT GATTATATCT GCATAACGAA AGGGGCTGAC GGCTTTTATA TAGAATCCTA TAAAAAAGGC ATGTCAGTCG ATGAATTTAA TAAAATAATA GGCCGGCATC CTGAAATAAA GATTACCAGT TTTATGGCAA TTAAAAATGC AATACTCTTT GCGCCAAAAC CTCCGGTCAA ATTTGGTGAA GTTAAAGACA GAATCAGTGT TGAGCTCTCA AGTGATGAGT TAAAAGCGTA TATCAGGCTT TGCGTGGAAG AATGGGAGTT TTCCGGGGAT GCAAGGGTAA AGCTCATGGA GGAGATATCA AAGAGCCTGG AAAAAGCAGG AGTTGTATTT GGAATTAAAG AGGATGTTTT GCTGGACGGA CTTTGCAACA ACAAGCAAAT TTTGATAGCC GAAGGCATAC CTCCGGAGCA TGGCGAAGAT GCGGTAATAA GAATGTATGA AATAAAAAAA GCAAAGCCTG CGATAAAAGA GGACGGCAGA GTGGATCATT ATGAGCTTAA CCTTATAAAC AAGGTGAAAA CCGGAGACTG GTTGGGAGAA AGAATAGATC CCACTCCGGG AACTGCCGGC AAATCGGTAA AGGGAAATCC GATACCCGCA AGACCCGGAA GAAATTATCC ATTGCATTAT GACAAAAACT CAGTCAGAGA AGAACGCAAA GGCGGAGTGA CATATCTTTA TGCGCTGAAA AGTGGGGCGG TACACTATGA AGGAGACAGG ATAAGTGTAT CCAATCACCT GGAGATAGAC GGGGATGTGG ACTTTAAAAC GGGAAATATT AATTTTGACG GTTTTGTGAC TATAAAGGGA ACTGTTGCGG ACGGATTTTC CGTAGTGGCA GTCAAAGATG TGGAAATACT TGGGACCATT GGTATAGGTA GTGTAAAAGA AGTGGTCAGC AAAGAGGGGA GCATCTATAT CAAGGGTGGA ATCGCCGGTA AAAACAAGGC GGTAATAAAG GCAAAAAAGG ATGTTTACAC AAAATATATT TCCGATGCCA CTGTTGCTTG CGAAGGGAGT CTTCATGTGG GGCTTTATTG CATCAACAGC AATATTACAG CCAGAGAGAT TATAATTGAC TCGCCGAAAG GACAAATATC AGGTGGAAAT ATACAGTGTG AAACAAAAGT GTTATCCCCG GTTTTAGGTT CACCCAGTGA GAAACGTACG GTTATATCGG TCAAAGGATT TAACAGAAAC ACCCTGAAAG AAAGGCTTGA GGAAGTGATG AAAAATATAG AGACTTTGAA AAATGAATTG GTAAAAGTAA AAGCTGAGGT AAATGCCTAT TTTGAAAATG AACAAAATGG GAAGGTCGGG AGTTTGAAAG CAGAAGACAT CAGACAGAGG TTTAACCGTA TAAAAAATGA ATTAACGGAG CTTGAGGAAG AGAAAAAAGC GATTTCCGAT ACATTAAGAA CCAGGGGGGA AGGAGAAATA TCCATATTAA AAAAAGCTTA CCCCGGTGTT GTTATTGGAA TAAAAAATAT TATTAAGGAA ATAGACAGGC CGATAGTAAA TACCACTTTC TATATACAGG ACGGATATAT AAAAGAGGTA TAG
|
Protein sequence | MSQENIIYFS DYICITKGAD GFYIESYKKG MSVDEFNKII GRHPEIKITS FMAIKNAILF APKPPVKFGE VKDRISVELS SDELKAYIRL CVEEWEFSGD ARVKLMEEIS KSLEKAGVVF GIKEDVLLDG LCNNKQILIA EGIPPEHGED AVIRMYEIKK AKPAIKEDGR VDHYELNLIN KVKTGDWLGE RIDPTPGTAG KSVKGNPIPA RPGRNYPLHY DKNSVREERK GGVTYLYALK SGAVHYEGDR ISVSNHLEID GDVDFKTGNI NFDGFVTIKG TVADGFSVVA VKDVEILGTI GIGSVKEVVS KEGSIYIKGG IAGKNKAVIK AKKDVYTKYI SDATVACEGS LHVGLYCINS NITAREIIID SPKGQISGGN IQCETKVLSP VLGSPSEKRT VISVKGFNRN TLKERLEEVM KNIETLKNEL VKVKAEVNAY FENEQNGKVG SLKAEDIRQR FNRIKNELTE LEEEKKAISD TLRTRGEGEI SILKKAYPGV VIGIKNIIKE IDRPIVNTTF YIQDGYIKEV
|
| |