Gene Information |
Locus tag | Cthe_0833 |
Symbol | |
ID | 4810451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1013569 |
End bp | 1014804 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106250 |
Product | exodeoxyribonuclease VII large subunit |
Protein accession | YP_001037261 |
Protein GI | 125973351 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1570] Exonuclease VII, large subunit |
TIGRFAM ID | [TIGR00237] exodeoxyribonuclease VII, large subunit |
|
|
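The coordinate and length fields in the table above can be cross-checked against each other. The sketch below is illustrative and not part of the original record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 1013569
END_BP = 1014804
GENE_LENGTH_BP = 1236
PROTEIN_LENGTH_AA = 411


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Both checks pass for this record: 1236 bp and 411 aa.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```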
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0015301 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGGAGT TTTTTTCTGA CAATTGTGAC AACATATTAA CGGTTTCAGC AGTCAACAGA TATATCAAGG AAATCATGTC AAGGGATCTG ATCCTGTCAA ATCTTTGGGT TAGAGGCGAA ATATCGAATT TTAAATATCA TTCTTCGGGT CATATGTATT TTACCCTTAA GGATGAGAAC TGTTCACTAA AATGTGTGAT GTTCAGAACA TACAACTTGC ACCTTAAATT TATGCCTGAA AACGGCATGA AGGTGATAGT AAAGGGCTAT ATTTCGGTAT TTGAAAGGGA CGGACAATAT CAGCTCTATG CTGAGGAAAT GCAAAATGAC GGTATAGGAG ACCTTTATAT TGCTTTTGAA CAGCTAAAGA GAAGACTTGC AAGCGAAGGT CTTTTTGATC CGGCACACAA GAAAAAGATA CCGTTTATGC CGAGGACAAT AGGAGTGGTT ACCTCTGCCA CCGGTTCGGT TATCAGAGAT ATTATGAATA TTTTGGACAG ACGGTTCTAT AATTCATATA TAAAGATATT TCCTGTCAGG GTCCAGGGTG AAACCGCCGC TTTGGAAATA AGCCATGCGA TAAGCAAATT GAATGAAATC GGCGGTGTGG ATGTCATTAT CCTTGCCAGA GGTGGAGGCT CTTTGGAGGA ATTATGGCCG TTTAACGAGG AAATAGTGGC AAGAAGCATA TTTAATTCTT CCATACCGGT AATATCGGCC GTGGGACATG AGACGGACTA TACAATAGCG GATTTTGTTG CAGATTTAAG GGCGCCCACT CCATCAGCGG CCGCCGAATT GGTAATGCCT GAAAAAGTAA CTATTATAAA CAGAATAAGA GAGCTTAATG TCAGGATGGT GGACGCACTT CAAAGAAATG TAAAGCAAAA AAGGGATATG CTTAAAAAAC TTGCCGATTC AGTAGTTTTC AGGCAGCCAT ATGACAGAAT ATATCAGGAA AGAATGAAGC TGGACATTTT AAACAGGGAC TTGAAAAAGA GCATGTTTGC TTCTTTAGAG AGGGCAGGAT CAAAGCTTGG ATTTTTGATA GGAAAACTTG ACGCATTAAG CCCCCTTACT ATATTATCAA GGGGATACGG AATTATAAAG TCGGAAGAAA AAGGGATTTT TGTAAAATCC GTTAACGATG TGGATGTCGG AGAAGGAATT GAAGTGAGTG TGAAAGACGG AAGGCTTTAC TGCACGGTAA GGAAGAAGGA ATTGAATGAT GATTAA
|
Protein sequence | MGEFFSDNCD NILTVSAVNR YIKEIMSRDL ILSNLWVRGE ISNFKYHSSG HMYFTLKDEN CSLKCVMFRT YNLHLKFMPE NGMKVIVKGY ISVFERDGQY QLYAEEMQND GIGDLYIAFE QLKRRLASEG LFDPAHKKKI PFMPRTIGVV TSATGSVIRD IMNILDRRFY NSYIKIFPVR VQGETAALEI SHAISKLNEI GGVDVIILAR GGGSLEELWP FNEEIVARSI FNSSIPVISA VGHETDYTIA DFVADLRAPT PSAAAELVMP EKVTIINRIR ELNVRMVDAL QRNVKQKRDM LKKLADSVVF RQPYDRIYQE RMKLDILNRD LKKSMFASLE RAGSKLGFLI GKLDALSPLT ILSRGYGIIK SEEKGIFVKS VNDVDVGEGI EVSVKDGRLY CTVRKKELND D
|
| |
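The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch, not part of the original record, with hypothetical helper names. It removes the display whitespace and computes a GC fraction that can be compared with the GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Usage with the first two display blocks of the gene sequence.
# Paste the full "Gene sequence" field here to check it against the table.
blocks = "ATGGGGGAGT TTTTTTCTGA"
seq = clean_sequence(blocks)
print(len(seq), round(gc_fraction(seq), 2))
```

With the full gene sequence, `len(seq)` would be expected to match the Gene Length field, and `gc_fraction(seq)` to be close to the reported GC content, assuming that value is rounded to a whole percent.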