Gene Information |
Locus tag | Cthe_2737 |
Symbol | uvrC |
ID | 4810239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3228762 |
End bp | 3230639 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640108156 |
Product | excinuclease ABC subunit C |
Protein accession | YP_001039129 |
Protein GI | 125975219 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0322] Nuclease subunit of the excinuclease complex |
TIGRFAM ID | [TIGR00194] excinuclease ABC, C subunit |
|
|
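The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
start_bp = 3228762
end_bp = 3230639
reported_gene_length = 1878      # bp
reported_protein_length = 625    # aa


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length):
    """Residue count for a CDS, assuming it ends in exactly one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under those assumptions both comparisons hold for this record: 3230639 - 3228762 + 1 = 1878 bp, and 1878 / 3 - 1 = 625 aa.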
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTTGACA TTCAGGAGGA ATTGAAAAAA CTGCCGGACA AGCCGGGCGT CTACATTATG AAAGATGAAA ACGGCGAAAT AATTTATGTG GGGAAAGCCG TTGTTTTAAA AAACAGGGTA AGACAGTATT TTCAATCCTT GTCAAACCAG ACTCCCAAAG TCAGAGCGAT GGTTGCCCAC ATCAAGGAAT TTGAGTATAT AGTAACCGAT ACGGAGCTTG AGGCTTTGAT ACTTGAATGC AACCTTATTA AAAAACACCG GCCGAAGTTT AATATATTAT TGAAGGATGA CAAAAACTAT CCTTATATAA AGGTTACCAT GAACGAAGAT TTTCCCCGTA TTTTAATGAC CCGCAGGGTT GAAAAGGACG GGGCAAAATA TTTCGGACCG TATACCAGCG CATATGCCGT CCGCGAGACG ATTGACCTTG TAAAAAAGCT GTTTCCCGTA AAAACATGCA GCAAAGTGCT TCCAAGAGAT ATCGGAAAAG GGCGGCCGTG CCTTAACTAT CATATATATC AGTGCCTTGG CCCCTGCCAG GGAAATGTGA GCAAGGAAGA ATACAGGTTT ATGATGCAGG ATGTGTGCAA CTTCCTGGGG GGAAGACAGG AGGATATAAT TAAGAAGCTT GAGAAGGATA TGAAAGAGGC GGCGGACAAT CTGGAGTTTG AGAGGGCCGC AAGAATAAGG GACAAAATCA ACAGCTTGAA GCATATTGCG GAGAAACAAA AAATTATATC CACTGCGATG GAAGACCAGG ATGTCATAGC TTTTGCAAAG AGTGAAACCG ACTCATGCAT ACAGGTGTTT TTCATCAGGG GAGGCAAGCT GATTGGGCGT GAACACTTTA TTTTGGAAGG AACGTCCGAT GTCAGCGACA GCGAATTGAT GACTGCATTT GTAAAGCAGT TTTACAGCAG TGCCGCTTAT GTACCCGGTC AGATAATTCT GCAAGAGGAC ATTGATGAAA TGGAAATTAT TGAGAAATGG CTGAGTGGAA AAAGAGGGAC GAAGACTTAT ATAAAGGTGC CGAGAAGAGG GGAAAAACTG AAGCTTGTTG AGATGGTATC CAAAAATGCC CTGATTGAGC TTAACCAATT TAAAGAAAGA ATTAAGAAAG AGGCGGCACT GGCAAAGGAA GGTATGGAAA AGCTGAAAGA GCTTTTAAAT CTTGACAGGC TCCCGAGAAG AATAGAAGCG TATGACATAT CCAACACAGG AAGCACTGAA ATTGTGGGTT CAATGGTCGT CTTTGAAAAC GGGTCTCCTA AAAAAAGTGA TTACAGGAGG TTTAAAATAA AGTCAATCAA TGTACAGAAT GATTATCAGA GCATGCAGGA GGTTATTTTC AGACGGCTTA AACGGGCCCA AAAAGAAATG ACGGAAAAAG ATGAAGGCGG CGGAAAAGAT GTTGGCGAAA AAGGCGCAGG ATTTGGAACG CTTCCGGATG TTTTGCTTGT GGACGGGGGA ACCGGGCATG TGAATGCTGT GCGCAGCGTG TTGGAGGAAC TTGATTTTAA CATTCCCGTA TATGGAATGG TAAAGGACGA TAACCACAGA ACGAGAGGAC TGGTCACGGG AGAACGTGAA TTTGATTTGT CAAAGGACAT TGTGCTGCTA AGGTTTGTGA CGGCCATTCA GGACGAAGCC CACAGATTTG CTTTGGAATA TAACAGAAAG CTCAGAGCAA AAAGGTACAG CGGCTCTGTG CTTGACAATA TAGAAGGAGT GGGACCAAAG CGCAAAAAAG AATTGATCAG GCATTTTGGT TCGGTTAAAG CCATAAAAGA GGCTGAGCCG 
GGCGAAATTG CGAAAGTTAA AGGAATCAGC AGGGATTTGG CACAAAAAAT ATATGATTAT TTCAGACAAC AGGAGTAA
|
Protein sequence | MFDIQEELKK LPDKPGVYIM KDENGEIIYV GKAVVLKNRV RQYFQSLSNQ TPKVRAMVAH IKEFEYIVTD TELEALILEC NLIKKHRPKF NILLKDDKNY PYIKVTMNED FPRILMTRRV EKDGAKYFGP YTSAYAVRET IDLVKKLFPV KTCSKVLPRD IGKGRPCLNY HIYQCLGPCQ GNVSKEEYRF MMQDVCNFLG GRQEDIIKKL EKDMKEAADN LEFERAARIR DKINSLKHIA EKQKIISTAM EDQDVIAFAK SETDSCIQVF FIRGGKLIGR EHFILEGTSD VSDSELMTAF VKQFYSSAAY VPGQIILQED IDEMEIIEKW LSGKRGTKTY IKVPRRGEKL KLVEMVSKNA LIELNQFKER IKKEAALAKE GMEKLKELLN LDRLPRRIEA YDISNTGSTE IVGSMVVFEN GSPKKSDYRR FKIKSINVQN DYQSMQEVIF RRLKRAQKEM TEKDEGGGKD VGEKGAGFGT LPDVLLVDGG TGHVNAVRSV LEELDFNIPV YGMVKDDNHR TRGLVTGERE FDLSKDIVLL RFVTAIQDEA HRFALEYNRK LRAKRYSGSV LDNIEGVGPK RKKELIRHFG SVKAIKEAEP GEIAKVKGIS RDLAQKIYDY FRQQE
|
| |
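The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes translation table 11 (as listed in the record), whose amino acid assignments match the standard code, and it assumes the first codon is a valid start codon that is read as methionine, which is why the leading GTG yields M rather than V. The helper name `translate_cds` is hypothetical. Only the first 30 bases of the gene sequence are used here.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11 uses
# the same amino acid assignments as the standard code; it differs in which
# codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna):
    """Translate a CDS, reading the first codon as Met (assumed start codon)."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        aa = "M" if i == 0 else CODON_TABLE[seq[i:i + 3]]
        if aa == "*":                   # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence in this record.
print(translate_cds("GTGTTTGACA TTCAGGAGGA ATTGAAAAAA"))  # MFDIQEELKK
```

The result matches the first ten residues of the protein sequence listed in the record.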