Gene Cthe_2737 details


Gene Information

Locus tag: Cthe_2737
Symbol: uvrC
ID: 4810239
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3228762
End bp: 3230639
Gene Length: 1878 bp
Protein Length: 625 aa
Translation table: 11
GC content: 43%
IMG OID: 640108156
Product: excinuclease ABC subunit C
Protein accession: YP_001039129
Protein GI: 125975219
COG category: [L] Replication, recombination and repair
COG ID: [COG0322] Nuclease subunit of the excinuclease complex
TIGRFAM ID: [TIGR00194] excinuclease ABC, C subunit
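
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the unspliced CDS includes its stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3228762, 3230639)
print(length)                     # 1878
print(protein_length_aa(length))  # 625
```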


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTTTGACA TTCAGGAGGA ATTGAAAAAA CTGCCGGACA AGCCGGGCGT CTACATTATG 
AAAGATGAAA ACGGCGAAAT AATTTATGTG GGGAAAGCCG TTGTTTTAAA AAACAGGGTA
AGACAGTATT TTCAATCCTT GTCAAACCAG ACTCCCAAAG TCAGAGCGAT GGTTGCCCAC
ATCAAGGAAT TTGAGTATAT AGTAACCGAT ACGGAGCTTG AGGCTTTGAT ACTTGAATGC
AACCTTATTA AAAAACACCG GCCGAAGTTT AATATATTAT TGAAGGATGA CAAAAACTAT
CCTTATATAA AGGTTACCAT GAACGAAGAT TTTCCCCGTA TTTTAATGAC CCGCAGGGTT
GAAAAGGACG GGGCAAAATA TTTCGGACCG TATACCAGCG CATATGCCGT CCGCGAGACG
ATTGACCTTG TAAAAAAGCT GTTTCCCGTA AAAACATGCA GCAAAGTGCT TCCAAGAGAT
ATCGGAAAAG GGCGGCCGTG CCTTAACTAT CATATATATC AGTGCCTTGG CCCCTGCCAG
GGAAATGTGA GCAAGGAAGA ATACAGGTTT ATGATGCAGG ATGTGTGCAA CTTCCTGGGG
GGAAGACAGG AGGATATAAT TAAGAAGCTT GAGAAGGATA TGAAAGAGGC GGCGGACAAT
CTGGAGTTTG AGAGGGCCGC AAGAATAAGG GACAAAATCA ACAGCTTGAA GCATATTGCG
GAGAAACAAA AAATTATATC CACTGCGATG GAAGACCAGG ATGTCATAGC TTTTGCAAAG
AGTGAAACCG ACTCATGCAT ACAGGTGTTT TTCATCAGGG GAGGCAAGCT GATTGGGCGT
GAACACTTTA TTTTGGAAGG AACGTCCGAT GTCAGCGACA GCGAATTGAT GACTGCATTT
GTAAAGCAGT TTTACAGCAG TGCCGCTTAT GTACCCGGTC AGATAATTCT GCAAGAGGAC
ATTGATGAAA TGGAAATTAT TGAGAAATGG CTGAGTGGAA AAAGAGGGAC GAAGACTTAT
ATAAAGGTGC CGAGAAGAGG GGAAAAACTG AAGCTTGTTG AGATGGTATC CAAAAATGCC
CTGATTGAGC TTAACCAATT TAAAGAAAGA ATTAAGAAAG AGGCGGCACT GGCAAAGGAA
GGTATGGAAA AGCTGAAAGA GCTTTTAAAT CTTGACAGGC TCCCGAGAAG AATAGAAGCG
TATGACATAT CCAACACAGG AAGCACTGAA ATTGTGGGTT CAATGGTCGT CTTTGAAAAC
GGGTCTCCTA AAAAAAGTGA TTACAGGAGG TTTAAAATAA AGTCAATCAA TGTACAGAAT
GATTATCAGA GCATGCAGGA GGTTATTTTC AGACGGCTTA AACGGGCCCA AAAAGAAATG
ACGGAAAAAG ATGAAGGCGG CGGAAAAGAT GTTGGCGAAA AAGGCGCAGG ATTTGGAACG
CTTCCGGATG TTTTGCTTGT GGACGGGGGA ACCGGGCATG TGAATGCTGT GCGCAGCGTG
TTGGAGGAAC TTGATTTTAA CATTCCCGTA TATGGAATGG TAAAGGACGA TAACCACAGA
ACGAGAGGAC TGGTCACGGG AGAACGTGAA TTTGATTTGT CAAAGGACAT TGTGCTGCTA
AGGTTTGTGA CGGCCATTCA GGACGAAGCC CACAGATTTG CTTTGGAATA TAACAGAAAG
CTCAGAGCAA AAAGGTACAG CGGCTCTGTG CTTGACAATA TAGAAGGAGT GGGACCAAAG
CGCAAAAAAG AATTGATCAG GCATTTTGGT TCGGTTAAAG CCATAAAAGA GGCTGAGCCG
GGCGAAATTG CGAAAGTTAA AGGAATCAGC AGGGATTTGG CACAAAAAAT ATATGATTAT
TTCAGACAAC AGGAGTAA
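
A figure such as the GC content can be recomputed from a sequence printed in this layout (blocks of 10 bases separated by spaces). The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over the full gene sequence; the helper names are hypothetical. Only the first line of the gene sequence is used here for illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "GTGTTTGACA TTCAGGAGGA ATTGAAAAAA CTGCCGGACA AGCCGGGCGT CTACATTATG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the complete gene sequence through `clean_sequence` and `gc_percent` gives the value to compare against the rounded percentage in the record.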
 
Protein sequence
MFDIQEELKK LPDKPGVYIM KDENGEIIYV GKAVVLKNRV RQYFQSLSNQ TPKVRAMVAH 
IKEFEYIVTD TELEALILEC NLIKKHRPKF NILLKDDKNY PYIKVTMNED FPRILMTRRV
EKDGAKYFGP YTSAYAVRET IDLVKKLFPV KTCSKVLPRD IGKGRPCLNY HIYQCLGPCQ
GNVSKEEYRF MMQDVCNFLG GRQEDIIKKL EKDMKEAADN LEFERAARIR DKINSLKHIA
EKQKIISTAM EDQDVIAFAK SETDSCIQVF FIRGGKLIGR EHFILEGTSD VSDSELMTAF
VKQFYSSAAY VPGQIILQED IDEMEIIEKW LSGKRGTKTY IKVPRRGEKL KLVEMVSKNA
LIELNQFKER IKKEAALAKE GMEKLKELLN LDRLPRRIEA YDISNTGSTE IVGSMVVFEN
GSPKKSDYRR FKIKSINVQN DYQSMQEVIF RRLKRAQKEM TEKDEGGGKD VGEKGAGFGT
LPDVLLVDGG TGHVNAVRSV LEELDFNIPV YGMVKDDNHR TRGLVTGERE FDLSKDIVLL
RFVTAIQDEA HRFALEYNRK LRAKRYSGSV LDNIEGVGPK RKKELIRHFG SVKAIKEAEP
GEIAKVKGIS RDLAQKIYDY FRQQE
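
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), alternative initiation codons such as GTG are read as methionine at the start of a CDS. The sketch below illustrates this with a hand-built codon table. The set of start codons is an assumption taken from the usual description of table 11, and `translate_cds` is a hypothetical helper, not the tool used to produce this record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiation codon at position 0 as M
    and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate_cds("GTGTTTGACA TTCAGGAGGA ATTGAAAAAA"))  # MFDIQEELKK
```

The output matches the first ten residues of the protein sequence in the record.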