Gene Cthe_0311 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0311 
Symbol 
ID4808529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp389538 
End bp392366 
Gene Length2829 bp 
Protein Length942 aa 
Translation table11 
GC content44% 
IMG OID640105722 
Productexcinuclease ABC subunit A 
Protein accessionYP_001036742 
Protein GI125972832 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00463978 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGG ATTATATTGT TGTAAAGGGT GCCAGAGAGC ACAATTTGAA AAACATAGAT 
GTCAAGATTC CCAGGGACAA GTTTGTTGTT ATCACCGGAC TGAGCGGATC AGGCAAATCA
TCCCTTGCTT TTGACACAAT ATATGCGGAG GGACAAAGGC GCTACGTTGA GTCATTGTCC
TCGTATGCAA GACAGTTTTT AGGACAAATG GAAAAACCTG ATGTCGATTA TATTGACGGA
CTGTCGCCGG CCATAGCGAT AGATCAGAAA ACCACAAGCC GCAATCCCCG TTCCACTGTG
GGCACGGTTA CGGAGATATA TGATTATTTA AGGCTTCTTT TTGCAAGAAT AGGCACTCCC
CACTGCTACT TATGCGGAAG GGAAATTTCC CAGCAAACGG TGGACCAGAT GGTGGACAGA
ATTATGGAGT TTGAAGAAGG CACACGGATT CAGCTTCTTG CTCCTGTGGT AAGAGGAAGA
AAAGGTGAGT ATCACAAGCT CATAGAAGAT ATAAAGAAGG AAGGCTATGT CAGGATTAGA
GTGGATGGAG AGGTAGTGGA TGTAAATGAC CCTGTAAACC TCGACAAGAA CAAGAAGCAC
AATATTGAAA TTGTGGTGGA CAGGCTGATT GTGAGACCGG GAATTCAGAA AAGGTTGACA
GATTCCATTG AGACTGTTCT GCGCTTAAGC AACGGCATAC TTGTGGTTGA TGTCATAGGC
GGAAAGGAGA TGCTCCTAAG CCAAAACTTT GCATGTACCG AATGTAACGT GAGCATGGAG
GAAATAACGC CCAGAATGTT TTCTTTCAAC AATCCTTACG GTGCCTGTCC CGAATGTACT
GGTCTGGGCT CTCTTATGAG GATAGACCCT GACCTTGTCA TACCGGACAA AAAACTTTCT
CTGGCCCAGG GAGCCGTCAG GGCGTCAGGA TGGAATATAG CAAATGATGA AAGCTATGCC
AGAATGTATA TAGACGCTCT TGCAAAACAT TATAATTTCA GCGTGGATAC CCCTGTTGAG
GAGCTTCCCC CGCATATTCT TGACATTATA CTCTATGGCA CCAACGGGGA AAAAATTAAA
ATAGAATATG AAAGGGAAAA TGAAAAAGGA ACATTCATGG CAAGCTTCCC GGGAATTATA
AACAGTATGG AGAGAAGATA CAAAGAGACA ACTTCGGAAG TAATGAAGCA GTACTATGAA
AACTTTATGA GCAATATACC CTGTCCTGTC TGCAAGGGGG CGAGATTGAA AAAGGAAAGT
CTTGCAGTGA CAATAGGCGG CAAAAATATA TATGAAGTTT GCTGCTTGTC CATTGGAGAA
GCAAAAGAGT TTTTCGCAAA TTTAAACCTT ACGGAAAGGC AGCAGCTTAT TGCCCGCCAG
ATCTTGAAGG AAATAAATGC AAGACTGGGA TTTTTGGTGG ATGTGGGGCT TGACTACCTC
ACCCTTGCGA GAGCGGCAGG AACACTGTCC GGAGGTGAAG CCCAGAGAAT CAGGCTTGCC
ACACAAATTG GCTCGGGACT TATGGGAGTT ATATATATCC TGGACGAGCC CAGCATAGGT
CTTCATCAGA GGGATAACGA CAGGCTCCTC AGAAGTCTCA AGAAGCTAAG GGATTTGGGA
AATACTTTGC TGGTGGTTGA ACATGATGAG GATACAATGT ATGCGTCGGA TTACATTATT
GATTTGGGAC CGGGTGCGGG AAGCCACGGA GGACAAATAG TTGCGGAAGG TACTGTGGAA
GAGATTAAAC AAAATCCCAA TTCCGTTACG GGAGAGTATC TTAGCGGCAG AAAGAAAATT
GAAGTTCCTA AAGAAAGAAG AAAACCCAAT GGGAAATGGC TGGAAATTAT AGGAGCAAGA
GAAAATAATC TTAAAAATAT AAATGTAAGA ATACCTTTAG GAGTGTTTAC GTGCATTACA
GGGGTTTCAG GATCCGGGAA GAGTTCTCTG ATAAATGAAA TTTTGTACAA GCGATTGGCC
GCCGAGCTTA ACAGAGCAAG TGTAAAACCG GGCGAGCATG ACTTGATAAA AGGAATTGAG
TATCTTGACA AAGTTATAGA TATCGACCAG TCGCCCATTG GCCGCACGCC AAGGTCCAAC
CCTGCAACAT ACACAGGTGT GTTTGATTTT ATAAGGGAAA TATTTGCAAA CACCACTGAA
GCAAAAACCC GGGGGTACAA GGCGGGACGT TTCAGTTTTA ATGTAAAGGG CGGCAGATGC
GAAGCCTGTG CCGGTGACGG TATAAACAAA ATTGAAATGC ACTTTTTACC GGACATTTAT
GTTCCCTGTG AGGTTTGCAA GGGCAAGCGC TACAACAGAG AGACCCTTGA AGTAAGATAC
AAAGGAAAAA ATATAGCGGA AGTTCTGGAT ATGACTGTGG AAGAGGCATT GGAGTTCTTT
AAGAATATAC CAAGGATACA CAAAAAGATA GAAACATTGT ATGATGTGGG TCTTGGTTAT
ATCAAACTGG GACAGTCGTC CACCACTCTG TCCGGAGGCG AGGCTCAGAG GGTAAAACTT
GCCACCGAGC TTTCGAGAAA GAGCACTGGA AAAACAATGT ATATACTGGA TGAGCCGACT
ACAGGCCTTC ATATGGCTGA TGTGCACAGG CTTGTCGGCA TACTTCACAG ACTGGTGGAG
GCGGGAAATT CTGTAGTGGT TATTGAACAT AACCTTGACG TAATAAAAAC TGCCGATTAT
ATTATTGATT TGGGACCTGA AGGTGGCAGC GGAGGAGGTC TCGTTGTTGC CGAGGGGACA
CCGGAAGAAG TGGCAAAGGT TGAAAATTCT TATACAGGAC AGTTTTTGAA AAAAGTTTTG
TCCACTTAA
 
Protein sequence
MKKDYIVVKG AREHNLKNID VKIPRDKFVV ITGLSGSGKS SLAFDTIYAE GQRRYVESLS 
SYARQFLGQM EKPDVDYIDG LSPAIAIDQK TTSRNPRSTV GTVTEIYDYL RLLFARIGTP
HCYLCGREIS QQTVDQMVDR IMEFEEGTRI QLLAPVVRGR KGEYHKLIED IKKEGYVRIR
VDGEVVDVND PVNLDKNKKH NIEIVVDRLI VRPGIQKRLT DSIETVLRLS NGILVVDVIG
GKEMLLSQNF ACTECNVSME EITPRMFSFN NPYGACPECT GLGSLMRIDP DLVIPDKKLS
LAQGAVRASG WNIANDESYA RMYIDALAKH YNFSVDTPVE ELPPHILDII LYGTNGEKIK
IEYERENEKG TFMASFPGII NSMERRYKET TSEVMKQYYE NFMSNIPCPV CKGARLKKES
LAVTIGGKNI YEVCCLSIGE AKEFFANLNL TERQQLIARQ ILKEINARLG FLVDVGLDYL
TLARAAGTLS GGEAQRIRLA TQIGSGLMGV IYILDEPSIG LHQRDNDRLL RSLKKLRDLG
NTLLVVEHDE DTMYASDYII DLGPGAGSHG GQIVAEGTVE EIKQNPNSVT GEYLSGRKKI
EVPKERRKPN GKWLEIIGAR ENNLKNINVR IPLGVFTCIT GVSGSGKSSL INEILYKRLA
AELNRASVKP GEHDLIKGIE YLDKVIDIDQ SPIGRTPRSN PATYTGVFDF IREIFANTTE
AKTRGYKAGR FSFNVKGGRC EACAGDGINK IEMHFLPDIY VPCEVCKGKR YNRETLEVRY
KGKNIAEVLD MTVEEALEFF KNIPRIHKKI ETLYDVGLGY IKLGQSSTTL SGGEAQRVKL
ATELSRKSTG KTMYILDEPT TGLHMADVHR LVGILHRLVE AGNSVVVIEH NLDVIKTADY
IIDLGPEGGS GGGLVVAEGT PEEVAKVENS YTGQFLKKVL ST