Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0311 |
Symbol | |
ID | 4808529 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 389538 |
End bp | 392366 |
Gene Length | 2829 bp |
Protein Length | 942 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640105722 |
Product | excinuclease ABC subunit A |
Protein accession | YP_001036742 |
Protein GI | 125972832 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00463978 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGG ATTATATTGT TGTAAAGGGT GCCAGAGAGC ACAATTTGAA AAACATAGAT GTCAAGATTC CCAGGGACAA GTTTGTTGTT ATCACCGGAC TGAGCGGATC AGGCAAATCA TCCCTTGCTT TTGACACAAT ATATGCGGAG GGACAAAGGC GCTACGTTGA GTCATTGTCC TCGTATGCAA GACAGTTTTT AGGACAAATG GAAAAACCTG ATGTCGATTA TATTGACGGA CTGTCGCCGG CCATAGCGAT AGATCAGAAA ACCACAAGCC GCAATCCCCG TTCCACTGTG GGCACGGTTA CGGAGATATA TGATTATTTA AGGCTTCTTT TTGCAAGAAT AGGCACTCCC CACTGCTACT TATGCGGAAG GGAAATTTCC CAGCAAACGG TGGACCAGAT GGTGGACAGA ATTATGGAGT TTGAAGAAGG CACACGGATT CAGCTTCTTG CTCCTGTGGT AAGAGGAAGA AAAGGTGAGT ATCACAAGCT CATAGAAGAT ATAAAGAAGG AAGGCTATGT CAGGATTAGA GTGGATGGAG AGGTAGTGGA TGTAAATGAC CCTGTAAACC TCGACAAGAA CAAGAAGCAC AATATTGAAA TTGTGGTGGA CAGGCTGATT GTGAGACCGG GAATTCAGAA AAGGTTGACA GATTCCATTG AGACTGTTCT GCGCTTAAGC AACGGCATAC TTGTGGTTGA TGTCATAGGC GGAAAGGAGA TGCTCCTAAG CCAAAACTTT GCATGTACCG AATGTAACGT GAGCATGGAG GAAATAACGC CCAGAATGTT TTCTTTCAAC AATCCTTACG GTGCCTGTCC CGAATGTACT GGTCTGGGCT CTCTTATGAG GATAGACCCT GACCTTGTCA TACCGGACAA AAAACTTTCT CTGGCCCAGG GAGCCGTCAG GGCGTCAGGA TGGAATATAG CAAATGATGA AAGCTATGCC AGAATGTATA TAGACGCTCT TGCAAAACAT TATAATTTCA GCGTGGATAC CCCTGTTGAG GAGCTTCCCC CGCATATTCT TGACATTATA CTCTATGGCA CCAACGGGGA AAAAATTAAA ATAGAATATG AAAGGGAAAA TGAAAAAGGA ACATTCATGG CAAGCTTCCC GGGAATTATA AACAGTATGG AGAGAAGATA CAAAGAGACA ACTTCGGAAG TAATGAAGCA GTACTATGAA AACTTTATGA GCAATATACC CTGTCCTGTC TGCAAGGGGG CGAGATTGAA AAAGGAAAGT CTTGCAGTGA CAATAGGCGG CAAAAATATA TATGAAGTTT GCTGCTTGTC CATTGGAGAA GCAAAAGAGT TTTTCGCAAA TTTAAACCTT ACGGAAAGGC AGCAGCTTAT TGCCCGCCAG ATCTTGAAGG AAATAAATGC AAGACTGGGA TTTTTGGTGG ATGTGGGGCT TGACTACCTC ACCCTTGCGA GAGCGGCAGG AACACTGTCC GGAGGTGAAG CCCAGAGAAT CAGGCTTGCC ACACAAATTG GCTCGGGACT TATGGGAGTT ATATATATCC TGGACGAGCC CAGCATAGGT CTTCATCAGA GGGATAACGA CAGGCTCCTC AGAAGTCTCA AGAAGCTAAG GGATTTGGGA AATACTTTGC TGGTGGTTGA ACATGATGAG GATACAATGT ATGCGTCGGA TTACATTATT GATTTGGGAC CGGGTGCGGG AAGCCACGGA GGACAAATAG TTGCGGAAGG TACTGTGGAA GAGATTAAAC AAAATCCCAA TTCCGTTACG GGAGAGTATC TTAGCGGCAG AAAGAAAATT GAAGTTCCTA AAGAAAGAAG AAAACCCAAT GGGAAATGGC TGGAAATTAT AGGAGCAAGA GAAAATAATC TTAAAAATAT AAATGTAAGA ATACCTTTAG GAGTGTTTAC GTGCATTACA GGGGTTTCAG GATCCGGGAA GAGTTCTCTG ATAAATGAAA TTTTGTACAA GCGATTGGCC GCCGAGCTTA ACAGAGCAAG TGTAAAACCG GGCGAGCATG ACTTGATAAA AGGAATTGAG TATCTTGACA AAGTTATAGA TATCGACCAG TCGCCCATTG GCCGCACGCC AAGGTCCAAC CCTGCAACAT ACACAGGTGT GTTTGATTTT ATAAGGGAAA TATTTGCAAA CACCACTGAA GCAAAAACCC GGGGGTACAA GGCGGGACGT TTCAGTTTTA ATGTAAAGGG CGGCAGATGC GAAGCCTGTG CCGGTGACGG TATAAACAAA ATTGAAATGC ACTTTTTACC GGACATTTAT GTTCCCTGTG AGGTTTGCAA GGGCAAGCGC TACAACAGAG AGACCCTTGA AGTAAGATAC AAAGGAAAAA ATATAGCGGA AGTTCTGGAT ATGACTGTGG AAGAGGCATT GGAGTTCTTT AAGAATATAC CAAGGATACA CAAAAAGATA GAAACATTGT ATGATGTGGG TCTTGGTTAT ATCAAACTGG GACAGTCGTC CACCACTCTG TCCGGAGGCG AGGCTCAGAG GGTAAAACTT GCCACCGAGC TTTCGAGAAA GAGCACTGGA AAAACAATGT ATATACTGGA TGAGCCGACT ACAGGCCTTC ATATGGCTGA TGTGCACAGG CTTGTCGGCA TACTTCACAG ACTGGTGGAG GCGGGAAATT CTGTAGTGGT TATTGAACAT AACCTTGACG TAATAAAAAC TGCCGATTAT ATTATTGATT TGGGACCTGA AGGTGGCAGC GGAGGAGGTC TCGTTGTTGC CGAGGGGACA CCGGAAGAAG TGGCAAAGGT TGAAAATTCT TATACAGGAC AGTTTTTGAA AAAAGTTTTG TCCACTTAA
|
Protein sequence | MKKDYIVVKG AREHNLKNID VKIPRDKFVV ITGLSGSGKS SLAFDTIYAE GQRRYVESLS SYARQFLGQM EKPDVDYIDG LSPAIAIDQK TTSRNPRSTV GTVTEIYDYL RLLFARIGTP HCYLCGREIS QQTVDQMVDR IMEFEEGTRI QLLAPVVRGR KGEYHKLIED IKKEGYVRIR VDGEVVDVND PVNLDKNKKH NIEIVVDRLI VRPGIQKRLT DSIETVLRLS NGILVVDVIG GKEMLLSQNF ACTECNVSME EITPRMFSFN NPYGACPECT GLGSLMRIDP DLVIPDKKLS LAQGAVRASG WNIANDESYA RMYIDALAKH YNFSVDTPVE ELPPHILDII LYGTNGEKIK IEYERENEKG TFMASFPGII NSMERRYKET TSEVMKQYYE NFMSNIPCPV CKGARLKKES LAVTIGGKNI YEVCCLSIGE AKEFFANLNL TERQQLIARQ ILKEINARLG FLVDVGLDYL TLARAAGTLS GGEAQRIRLA TQIGSGLMGV IYILDEPSIG LHQRDNDRLL RSLKKLRDLG NTLLVVEHDE DTMYASDYII DLGPGAGSHG GQIVAEGTVE EIKQNPNSVT GEYLSGRKKI EVPKERRKPN GKWLEIIGAR ENNLKNINVR IPLGVFTCIT GVSGSGKSSL INEILYKRLA AELNRASVKP GEHDLIKGIE YLDKVIDIDQ SPIGRTPRSN PATYTGVFDF IREIFANTTE AKTRGYKAGR FSFNVKGGRC EACAGDGINK IEMHFLPDIY VPCEVCKGKR YNRETLEVRY KGKNIAEVLD MTVEEALEFF KNIPRIHKKI ETLYDVGLGY IKLGQSSTTL SGGEAQRVKL ATELSRKSTG KTMYILDEPT TGLHMADVHR LVGILHRLVE AGNSVVVIEH NLDVIKTADY IIDLGPEGGS GGGLVVAEGT PEEVAKVENS YTGQFLKKVL ST
|
| |