Gene Information |
Locus tag | Cthe_1700 |
Symbol | |
ID | 4808875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2022140 |
End bp | 2023468 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640107113 |
Product | recombinase |
Protein accession | YP_001038114 |
Protein GI | 125974204 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1961] Site-specific recombinases, DNA invertase Pin homologs |
TIGRFAM ID | |
|
|
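The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2022140
END_BP = 2023468
GENE_LENGTH_BP = 1329
PROTEIN_LENGTH_AA = 442


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(gene_length(START_BP, END_BP))    # 1329, matches "Gene Length"
print(protein_length(GENE_LENGTH_BP))   # 442, matches "Protein Length"
```

Under these assumptions both derived values agree with the table: 2023468 - 2022140 + 1 = 1329 bp, and 1329 / 3 - 1 = 442 aa.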
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000219867 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAGCTT ATTGCAGAGT ATCAACAGAC CAATTAGAAC AGTTATCAAG CTATGAAGCA CAGGTAGCTT ATTATACATC TTATATAACA AATCATCCAG ATTATGAATT CGGAGGAATT TACGCAGATG AGGGGATTTC AGGGACAAAC ACTAAAAAGA GAGAACAGTT CAATAAAATG ATAGAAGACT GCAAAGCAGG GAAAATAGAT ATGATAATAA CCAAGTCTAT ATCAAGGTTT GCAAGAAATA CACTTGATAC ATTAAACTAC GTAAGACAGC TTAAAGAATT AGGTATTGGA GTAATATTTG AAAAGGAAAA TATTAATACT TTAGATTCAA AGGGAGAAGT ACTGCTTACA ATCCTCAGCT CCCTTGCCCA AGATGAATCA AGAAGTATAA GTGAAAATTC TACATGGGGC ATAAGGAGAA GGTTTGAACA GGGAAAGCTT CATATAAATC ATACAAAGTT TTTAGGCTAT GATAAAGATG AAGAAGGAAA TCTTGTGATA AATGAAAAGC AGGCTAAAAT TGTAAGAAGA ATATATAAGG ATTACCTTGA CGGTAAAGGT GCAAACAGAA TTGCAAGGGA ACTTGAAGAA GAAGGTGTTC CTAACTGGAA TGGAAAACCT AAGTGGTATG AAAGCAGTAT AAGAAAGATT TTAAGTAATG AAAAATATAA AGGAGATGCA CTTCTCCAAA AGACATATAC TGTTGATTTT CTAACCAAAA AAAGAGCAGT AAACAATGGC GAAGTTCCAA TGTATTATGT AGAAGAAAGC CATCCTGCAA TTATAGATAA AGAAATTTGG GAAGCGGTAC AGCTAGAGAT GGAGAGAAGA AGAGCTTTTG CTGAAAAATA TAACATCAGT AAGCTTGATT ATGCCACAGT AGATAATCCC TTTGCAGGAA GAGTTATCTG CGGACACTGC GGCAGTGCCT TTGGAAGAAA GGTATGGAAT TCTACTGATG AAAGGCTAAG AAGAGTGGTT TGGAGATGTA ATAAAAAATA TGAAGTAAAG GGAAAAAAGA GCTGTGAGAA TAAGCATATA GATGACAAGG TTTTATATCA TGCCTTTGTA AATACATTTA ATGCTATGGT GGAGAATAAG GAATACTTTA TGGAGAAGTG GAAGGAAGGA CTTAAAAGTG ATAACTTGCT TAAAAGATAT AAGGCAAGGC AGTTTATTGA AATTTTAAAG GATGCAAAGA TAGTAGAGGA GTTTGATGTT GATATGTATT TTAGAATAAT AGAGAAAATG ACAGTATTTG ATGGAAAAAA GATAATAGTG AGTTTGCTTG ATAGCACGGA GATTGAAGTT GCAATTTAA
|
Protein sequence | MGAYCRVSTD QLEQLSSYEA QVAYYTSYIT NHPDYEFGGI YADEGISGTN TKKREQFNKM IEDCKAGKID MIITKSISRF ARNTLDTLNY VRQLKELGIG VIFEKENINT LDSKGEVLLT ILSSLAQDES RSISENSTWG IRRRFEQGKL HINHTKFLGY DKDEEGNLVI NEKQAKIVRR IYKDYLDGKG ANRIARELEE EGVPNWNGKP KWYESSIRKI LSNEKYKGDA LLQKTYTVDF LTKKRAVNNG EVPMYYVEES HPAIIDKEIW EAVQLEMERR RAFAEKYNIS KLDYATVDNP FAGRVICGHC GSAFGRKVWN STDERLRRVV WRCNKKYEVK GKKSCENKHI DDKVLYHAFV NTFNAMVENK EYFMEKWKEG LKSDNLLKRY KARQFIEILK DAKIVEEFDV DMYFRIIEKM TVFDGKKIIV SLLDSTEIEV AI
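The gene sequence is shown in coding orientation (it begins with ATG and ends with the stop codon TAA), so it can be translated directly even though the gene lies on the minus strand of the replicon. The following is a minimal sketch of that translation, not the database's own pipeline. It assumes the space-separated 10-base blocks are purely cosmetic, and it uses the codon assignments of NCBI translation table 11, which are the same as those of the standard code. It does not handle alternative start codons or ambiguous bases. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); codon-to-amino-acid
# assignments for translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spacing between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and the last nine bases of the gene sequence above.
print(translate("ATGGGAGCTT ATTGCAGAGT ATCAACAGAC"))  # MGAYCRVSTD
print(translate("GCAATTTAA"))                          # AI
```

The two printed fragments match the first block and the final two residues of the protein sequence. Passing the full gene sequence to `translate` should reproduce the whole protein, and `gc_fraction` can be run on the same string for comparison with the GC content field.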