Gene Information |
Locus tag | Cthe_1609 |
Symbol | |
ID | 4809599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1938931 |
End bp | 1940499 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640107025 |
Product | recombinase |
Protein accession | YP_001038026 |
Protein GI | 125974116 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1961] Site-specific recombinases, DNA invertase Pin homologs |
TIGRFAM ID | |
|
|
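The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1938931
end_bp = 1940499
gene_length_bp = 1569
protein_length_aa = 522

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: an unspliced CDS has one codon per residue plus one stop codon.
codons = span // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(span % 3 == 0)                                 # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions, 1940499 - 1938931 + 1 = 1569 bp, and 1569 / 3 - 1 = 522 aa, which agrees with the reported gene and protein lengths.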
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000418639 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGAAAGG TAACGAGGAT TGATGGGAAC AATGCTCTCC AAGCTTTCAA GCCAAAGGTG AGGGTAGCGG CTTATTGCAG GGTTTCAACA GACAGTGATG AACAAATGGC AACCCTGGAA GCACAAAAAG ACCATTATGA ATCCTATATA AAAGCAAATC CTGATTGGGA ATTTGCAGGG ATTTACTATG ATGAAGGCAT ATCAGGCACA AAAAAGGAGA ATCGGACTGG ACTTTTAAGG CTGCTTGCAG ATTGTGAAAA CAAGAAAATT GACTTTATTA TAACCAAGTC GGTCAGCAGA TTTGCCAGAA ACACAACCGA CTGTATTGAG ATGGTGAGAA AACTTACCGA TCTCGGTGTT TTCATCTATT TCGAGAAAGA GAATATAAAC ACACAGCGCA TGGAAGGCGA ATTGGTGCTG ACAATTTTGA GCAGTCTTGC AGAAAACGAG TCATTATCCA TTGCAGAAAA TAGTAAGTGG TCTATCAGAC GTAGGTTCCA AAACGGAACA TACAAAATTT CGTATCCTCC CTATGGTTAC GATTATGTGG ATGGAAAGCT ATTTATCAAT AAAGAACAGG CTGAAATCGT AAAGCGGATT TTTTCCGAGG CTTTGGACGG TAAAGGCACA CAGAAAATTG CAGATGGGCT AAATTCGGAT AAAATCCCAA CAAAGAGAGG TTCACACTGG ACAGCGACAA CTATCCGCGG CATTTTAAGC AATGAAAAAT ATACTGGGGA TGTCCTTCTG CAAAAGACCT ATACAGATGA GAATTTTAAA CGGCATTATA ATCATGGGGA AAAAGATCAA TACATGATAA AAGATCATCA TGAAGCCATT ATATCCCATG AGGAATTTGA AGCCGTCAAA GAAATATTGA AGCAAAGAGG TAAAGAAAAA GGCGTAATCA AGGGAAGCAG TAAATATCAA AACCGCTACC CTTTCTCGGG GAAAATCAAA TGCGCAGAAT GTGGCAGCAG TTTTAAGCGT CGAATTCATG GCAGTGGTAA TCATAAATAC ATTGCCTGGT GCTGCACAAA GCATATAAAG GACGCATCAA GCTGTTCCAT GAAGTTTGTC AGAGAGGATG CGATCCATCA GGCCTTCGTT GTAATGATCA ATAAGCTTAT TTTCGGACAT AAGTTCATTC TAAGGCCATT GCTGCAAAGC TTAAAGAAAA CAAATTACTC AGATAACATA ACTAAGATTC AGGAACTGGA AACTAAAATC AAAGAAAATA CAGAGCAAGT TCAGGTGATT ATGGGACTTA TGGCCAAAGG ATACCTGGAA CCCGCTCTTT TTAATACACA GAAAAATGAG CTGCTCAAAG AAGCGGCTTT ATTAAAAGAA CAAAGAGAAG CATTAAAACG CGTAATCGAT GGAGGCATGA CTACTCTTGT TGAGGTAGAA AAGCTTTTTA AATTTGCAAC GAAGGCTGAA AAGCAGATTG ATGCATTTGA TAGCGATATA TTTGAGAACT TTATTGAAGA AATCATTGTG TTTTCACAGG AGGAAATAGG TTTCAAAATG AAATGCGGGT TGAACTTGAG GGAAAGGTTG ATGAAATGA
|
Protein sequence | MRKVTRIDGN NALQAFKPKV RVAAYCRVST DSDEQMATLE AQKDHYESYI KANPDWEFAG IYYDEGISGT KKENRTGLLR LLADCENKKI DFIITKSVSR FARNTTDCIE MVRKLTDLGV FIYFEKENIN TQRMEGELVL TILSSLAENE SLSIAENSKW SIRRRFQNGT YKISYPPYGY DYVDGKLFIN KEQAEIVKRI FSEALDGKGT QKIADGLNSD KIPTKRGSHW TATTIRGILS NEKYTGDVLL QKTYTDENFK RHYNHGEKDQ YMIKDHHEAI ISHEEFEAVK EILKQRGKEK GVIKGSSKYQ NRYPFSGKIK CAECGSSFKR RIHGSGNHKY IAWCCTKHIK DASSCSMKFV REDAIHQAFV VMINKLIFGH KFILRPLLQS LKKTNYSDNI TKIQELETKI KENTEQVQVI MGLMAKGYLE PALFNTQKNE LLKEAALLKE QREALKRVID GGMTTLVEVE KLFKFATKAE KQIDAFDSDI FENFIEEIIV FSQEEIGFKM KCGLNLRERL MK
|
| |
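The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before it can be compared with the GC content and protein sequence fields. The following is a minimal sketch of that check. The function names are hypothetical, the codon table is built from the NCBI amino-acid string that translation table 11 shares with the standard code, and the sketch assumes the sequence is already given in coding orientation (the record lists the gene on the minus strand) and contains only A, C, G and T.

```python
from itertools import product

# NCBI genetic-code amino-acid string, with codons enumerated over bases T, C, A, G.
# Translation table 11 uses the same codon assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate_cds(text: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(text)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    if residues:
        # Assumption: the first codon is a start codon (here GTG), which is
        # read as methionine regardless of its usual assignment.
        residues[0] = "M"
    return "".join(residues)


# First three blocks of the gene sequence from the record.
first_blocks = "GTGAGAAAGG TAACGAGGAT TGATGGGAAC"
print(translate_cds(first_blocks))  # MRKVTRIDGN
```

Passing the full gene sequence to `gc_fraction` should give a value close to the reported 39%, and `translate_cds` should reproduce the protein sequence once its block spacing is removed with `clean_sequence`.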