Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0001 |
Symbol | |
ID | 4810536 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 104 |
End bp | 1723 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640105411 |
Product | recombinase |
Protein accession | YP_001036436 |
Protein GI | 125972526 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1961] Site-specific recombinases, DNA invertase Pin homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAC AGCAGCAAAC AAGAACGGCA ATCTATTGCA GGCTCAGCCG GGACGATGAG CAAAGCGGCG ACAGCATGAG CATTGAAAAC CAGCGTAGTA TGCTTACCCG CTATGCTAAG GAAAATGGAC TGGAAATCAT AGACTGGTAT ATTGATGACG GATGGAGCGG TACCAACTTT GACAGGCCGG ACTTCCAACG GATGAAAAGC GACATTGAGG ACGGGAAAAT AGACATTGTA CTGGTGAAGG ATTTAAGCCG GTTAGGGCGC AACCAGATAG AAACCAGCCT TTGCATCCAG GTGTTTTTCC CGCAGCACGA TGTCCGTTTC ATTGCAGTAA GCGAAAACAT TGACACTGCC AAGGGCGAAG ATGACTTCAT GGAACTGCGC AACCTGTTCA ATGAGTGGTT TGTGCGGGAC ACAAGCCGAA AAGTAAAAAA CGGCTACCGC CAGCGGGCAC TCAACGGGGA CTATACCGGA GCCTTCGCTC CCTACGGATA CAAAAAGGAC GAGCAGGACA AGCACAAGCT GGTGCCGGAT GAAAATGTGG CGCATGTGGT CAAAAGGATA TTCCAGATGG CAGTAGAGGG GTACAGCCCT TACAAAATCA GCATGGCGCT ACGAGCGGAT AAAATCCTTA CACCAAGGGC GTATGTGGCA CAGGAATACC AGCGGTATGC AAACGTGTTT AATCCTCAAT ATCCATATGA TTGGGGAGCC ACGACCATCA AAATCATCAT ACAAAACAAG GTATATCTTG GGCACATGGT AAGCCACAAA TACACCAAAA AGTCCTTTAA AACGAAAGAA GTCGTAGCTG TGCCGGAAAA TGAATGGATT GAGGTCAAAA ACACTCATGA GCCGCTAATC GATGAAGAAA CCTTTGAACT GGCGCAAAAA ATCATCCGGG TAAAAAAGCG TCCTACCAAA GAAGGAGAAC ATCAGATATT TGCCGGGTTG CTCAGATGCT CCACCTGCGG ACAAAGCCTA TCTTTTGCAA GGGGCGGCAA TAGCAAATAC AGCGGCGGTA AAGGAGGACG CGGCAGCTTT GCTTGTAACC AATCCCGGCG CAAGGGTAAG GAATATTGCA GCTTTCATTA CATCAGTTAC CTTGACATCT ATACCGTCAT TTTGGAGGAC ATACGGAAAA ATGCAGCTAT TGCAAGAGAA AACGAAGCCG CCTTTGTGGA GATGGTGTCA GACATCAGCA AGGCTAAGCT CAAAAAGCAA GTATCGGCGG CAGCTAAAGA AAAGGAAAAG CTAAGGCACA GGGAAAACGA ACTGCAAGCT ATTTTAAAAA AGCTCTATGA GGACAATGCT CTGGGAAAAA TCACTGATGA ACAGTTTATT TCCCTGTCAA AGGACTTCAC CGACGAGCAG AGACAGATAA AAGAGCGGCT AAAGGCACTG GAAAATATCC TAAGCCAAGT GACAGAAAAG CAAGAAAATA CAGCGAAATT CCTTGAACTG GTGCGGGAAT ACACCGATAT TAAGGAATTA ACCAAGCCAA TACTCAATGA GCTGATCGAC AAGGTGGTAG TCTTTGACGC AGAAAAGGCC AGAGGCGACC GGGTACAGAG AATTGACATT TACTATAGGT TTGTAGGGTT AATTGCGTAA
|
Protein sequence | MKKQQQTRTA IYCRLSRDDE QSGDSMSIEN QRSMLTRYAK ENGLEIIDWY IDDGWSGTNF DRPDFQRMKS DIEDGKIDIV LVKDLSRLGR NQIETSLCIQ VFFPQHDVRF IAVSENIDTA KGEDDFMELR NLFNEWFVRD TSRKVKNGYR QRALNGDYTG AFAPYGYKKD EQDKHKLVPD ENVAHVVKRI FQMAVEGYSP YKISMALRAD KILTPRAYVA QEYQRYANVF NPQYPYDWGA TTIKIIIQNK VYLGHMVSHK YTKKSFKTKE VVAVPENEWI EVKNTHEPLI DEETFELAQK IIRVKKRPTK EGEHQIFAGL LRCSTCGQSL SFARGGNSKY SGGKGGRGSF ACNQSRRKGK EYCSFHYISY LDIYTVILED IRKNAAIARE NEAAFVEMVS DISKAKLKKQ VSAAAKEKEK LRHRENELQA ILKKLYEDNA LGKITDEQFI SLSKDFTDEQ RQIKERLKAL ENILSQVTEK QENTAKFLEL VREYTDIKEL TKPILNELID KVVVFDAEKA RGDRVQRIDI YYRFVGLIA
|
| |