Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1671 |
Symbol | |
ID | 4808921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1996540 |
End bp | 1998108 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640107086 |
Product | recombinase |
Protein accession | YP_001038087 |
Protein GI | 125974177 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1961] Site-specific recombinases, DNA invertase Pin homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGTCA GTAAAAATGT CACAGTGATT CCGGCAAGAA AGCATACTCG CAAGAGCAAG GACGAGGAAA AACCGAAACT GCGCGTTGCT GCTTACTGCC GTGTTTCCAC AGACAGCGAG GAGCAGGCTA CCAGCTATGA CACACAGATT GAGCATTACA CTGCCTACAT ACAAGGGCAC CCCGACTGGA TGCTGGCAGG AATATTTGCT GATGACGGCA TTTCCGGTAC CAATACTAAG AAGCGTGAAG AATTCAACCG CATGATTGAC GAGTGTATGG CCGGTAATAT CGATATGATC ATTACAAAGT CCATCAGCCG CTTTGCGCGA AACACGCTGG ACTGTTTGAA GTATATCCGC CAACTAAAGG ACAAGAACAT TCCGGTCTAT TTTGAGAAAG AAAACATAAA CACCATGGAT TCCAAGGGCG AGGTTATGCT CACAATTATG GCATCTCTTG CGCAGCAGGA AAGCCAGTCC TTAAGTCAGA ATGTGAAACT GGGTTTGCAG TATCGTTACC AACAAGGCGA AATACAAGTC AACTGTGCAC GGTTCCTCGG TTATACCAAG GATGAGAATA AGCGCCTTGT GGTTGTGCCC GAAGAAGCTG AAATCGTAAA GCGCATTTAT CGAGAATACC TTGAGGGTGC AAGTATGCTG AAGATTGCCC GTGGGTTAGA GGCCGACGGT ATCTTAAACG GTGCAGGCAA TGAGCGCTGG CACACCAGTA ACATTAATAA TATTCTGCGA AACGAAAAGT ACATCGGAGA TGCGCTCTTG CAGAAAACCT ACACGGTTGA TTTTCTTACA AAAAAGCGGG TTAAGAACAA CGGTATCGTT CCGCAGTACT ATGTAGAGAA CAGCCATGAA GCCATCATCC CGCGTGAAGT TTTCATGCAG GTGCAGGAAG AGCTTATCCG CCGCCGTATT GTGCACACAA GCCCAAACGG AAAGACCAGA ACCTTCAGCA GCAACCACGT CTTTGCTCAG ATAATCATCT GCGGCAAATG CGGTGAGGTT TTTCGCAGGG TACATTGGAA CAACAGAGGT AAAAAGTCCA TCGTCTGGCG CTGTGTCAGC CGGTTAGAAA ACACCGGCCT ATTCTGCGAT GCCCGCACGG TACTGGAGAG CACCATCGAG CAAGTGCTAG TCACCGCCAT TAATCAGACG CTTTGCGACA AAGACTCTTT CCTCACAACT CTACGGGATA ACATCGCCAC CATCATAAAT CGTGAAAGCG ACAAGGCCTT AGCGGATATC GATAAGCGGT TGGAGGAGTT GCAAACGGAA CTTCTAAAAT TGGCCACTTC CAATGCGGAT TATGAAAAGG TTGGCGATGA GATTCACAGC CTGCGCGATC AGAAGCAAAA GCTGCAGGTT GAAAATGCCA ACCGTGATGA ACTCAAAAAG CAGATTGCTG ATATGAGCAC ATTCCTAAAG AAGCAGTCCA CCGCCCTCGC CGAATACGAC AAGCAGCTTG TTCGGAGGTT GATTGATAAG GTCACGGTCT TCGAGGATAA ATTCACCGTG GAATTCAAGT CCGGCGTGAC GGTGGATGTG GATGAATAA
|
Protein sequence | MAVSKNVTVI PARKHTRKSK DEEKPKLRVA AYCRVSTDSE EQATSYDTQI EHYTAYIQGH PDWMLAGIFA DDGISGTNTK KREEFNRMID ECMAGNIDMI ITKSISRFAR NTLDCLKYIR QLKDKNIPVY FEKENINTMD SKGEVMLTIM ASLAQQESQS LSQNVKLGLQ YRYQQGEIQV NCARFLGYTK DENKRLVVVP EEAEIVKRIY REYLEGASML KIARGLEADG ILNGAGNERW HTSNINNILR NEKYIGDALL QKTYTVDFLT KKRVKNNGIV PQYYVENSHE AIIPREVFMQ VQEELIRRRI VHTSPNGKTR TFSSNHVFAQ IIICGKCGEV FRRVHWNNRG KKSIVWRCVS RLENTGLFCD ARTVLESTIE QVLVTAINQT LCDKDSFLTT LRDNIATIIN RESDKALADI DKRLEELQTE LLKLATSNAD YEKVGDEIHS LRDQKQKLQV ENANRDELKK QIADMSTFLK KQSTALAEYD KQLVRRLIDK VTVFEDKFTV EFKSGVTVDV DE
|
| |