Gene Information |
Locus tag | Cthe_1701 |
Symbol | |
ID | 4808876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2023413 |
End bp | 2024702 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640107114 |
Product | recombinase |
Protein accession | YP_001038115 |
Protein GI | 125974205 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1961] Site-specific recombinases, DNA invertase Pin homologs |
TIGRFAM ID | |
|
|
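The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed values but is not stated in the record) and that the gene length includes the stop codon. The function name `check_gene_record` is hypothetical.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the terminal stop codon.
    """
    span = end_bp - start_bp + 1                 # inclusive span on the replicon
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_bp": span,
        "span_matches_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        # one codon is the stop codon and contributes no amino acid
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the record for Cthe_1701
report = check_gene_record(2023413, 2024702, 1290, 429)
print(report)
```

The strand field does not affect this check: for a gene on the minus strand the span is the same, only the reading direction differs.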
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGTTA GGATTATTGA ACCTGTAAAA AAGAAGGAAA ATAAAAAGAA AAGAGTTTGT GCTTATGCAA GAGTTTCAAC CGGCAGTGAT GCTCAAGGTG AATCTTTAGA AAATCAAATC CAGTATTATG AAAATCTGAT TTCAAATAAT CCTGATTACG AATATGCAGG TGTATTTGCT GATAGAGGAA TTACCGGCAC TACAGATAAT AGACCAGAGT TTCAAAGAAT GCTTAATCTT GCAAAAGAAG GGAAAATAGA CTTAATCATC ACCAAATCTA TCTCAAGATT TGCAAGGAAT ACAGCAATAA TGCTTCAAGT AGTAAGAGAA CTAAAGGACA TTGGTGTAGA AATAATATTT GAAAAAGAGA ATATCAGAAC TTTATCAGGG GATGGAGAGC TTATGCTAAC CGTCCTCTCT TCTTTTGCCC AGGAAGAAAG TAAAAATATC AGTGATAACT TAAAGTGGAG GGTAAAAAAG AAGTTTGAAC GAGGAGAGCT GATTATAAAT ACCACAAGAT TTTTAGGTTA TGACAAGGAT GAATACGGCG ATTTAGTTAT AAACCCAAAA GAAGCAGAAA TAGTTAAAAG AATATTTGAA GATTATTTAA AAGGCAAAGG AACATTTACC ATAGCCAAAG AATTAAATGA AGATAAAGTG CCTACTGTTG CAGGCGGCAG ATGGCAGGAA AGTACAATTT TAAATATCCT CAAGAATGAA AAATACAAGG GAGATGCCAT ACTTCAAAAG TATTACACAC CAGACCATCT GAGAAAAGTA AGTGTTAGAA ATGAAGGCGT AATTGACAGC TATTATATTG AAGATAATCA CTCTCCCATA GTTTCAAGAG AAATGTGGGA GCAGGTTCAG ATAGAAATTG CAAGAAGAGC AAAGGCAAAA GGAAATAAAG CAGGAGATAC AAAAAAATAT ACAAACAGAT ATCCATTAAC AGGAATGCTT TTCTGCAGTA AATGCGGCTC TACTCTAAGG AGAAGAACTT GGAACAGCAA ATTAAATTGC AGAAAGATTG TATGGCAGTG CAGTAATTAT ATTAAAAACG GAAAAGACGC CTGCAGTGGA ACATCAATTG ATGATGAGGT TATAAGCAGG CTTAATATAG AAGAACCAAT AATTGTAAGG GAGGAAGTTA AGGATGGCAA GAAATATTAC ACTTATACCT GCAAGAGCAA ACAGAAACAA TCTGGCAGAG CAAATACAGA CGCAGAAAAA GAGAATGGGA GCTTATTGCA GAGTATCAAC AGACCAATTA GAACAGTTAT CAAGCTATGA
|
Protein sequence | MRVRIIEPVK KKENKKKRVC AYARVSTGSD AQGESLENQI QYYENLISNN PDYEYAGVFA DRGITGTTDN RPEFQRMLNL AKEGKIDLII TKSISRFARN TAIMLQVVRE LKDIGVEIIF EKENIRTLSG DGELMLTVLS SFAQEESKNI SDNLKWRVKK KFERGELIIN TTRFLGYDKD EYGDLVINPK EAEIVKRIFE DYLKGKGTFT IAKELNEDKV PTVAGGRWQE STILNILKNE KYKGDAILQK YYTPDHLRKV SVRNEGVIDS YYIEDNHSPI VSREMWEQVQ IEIARRAKAK GNKAGDTKKY TNRYPLTGML FCSKCGSTLR RRTWNSKLNC RKIVWQCSNY IKNGKDACSG TSIDDEVISR LNIEEPIIVR EEVKDGKKYY TYTCKSKQKQ SGRANTDAEK ENGSLLQSIN RPIRTVIKL
|
| |
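The sequence fields are printed in space-separated blocks of ten characters, so they need to be cleaned before use. The following is a minimal sketch of how the GC content and the protein translation could be recomputed from the gene sequence. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which is not handled here since this gene starts with ATG). The helper names `clean_sequence`, `gc_percent`, and `translate` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering.
# Table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the block spacing and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate in frame 1 until the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":        # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record
first_30 = "ATGCGTGTTA GGATTATTGA ACCTGTAAAA"
print(translate(first_30))  # MRVRIIEPVK
```

Passing the full gene sequence string to `translate` and `gc_percent` in the same way allows the result to be compared with the listed protein sequence and GC content.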