Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0512 |
Symbol | |
ID | 4808314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 625728 |
End bp | 627248 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640105927 |
Product | phage integrase |
Protein accession | YP_001036942 |
Protein GI | 125973032 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000591844 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAGCAG GACACTTGCG TGAGAAGAAC GGATACTACC ACATAATACT TAACTGGAAG GACGGTGATG GTAAAAGACG GAGCAAATCA ATTAGCACGG GCCTGCCTGT AAAGGGCAAT AAGAAAAAAG CCGAGGCTAT GTTATTGGAG GCTCGTAAGA CTTTTAAGCC GGAGGATATT GCGTCAGGTA AGAACACTCC GTATCACGTC TTTTTAGACA GATGGCTAAA GGACAAGATG AACGACTTTG ACGAAGAAAC TTACGCAGTA TACAGCCATA ACGCCAGAGT TTTTATCGGT CCTTACTTTA AGAGCTTAAA TTTGCAGCTC CACGAGATAA GGACCTCAAC ACTGGAAGCC TACTATAACC ATGAAAAGAC GAAAAACCAT GCGTCTAAAA AGACCATCTT GCAGATACAC GAGATTATTA CATTATCGTT AAACTACGCA ATAGAGCTTG GCTGGATTGA GAGCAACCCA GCAAAAGGTA TTAACCCTGC GACAGATGAG GTGTCGGTTT TATTCGTGGA CTTTTTATTG GAGTGGTTGG AGATGATGCG GACCAGAGTC AGGGAGACCA CATACGCATC CTATAACAGT GGCATCAGGC AGAGCATAAT CCCCTATTTT ATGGACAAAA AGCTGACACT GACCGATATT GAGGAAAACC CCAAGTATTT GCAGGACTTT TATCAGCACG AATTAAGTAA GGGACTGTCG CCTAATTCGG TGTTAAGACG TCATGCCAAC ATTCGCAAAG CCTTGCAGCA CGCTTTTCAT CTGGGTTTAA TTAAATCTAA TCCAGCCGAC AGGATTGAGC GTCCTAAAAA GCAGGATTAC ACTGCATCCT ACTACACCGA TGAGGAGCTT GCTAAACTTT TCAAGGCTGC AAAAGGTGAC CCCTTGGAGT TACCCATAGT TTTAGCGGCT TACTATGGTC TGCGTAGGAG CGAGATAATA GGGTTAAGGT GGGATGCGAT TGATTTTAAC CCAGATGACC CGAAAATAAC AATACAATTC ACAGTGACAG AGGTCAACTT TGGAGACGGG CAAGGCAATG TTATTATAGA AAAGGAGGGA ACCAAATCAA AAGCCAGTAA ACGCACGTTG CCGCTGGTCA AGCCAATCGC AGACTTACTG TTGCAGAAGA AAAAAGATAT AGAGAATAAC AGGAGGTTAT GCGGTAGTTG CTACAATGAC AAATACTTGG ACTTTGTTCA CGTTAATGAG ATAGGAGAGC GAATGAAACC AAACTATATA TCACAGCATT TTGCACTTTT ACTGGAGAAA CATGGGTTAA AGAAAATCAG GTTTCATGAT TTACGACACT CATGTGCGAG CCTGCTATAC GCAAACGGTG TAAGTCTCAA GCAAATACAG GAGTGGTTGG GGCACAGCGA TATATCCACG ACAGCCAACA TATACACTCA TTTAGACTAT AACAGCAAAA TTGCTACGGC AAATGCAATC CTGCCAGTTT TATTCGACCA GCAGGAATCA ACTGATGAAT TAGAGAGATA A
|
Protein sequence | MIAGHLREKN GYYHIILNWK DGDGKRRSKS ISTGLPVKGN KKKAEAMLLE ARKTFKPEDI ASGKNTPYHV FLDRWLKDKM NDFDEETYAV YSHNARVFIG PYFKSLNLQL HEIRTSTLEA YYNHEKTKNH ASKKTILQIH EIITLSLNYA IELGWIESNP AKGINPATDE VSVLFVDFLL EWLEMMRTRV RETTYASYNS GIRQSIIPYF MDKKLTLTDI EENPKYLQDF YQHELSKGLS PNSVLRRHAN IRKALQHAFH LGLIKSNPAD RIERPKKQDY TASYYTDEEL AKLFKAAKGD PLELPIVLAA YYGLRRSEII GLRWDAIDFN PDDPKITIQF TVTEVNFGDG QGNVIIEKEG TKSKASKRTL PLVKPIADLL LQKKKDIENN RRLCGSCYND KYLDFVHVNE IGERMKPNYI SQHFALLLEK HGLKKIRFHD LRHSCASLLY ANGVSLKQIQ EWLGHSDIST TANIYTHLDY NSKIATANAI LPVLFDQQES TDELER
|
| |