Gene Cthe_0512 details


Gene Information

Locus tag: Cthe_0512
Symbol:
ID: 4808314
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 625728
End bp: 627248
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 43%
IMG OID: 640105927
Product: phage integrase
Protein accession: YP_001036942
Protein GI: 125973032
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
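
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; both are assumptions about this record, not something the page states. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends in a stop codon.
    """
    gene_length = end_bp - start_bp + 1  # inclusive span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1  # the final codon is the stop codon
    return gene_length, protein_length


# Values taken from the Gene Information fields above.
print(cds_lengths(625728, 627248))  # (1521, 506)
```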


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000591844
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAGCAG GACACTTGCG TGAGAAGAAC GGATACTACC ACATAATACT TAACTGGAAG 
GACGGTGATG GTAAAAGACG GAGCAAATCA ATTAGCACGG GCCTGCCTGT AAAGGGCAAT
AAGAAAAAAG CCGAGGCTAT GTTATTGGAG GCTCGTAAGA CTTTTAAGCC GGAGGATATT
GCGTCAGGTA AGAACACTCC GTATCACGTC TTTTTAGACA GATGGCTAAA GGACAAGATG
AACGACTTTG ACGAAGAAAC TTACGCAGTA TACAGCCATA ACGCCAGAGT TTTTATCGGT
CCTTACTTTA AGAGCTTAAA TTTGCAGCTC CACGAGATAA GGACCTCAAC ACTGGAAGCC
TACTATAACC ATGAAAAGAC GAAAAACCAT GCGTCTAAAA AGACCATCTT GCAGATACAC
GAGATTATTA CATTATCGTT AAACTACGCA ATAGAGCTTG GCTGGATTGA GAGCAACCCA
GCAAAAGGTA TTAACCCTGC GACAGATGAG GTGTCGGTTT TATTCGTGGA CTTTTTATTG
GAGTGGTTGG AGATGATGCG GACCAGAGTC AGGGAGACCA CATACGCATC CTATAACAGT
GGCATCAGGC AGAGCATAAT CCCCTATTTT ATGGACAAAA AGCTGACACT GACCGATATT
GAGGAAAACC CCAAGTATTT GCAGGACTTT TATCAGCACG AATTAAGTAA GGGACTGTCG
CCTAATTCGG TGTTAAGACG TCATGCCAAC ATTCGCAAAG CCTTGCAGCA CGCTTTTCAT
CTGGGTTTAA TTAAATCTAA TCCAGCCGAC AGGATTGAGC GTCCTAAAAA GCAGGATTAC
ACTGCATCCT ACTACACCGA TGAGGAGCTT GCTAAACTTT TCAAGGCTGC AAAAGGTGAC
CCCTTGGAGT TACCCATAGT TTTAGCGGCT TACTATGGTC TGCGTAGGAG CGAGATAATA
GGGTTAAGGT GGGATGCGAT TGATTTTAAC CCAGATGACC CGAAAATAAC AATACAATTC
ACAGTGACAG AGGTCAACTT TGGAGACGGG CAAGGCAATG TTATTATAGA AAAGGAGGGA
ACCAAATCAA AAGCCAGTAA ACGCACGTTG CCGCTGGTCA AGCCAATCGC AGACTTACTG
TTGCAGAAGA AAAAAGATAT AGAGAATAAC AGGAGGTTAT GCGGTAGTTG CTACAATGAC
AAATACTTGG ACTTTGTTCA CGTTAATGAG ATAGGAGAGC GAATGAAACC AAACTATATA
TCACAGCATT TTGCACTTTT ACTGGAGAAA CATGGGTTAA AGAAAATCAG GTTTCATGAT
TTACGACACT CATGTGCGAG CCTGCTATAC GCAAACGGTG TAAGTCTCAA GCAAATACAG
GAGTGGTTGG GGCACAGCGA TATATCCACG ACAGCCAACA TATACACTCA TTTAGACTAT
AACAGCAAAA TTGCTACGGC AAATGCAATC CTGCCAGTTT TATTCGACCA GCAGGAATCA
ACTGATGAAT TAGAGAGATA A
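
The gene sequence is listed in blocks of ten bases separated by spaces, so it has to be joined into one string before any per-base statistic such as GC content can be computed. A minimal sketch, with hypothetical helper names `clean_sequence` and `gc_fraction`, shown on the first row of the listing only:

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence as it appears in the listing.
first_row = "ATGATAGCAG GACACTTGCG TGAGAAGAAC GGATACTACC ACATAATACT TAACTGGAAG"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Running `gc_fraction` on the complete cleaned sequence, rather than on a single row, is what should be compared against the GC content field of the record.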
 
Protein sequence
MIAGHLREKN GYYHIILNWK DGDGKRRSKS ISTGLPVKGN KKKAEAMLLE ARKTFKPEDI 
ASGKNTPYHV FLDRWLKDKM NDFDEETYAV YSHNARVFIG PYFKSLNLQL HEIRTSTLEA
YYNHEKTKNH ASKKTILQIH EIITLSLNYA IELGWIESNP AKGINPATDE VSVLFVDFLL
EWLEMMRTRV RETTYASYNS GIRQSIIPYF MDKKLTLTDI EENPKYLQDF YQHELSKGLS
PNSVLRRHAN IRKALQHAFH LGLIKSNPAD RIERPKKQDY TASYYTDEEL AKLFKAAKGD
PLELPIVLAA YYGLRRSEII GLRWDAIDFN PDDPKITIQF TVTEVNFGDG QGNVIIEKEG
TKSKASKRTL PLVKPIADLL LQKKKDIENN RRLCGSCYND KYLDFVHVNE IGERMKPNYI
SQHFALLLEK HGLKKIRFHD LRHSCASLLY ANGVSLKQIQ EWLGHSDIST TANIYTHLDY
NSKIATANAI LPVLFDQQES TDELER
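
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one way to do that without external libraries. It uses the standard amino-acid assignments, which translation table 11 shares; table 11 differs in which codons may act as start codons, and since this gene begins with ATG no special handling of the first codon is included. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored, so the blocked listing can be passed directly.
    Codons containing anything other than A, C, G, T raise KeyError.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from the listing.
first_row = "ATGATAGCAG GACACTTGCG TGAGAAGAAC GGATACTACC ACATAATACT TAACTGGAAG"
print(translate(first_row))  # MIAGHLREKNGYYHIILNWK
```

The output matches the first twenty residues of the protein sequence. Translating the last row of the gene listing in the same way gives TDELER followed by the TAA stop codon, which matches the end of the protein.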