Gene Cthe_1609 details


Gene Information

Locus tag: Cthe_1609
Symbol:
ID: 4809599
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1938931
End bp: 1940499
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 39%
IMG OID: 640107025
Product: recombinase
Protein accession: YP_001038026
Protein GI: 125974116
COG category: [L] Replication, recombination and repair
COG ID: [COG1961] Site-specific recombinases, DNA invertase Pin homologs
TIGRFAM ID:
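
The gene and protein lengths in the table follow from the start and end coordinates. The short sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1938931
end_bp = 1940499

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, protein_length)  # prints: 1569 522
```

A zero remainder indicates that the coordinates span a whole number of codons, which is what an unspliced CDS should give.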


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000418639
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGAAAGG TAACGAGGAT TGATGGGAAC AATGCTCTCC AAGCTTTCAA GCCAAAGGTG 
AGGGTAGCGG CTTATTGCAG GGTTTCAACA GACAGTGATG AACAAATGGC AACCCTGGAA
GCACAAAAAG ACCATTATGA ATCCTATATA AAAGCAAATC CTGATTGGGA ATTTGCAGGG
ATTTACTATG ATGAAGGCAT ATCAGGCACA AAAAAGGAGA ATCGGACTGG ACTTTTAAGG
CTGCTTGCAG ATTGTGAAAA CAAGAAAATT GACTTTATTA TAACCAAGTC GGTCAGCAGA
TTTGCCAGAA ACACAACCGA CTGTATTGAG ATGGTGAGAA AACTTACCGA TCTCGGTGTT
TTCATCTATT TCGAGAAAGA GAATATAAAC ACACAGCGCA TGGAAGGCGA ATTGGTGCTG
ACAATTTTGA GCAGTCTTGC AGAAAACGAG TCATTATCCA TTGCAGAAAA TAGTAAGTGG
TCTATCAGAC GTAGGTTCCA AAACGGAACA TACAAAATTT CGTATCCTCC CTATGGTTAC
GATTATGTGG ATGGAAAGCT ATTTATCAAT AAAGAACAGG CTGAAATCGT AAAGCGGATT
TTTTCCGAGG CTTTGGACGG TAAAGGCACA CAGAAAATTG CAGATGGGCT AAATTCGGAT
AAAATCCCAA CAAAGAGAGG TTCACACTGG ACAGCGACAA CTATCCGCGG CATTTTAAGC
AATGAAAAAT ATACTGGGGA TGTCCTTCTG CAAAAGACCT ATACAGATGA GAATTTTAAA
CGGCATTATA ATCATGGGGA AAAAGATCAA TACATGATAA AAGATCATCA TGAAGCCATT
ATATCCCATG AGGAATTTGA AGCCGTCAAA GAAATATTGA AGCAAAGAGG TAAAGAAAAA
GGCGTAATCA AGGGAAGCAG TAAATATCAA AACCGCTACC CTTTCTCGGG GAAAATCAAA
TGCGCAGAAT GTGGCAGCAG TTTTAAGCGT CGAATTCATG GCAGTGGTAA TCATAAATAC
ATTGCCTGGT GCTGCACAAA GCATATAAAG GACGCATCAA GCTGTTCCAT GAAGTTTGTC
AGAGAGGATG CGATCCATCA GGCCTTCGTT GTAATGATCA ATAAGCTTAT TTTCGGACAT
AAGTTCATTC TAAGGCCATT GCTGCAAAGC TTAAAGAAAA CAAATTACTC AGATAACATA
ACTAAGATTC AGGAACTGGA AACTAAAATC AAAGAAAATA CAGAGCAAGT TCAGGTGATT
ATGGGACTTA TGGCCAAAGG ATACCTGGAA CCCGCTCTTT TTAATACACA GAAAAATGAG
CTGCTCAAAG AAGCGGCTTT ATTAAAAGAA CAAAGAGAAG CATTAAAACG CGTAATCGAT
GGAGGCATGA CTACTCTTGT TGAGGTAGAA AAGCTTTTTA AATTTGCAAC GAAGGCTGAA
AAGCAGATTG ATGCATTTGA TAGCGATATA TTTGAGAACT TTATTGAAGA AATCATTGTG
TTTTCACAGG AGGAAATAGG TTTCAAAATG AAATGCGGGT TGAACTTGAG GGAAAGGTTG
ATGAAATGA
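
The GC content reported in the Gene Information table can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as shown here, in space-separated blocks of ten bases. The function name `gc_content` is hypothetical, and only the first row of the sequence is used as sample input.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence, in the same blocked layout as the record.
first_row = "GTGAGAAAGG TAACGAGGAT TGATGGGAAC AATGCTCTCC AAGCTTTCAA GCCAAAGGTG"
print(round(gc_content(first_row), 1))
```

Passing the full 1569 bp sequence instead of `first_row` gives the value to compare against the table entry.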
 
Protein sequence
MRKVTRIDGN NALQAFKPKV RVAAYCRVST DSDEQMATLE AQKDHYESYI KANPDWEFAG 
IYYDEGISGT KKENRTGLLR LLADCENKKI DFIITKSVSR FARNTTDCIE MVRKLTDLGV
FIYFEKENIN TQRMEGELVL TILSSLAENE SLSIAENSKW SIRRRFQNGT YKISYPPYGY
DYVDGKLFIN KEQAEIVKRI FSEALDGKGT QKIADGLNSD KIPTKRGSHW TATTIRGILS
NEKYTGDVLL QKTYTDENFK RHYNHGEKDQ YMIKDHHEAI ISHEEFEAVK EILKQRGKEK
GVIKGSSKYQ NRYPFSGKIK CAECGSSFKR RIHGSGNHKY IAWCCTKHIK DASSCSMKFV
REDAIHQAFV VMINKLIFGH KFILRPLLQS LKKTNYSDNI TKIQELETKI KENTEQVQVI
MGLMAKGYLE PALFNTQKNE LLKEAALLKE QREALKRVID GGMTTLVEVE KLFKFATKAE
KQIDAFDSDI FENFIEEIIV FSQEEIGFKM KCGLNLRERL MK
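
The protein sequence begins with M although the gene sequence begins with GTG. Under translation table 11, the bacterial code, GTG is one of the alternative initiation codons and is read as methionine at the first position. The sketch below illustrates this with a small translator. The codon table string is the standard genetic code in TCAG order, the start codon set is taken from the NCBI definition of table 11 as an assumption, and `translate_cds` is a hypothetical helper name.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at '*'."""
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is always methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends the protein
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate_cds("GTGAGAAAGG TAACGAGGAT TGATGGGAAC"))  # prints: MRKVTRIDGN
```

The output matches the first block of the protein sequence. Applying the same function to the full gene sequence should stop at the final TGA codon.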