Gene Cthe_0001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0001 
Symbol 
ID4810536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp104 
End bp1723 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content46% 
IMG OID640105411 
Productrecombinase 
Protein accessionYP_001036436 
Protein GI125972526 
COG category[L] Replication, recombination and repair 
COG ID[COG1961] Site-specific recombinases, DNA invertase Pin homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAC AGCAGCAAAC AAGAACGGCA ATCTATTGCA GGCTCAGCCG GGACGATGAG 
CAAAGCGGCG ACAGCATGAG CATTGAAAAC CAGCGTAGTA TGCTTACCCG CTATGCTAAG
GAAAATGGAC TGGAAATCAT AGACTGGTAT ATTGATGACG GATGGAGCGG TACCAACTTT
GACAGGCCGG ACTTCCAACG GATGAAAAGC GACATTGAGG ACGGGAAAAT AGACATTGTA
CTGGTGAAGG ATTTAAGCCG GTTAGGGCGC AACCAGATAG AAACCAGCCT TTGCATCCAG
GTGTTTTTCC CGCAGCACGA TGTCCGTTTC ATTGCAGTAA GCGAAAACAT TGACACTGCC
AAGGGCGAAG ATGACTTCAT GGAACTGCGC AACCTGTTCA ATGAGTGGTT TGTGCGGGAC
ACAAGCCGAA AAGTAAAAAA CGGCTACCGC CAGCGGGCAC TCAACGGGGA CTATACCGGA
GCCTTCGCTC CCTACGGATA CAAAAAGGAC GAGCAGGACA AGCACAAGCT GGTGCCGGAT
GAAAATGTGG CGCATGTGGT CAAAAGGATA TTCCAGATGG CAGTAGAGGG GTACAGCCCT
TACAAAATCA GCATGGCGCT ACGAGCGGAT AAAATCCTTA CACCAAGGGC GTATGTGGCA
CAGGAATACC AGCGGTATGC AAACGTGTTT AATCCTCAAT ATCCATATGA TTGGGGAGCC
ACGACCATCA AAATCATCAT ACAAAACAAG GTATATCTTG GGCACATGGT AAGCCACAAA
TACACCAAAA AGTCCTTTAA AACGAAAGAA GTCGTAGCTG TGCCGGAAAA TGAATGGATT
GAGGTCAAAA ACACTCATGA GCCGCTAATC GATGAAGAAA CCTTTGAACT GGCGCAAAAA
ATCATCCGGG TAAAAAAGCG TCCTACCAAA GAAGGAGAAC ATCAGATATT TGCCGGGTTG
CTCAGATGCT CCACCTGCGG ACAAAGCCTA TCTTTTGCAA GGGGCGGCAA TAGCAAATAC
AGCGGCGGTA AAGGAGGACG CGGCAGCTTT GCTTGTAACC AATCCCGGCG CAAGGGTAAG
GAATATTGCA GCTTTCATTA CATCAGTTAC CTTGACATCT ATACCGTCAT TTTGGAGGAC
ATACGGAAAA ATGCAGCTAT TGCAAGAGAA AACGAAGCCG CCTTTGTGGA GATGGTGTCA
GACATCAGCA AGGCTAAGCT CAAAAAGCAA GTATCGGCGG CAGCTAAAGA AAAGGAAAAG
CTAAGGCACA GGGAAAACGA ACTGCAAGCT ATTTTAAAAA AGCTCTATGA GGACAATGCT
CTGGGAAAAA TCACTGATGA ACAGTTTATT TCCCTGTCAA AGGACTTCAC CGACGAGCAG
AGACAGATAA AAGAGCGGCT AAAGGCACTG GAAAATATCC TAAGCCAAGT GACAGAAAAG
CAAGAAAATA CAGCGAAATT CCTTGAACTG GTGCGGGAAT ACACCGATAT TAAGGAATTA
ACCAAGCCAA TACTCAATGA GCTGATCGAC AAGGTGGTAG TCTTTGACGC AGAAAAGGCC
AGAGGCGACC GGGTACAGAG AATTGACATT TACTATAGGT TTGTAGGGTT AATTGCGTAA
 
Protein sequence
MKKQQQTRTA IYCRLSRDDE QSGDSMSIEN QRSMLTRYAK ENGLEIIDWY IDDGWSGTNF 
DRPDFQRMKS DIEDGKIDIV LVKDLSRLGR NQIETSLCIQ VFFPQHDVRF IAVSENIDTA
KGEDDFMELR NLFNEWFVRD TSRKVKNGYR QRALNGDYTG AFAPYGYKKD EQDKHKLVPD
ENVAHVVKRI FQMAVEGYSP YKISMALRAD KILTPRAYVA QEYQRYANVF NPQYPYDWGA
TTIKIIIQNK VYLGHMVSHK YTKKSFKTKE VVAVPENEWI EVKNTHEPLI DEETFELAQK
IIRVKKRPTK EGEHQIFAGL LRCSTCGQSL SFARGGNSKY SGGKGGRGSF ACNQSRRKGK
EYCSFHYISY LDIYTVILED IRKNAAIARE NEAAFVEMVS DISKAKLKKQ VSAAAKEKEK
LRHRENELQA ILKKLYEDNA LGKITDEQFI SLSKDFTDEQ RQIKERLKAL ENILSQVTEK
QENTAKFLEL VREYTDIKEL TKPILNELID KVVVFDAEKA RGDRVQRIDI YYRFVGLIA