Gene Cthe_1671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1671 
Symbol 
ID4808921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1996540 
End bp1998108 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content48% 
IMG OID640107086 
Productrecombinase 
Protein accessionYP_001038087 
Protein GI125974177 
COG category[L] Replication, recombination and repair 
COG ID[COG1961] Site-specific recombinases, DNA invertase Pin homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGTCA GTAAAAATGT CACAGTGATT CCGGCAAGAA AGCATACTCG CAAGAGCAAG 
GACGAGGAAA AACCGAAACT GCGCGTTGCT GCTTACTGCC GTGTTTCCAC AGACAGCGAG
GAGCAGGCTA CCAGCTATGA CACACAGATT GAGCATTACA CTGCCTACAT ACAAGGGCAC
CCCGACTGGA TGCTGGCAGG AATATTTGCT GATGACGGCA TTTCCGGTAC CAATACTAAG
AAGCGTGAAG AATTCAACCG CATGATTGAC GAGTGTATGG CCGGTAATAT CGATATGATC
ATTACAAAGT CCATCAGCCG CTTTGCGCGA AACACGCTGG ACTGTTTGAA GTATATCCGC
CAACTAAAGG ACAAGAACAT TCCGGTCTAT TTTGAGAAAG AAAACATAAA CACCATGGAT
TCCAAGGGCG AGGTTATGCT CACAATTATG GCATCTCTTG CGCAGCAGGA AAGCCAGTCC
TTAAGTCAGA ATGTGAAACT GGGTTTGCAG TATCGTTACC AACAAGGCGA AATACAAGTC
AACTGTGCAC GGTTCCTCGG TTATACCAAG GATGAGAATA AGCGCCTTGT GGTTGTGCCC
GAAGAAGCTG AAATCGTAAA GCGCATTTAT CGAGAATACC TTGAGGGTGC AAGTATGCTG
AAGATTGCCC GTGGGTTAGA GGCCGACGGT ATCTTAAACG GTGCAGGCAA TGAGCGCTGG
CACACCAGTA ACATTAATAA TATTCTGCGA AACGAAAAGT ACATCGGAGA TGCGCTCTTG
CAGAAAACCT ACACGGTTGA TTTTCTTACA AAAAAGCGGG TTAAGAACAA CGGTATCGTT
CCGCAGTACT ATGTAGAGAA CAGCCATGAA GCCATCATCC CGCGTGAAGT TTTCATGCAG
GTGCAGGAAG AGCTTATCCG CCGCCGTATT GTGCACACAA GCCCAAACGG AAAGACCAGA
ACCTTCAGCA GCAACCACGT CTTTGCTCAG ATAATCATCT GCGGCAAATG CGGTGAGGTT
TTTCGCAGGG TACATTGGAA CAACAGAGGT AAAAAGTCCA TCGTCTGGCG CTGTGTCAGC
CGGTTAGAAA ACACCGGCCT ATTCTGCGAT GCCCGCACGG TACTGGAGAG CACCATCGAG
CAAGTGCTAG TCACCGCCAT TAATCAGACG CTTTGCGACA AAGACTCTTT CCTCACAACT
CTACGGGATA ACATCGCCAC CATCATAAAT CGTGAAAGCG ACAAGGCCTT AGCGGATATC
GATAAGCGGT TGGAGGAGTT GCAAACGGAA CTTCTAAAAT TGGCCACTTC CAATGCGGAT
TATGAAAAGG TTGGCGATGA GATTCACAGC CTGCGCGATC AGAAGCAAAA GCTGCAGGTT
GAAAATGCCA ACCGTGATGA ACTCAAAAAG CAGATTGCTG ATATGAGCAC ATTCCTAAAG
AAGCAGTCCA CCGCCCTCGC CGAATACGAC AAGCAGCTTG TTCGGAGGTT GATTGATAAG
GTCACGGTCT TCGAGGATAA ATTCACCGTG GAATTCAAGT CCGGCGTGAC GGTGGATGTG
GATGAATAA
 
Protein sequence
MAVSKNVTVI PARKHTRKSK DEEKPKLRVA AYCRVSTDSE EQATSYDTQI EHYTAYIQGH 
PDWMLAGIFA DDGISGTNTK KREEFNRMID ECMAGNIDMI ITKSISRFAR NTLDCLKYIR
QLKDKNIPVY FEKENINTMD SKGEVMLTIM ASLAQQESQS LSQNVKLGLQ YRYQQGEIQV
NCARFLGYTK DENKRLVVVP EEAEIVKRIY REYLEGASML KIARGLEADG ILNGAGNERW
HTSNINNILR NEKYIGDALL QKTYTVDFLT KKRVKNNGIV PQYYVENSHE AIIPREVFMQ
VQEELIRRRI VHTSPNGKTR TFSSNHVFAQ IIICGKCGEV FRRVHWNNRG KKSIVWRCVS
RLENTGLFCD ARTVLESTIE QVLVTAINQT LCDKDSFLTT LRDNIATIIN RESDKALADI
DKRLEELQTE LLKLATSNAD YEKVGDEIHS LRDQKQKLQV ENANRDELKK QIADMSTFLK
KQSTALAEYD KQLVRRLIDK VTVFEDKFTV EFKSGVTVDV DE