Gene Cthe_1701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1701 
Symbol 
ID4808876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2023413 
End bp2024702 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content35% 
IMG OID640107114 
Productrecombinase 
Protein accessionYP_001038115 
Protein GI125974205 
COG category[L] Replication, recombination and repair 
COG ID[COG1961] Site-specific recombinases, DNA invertase Pin homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTGTTA GGATTATTGA ACCTGTAAAA AAGAAGGAAA ATAAAAAGAA AAGAGTTTGT 
GCTTATGCAA GAGTTTCAAC CGGCAGTGAT GCTCAAGGTG AATCTTTAGA AAATCAAATC
CAGTATTATG AAAATCTGAT TTCAAATAAT CCTGATTACG AATATGCAGG TGTATTTGCT
GATAGAGGAA TTACCGGCAC TACAGATAAT AGACCAGAGT TTCAAAGAAT GCTTAATCTT
GCAAAAGAAG GGAAAATAGA CTTAATCATC ACCAAATCTA TCTCAAGATT TGCAAGGAAT
ACAGCAATAA TGCTTCAAGT AGTAAGAGAA CTAAAGGACA TTGGTGTAGA AATAATATTT
GAAAAAGAGA ATATCAGAAC TTTATCAGGG GATGGAGAGC TTATGCTAAC CGTCCTCTCT
TCTTTTGCCC AGGAAGAAAG TAAAAATATC AGTGATAACT TAAAGTGGAG GGTAAAAAAG
AAGTTTGAAC GAGGAGAGCT GATTATAAAT ACCACAAGAT TTTTAGGTTA TGACAAGGAT
GAATACGGCG ATTTAGTTAT AAACCCAAAA GAAGCAGAAA TAGTTAAAAG AATATTTGAA
GATTATTTAA AAGGCAAAGG AACATTTACC ATAGCCAAAG AATTAAATGA AGATAAAGTG
CCTACTGTTG CAGGCGGCAG ATGGCAGGAA AGTACAATTT TAAATATCCT CAAGAATGAA
AAATACAAGG GAGATGCCAT ACTTCAAAAG TATTACACAC CAGACCATCT GAGAAAAGTA
AGTGTTAGAA ATGAAGGCGT AATTGACAGC TATTATATTG AAGATAATCA CTCTCCCATA
GTTTCAAGAG AAATGTGGGA GCAGGTTCAG ATAGAAATTG CAAGAAGAGC AAAGGCAAAA
GGAAATAAAG CAGGAGATAC AAAAAAATAT ACAAACAGAT ATCCATTAAC AGGAATGCTT
TTCTGCAGTA AATGCGGCTC TACTCTAAGG AGAAGAACTT GGAACAGCAA ATTAAATTGC
AGAAAGATTG TATGGCAGTG CAGTAATTAT ATTAAAAACG GAAAAGACGC CTGCAGTGGA
ACATCAATTG ATGATGAGGT TATAAGCAGG CTTAATATAG AAGAACCAAT AATTGTAAGG
GAGGAAGTTA AGGATGGCAA GAAATATTAC ACTTATACCT GCAAGAGCAA ACAGAAACAA
TCTGGCAGAG CAAATACAGA CGCAGAAAAA GAGAATGGGA GCTTATTGCA GAGTATCAAC
AGACCAATTA GAACAGTTAT CAAGCTATGA
 
Protein sequence
MRVRIIEPVK KKENKKKRVC AYARVSTGSD AQGESLENQI QYYENLISNN PDYEYAGVFA 
DRGITGTTDN RPEFQRMLNL AKEGKIDLII TKSISRFARN TAIMLQVVRE LKDIGVEIIF
EKENIRTLSG DGELMLTVLS SFAQEESKNI SDNLKWRVKK KFERGELIIN TTRFLGYDKD
EYGDLVINPK EAEIVKRIFE DYLKGKGTFT IAKELNEDKV PTVAGGRWQE STILNILKNE
KYKGDAILQK YYTPDHLRKV SVRNEGVIDS YYIEDNHSPI VSREMWEQVQ IEIARRAKAK
GNKAGDTKKY TNRYPLTGML FCSKCGSTLR RRTWNSKLNC RKIVWQCSNY IKNGKDACSG
TSIDDEVISR LNIEEPIIVR EEVKDGKKYY TYTCKSKQKQ SGRANTDAEK ENGSLLQSIN
RPIRTVIKL