Gene Ccel_2143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2143 
Symbol 
ID7312327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2510829 
End bp2512343 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content36% 
IMG OID643609075 
ProductRecombinase 
Protein accessionYP_002506466 
Protein GI220929557 
COG category[L] Replication, recombination and repair 
COG ID[COG1961] Site-specific recombinases, DNA invertase Pin homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAAAA CCTGCATTTA TCTAAGAAAA TCAAGAGAAG ACGAGAAAAT GGAAAAGGAG 
CTTGGGAAAG GTGAAACCCT ATCAAAACAC AGGGAAGATT TGTTGAGTTA TGCTAAAATG
AAAGATCTGT GTATTGTAAA TATTTACCAA GAACTTGTTA CAGGTGAAAG TTTGCTTTAC
AGGCCGGCTA TGCTTGAGCT TCTAAAGGAC GTTGAAACGG GACTGTACGA AGCCGTTTTG
GTAATGGATT TACAAAGGCT CGGTCGTGGA GATATGGAAG AACAAGGGAT TATTCTTAAA
GCATTTAAAA ATACAAATAC AAAAATTATA ACCCCGGACA AGGAGTATGA TCTTTCTAAT
GAGTTTGATG AAGAGTATAG CGAATTTGAA GCGTTCATGA GCCGTAAGGA ATATAAAATG
ATTAATAAGC GTCTTCAGCG GGGAATTATT CATTCCGTAA ATAATGGAAA TTACAACTCT
CCCTACCCGC CTTTCGGTTA TACTATAAAG CAGGAAAAAT TGGGACGTAC TTTGGAACCA
CACCCTCAAC AGTCGGAAAT ACTAAAGAGT ATATTCGATT GGTACGTTAA TGAATCAATA
GGCAGTCAGA TTATTGCACA AAGACTGAAC AGTCTGGGGT TAAAGACCAA TAAAAACAAC
AGCTGGACTT GTCAGGCAGT AACAGGCATC CTGAAAAATC CTGTTTATAC AGGAAAAATA
GCATGGCGTA AAACAACAAG CAACCGCATA TCCAAAAAAA AGAATAGAAG GATGCAAGAC
AGACAAGACT GGATTCTTGC AGAAGGAAAA CATCCTGCAT TGATATCGGA GAGTATATAT
GAAAAGGCAC AAAACATTAT GAAAAATAAT AGTAAAACCC ATACAAAACA ATCTGTCAGC
CTTCACAACA CTTTAGCCGG AATAATTGTT TGCGGCGTTT GCGGAGCTAA AATGCGATAC
AGACCTTATC TTAACTCTGA GCCTCATCTT ATATGTATTA ATAAATGCGG AAATAAAAGC
AGTAAGTTTG GTTATATTGA GGACTGTGTT ATTTCAAGCC TTAAAGAATA TCTGAGCGAA
TACAATTTTA AGCTGTATGG AGAACCGCAA GGAAAAAACA GCAGTCAGGC TGCTTTTAAA
AGCAGTGTTG CTCTACTTGA AAAGCAACTG AAGGAAGCAA GTATTCAGAA AGACAATATT
TACAATCTTT TAGAAAAGGG TATTTATTCT ATAGAGGACT TTAATCAAAG ACTACTTTCC
ATTAATACTA GAATATCCGA ACTGCAGCAC TCTATTTCGA ATACGGGAAT ATTGTATAAA
AAATCACTTG TACAAAAGCC CGGTAAAACT GAGTTTTGCA GTGGCCCCAC AAAGATTATG
CACATATATA AAAATTTAAA AAATCCAGAA GATAAAAATA TTTTACTAAA AAGTGTTCTT
GATAAAGTTG AATATATCAA AGGTAAGGAC TGCAGGAACG ATAATTTTAC GCTAAGAATA
TACCCAATGT TTTAA
 
Protein sequence
MIKTCIYLRK SREDEKMEKE LGKGETLSKH REDLLSYAKM KDLCIVNIYQ ELVTGESLLY 
RPAMLELLKD VETGLYEAVL VMDLQRLGRG DMEEQGIILK AFKNTNTKII TPDKEYDLSN
EFDEEYSEFE AFMSRKEYKM INKRLQRGII HSVNNGNYNS PYPPFGYTIK QEKLGRTLEP
HPQQSEILKS IFDWYVNESI GSQIIAQRLN SLGLKTNKNN SWTCQAVTGI LKNPVYTGKI
AWRKTTSNRI SKKKNRRMQD RQDWILAEGK HPALISESIY EKAQNIMKNN SKTHTKQSVS
LHNTLAGIIV CGVCGAKMRY RPYLNSEPHL ICINKCGNKS SKFGYIEDCV ISSLKEYLSE
YNFKLYGEPQ GKNSSQAAFK SSVALLEKQL KEASIQKDNI YNLLEKGIYS IEDFNQRLLS
INTRISELQH SISNTGILYK KSLVQKPGKT EFCSGPTKIM HIYKNLKNPE DKNILLKSVL
DKVEYIKGKD CRNDNFTLRI YPMF