Gene Ccel_2119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2119 
Symbol 
ID7310817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2483235 
End bp2486063 
Gene Length2829 bp 
Protein Length942 aa 
Translation table11 
GC content40% 
IMG OID643609053 
Productexcinuclease ABC, A subunit 
Protein accessionYP_002506444 
Protein GI220929535 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000168396 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAGGG ATAATATTTT TATAAAAGGT GCACGTGAAC ATAACCTGAA AAATGTTGAT 
GTAGAGATAC CGAGAGATAA ATTTGTTGTT ATAACAGGAT TGAGTGGTTC AGGTAAGTCT
TCTCTTGCCT TCGATACAAT TTATGCAGAA GGTCAGAGGA GGTATGTTGA ATCTCTCTCA
TCCTATGCAA GACAGTTTCT TGGACAGATG GACAAGCCGG ATGTAGACTA TATCGAAGGT
TTGTCACCGG CAATATCTAT AGATCAGAAG ACAACCACCA GAAACCCAAG ATCTACAGTA
GGAACTGTAA CTGAAATTCA TGATTATCTC AGACTCCTAT ACTCAAGGAT AGGAATTCCT
CATTGCCCTA AGTGCGGCAA GGAGATTGCA CAGCAAACTA TTGACCAGAT GGTGGATCAG
ATAGTATCTT TTGAAGAAGG CACCAGGATA CAGTTACTTG CTCCTGTAGT TAGAGGGAGA
AAAGGTGAGT ATCATAAGCT CATAGAGAAT GCAAAAAAAG ACGGCTTTGT GAGATTGCGT
GTAGACGGGC AAATAGTAGA TGTAAATGAA GAGATTAAGC TGGACAAAAA TAAAAAGCAT
AATATTGAGA TAGTCGTTGA TAGGCTTGTA GTCCGTGGAG ATATTCAGAA AAGACTGGTT
GATTCATTGG AGACAGTACT TCGCTTAAGC GGTGGTATAG TTATTGTTGA TTTAGTAGGT
AAAGAAGAAA TACTATTTAG TCAGAACTTT GCATGCAGTG ATTGCGGAAT TAGTATAGAA
GAGCTTGTAC CAAGAATGTT TTCATTTAAC AATCCGTTTG GTGCCTGCCA AACATGTACA
GGATTGGGAA ATCTCATGAA GGTAGATCCC GAGCTTGTTA TACCTGACAG GTCTCTGTCA
CTTACAAATG GTGCAATTAG TGTTACAGGT TGGAATATTG GGAGTGAGGA TGCGTATATC
AGGATGATAT TCAATGCTCT TGCCAAACAT TACAAATTTG ATTTAGATAC ACCGTTTAAT
AAGCTGTCCG GAGAAATAAT AGATATTATA CTTTATGGGA CAAGGGGCGA AAAAATAAAA
GTAGATTACG AAAGAGAGTA CGGCAGCGGG TCTTATATGG CGGGATTTGA GGGTGTAATT
AATGCAATTG AACGCCGCCA CAATGAGACC CAGTCCGAAA GTTCGAAACA GTATTACGAA
CAGTTCATGA GCAACAACCC ATGCCCGGAC TGTAAAGGTG CCAGATTAAA ACCTGAAAGT
CTTGCTGTTA CAGTAGGGGG TAAAAATATA CACAAAGTAT CTTCCATGTC TGTAGCAGAT
ACAAAAGATT TCTTTGACAA CATTGAGCTT AGTGAAAGAG ACAAGATGAT AGCAAACCAG
ATTTTAAAAG AGATTGCAGC CAGAATAGGC TTTCTCGTTG ACGTTGGTCT GGACTACCTT
ACACTTTCCA GACCGGCTGG GACTTTATCC GGAGGAGAGG CACAGAGAAT AAGGCTAGCT
ACCCAGATTG GCTCGGGCTT GATGGGGGTA CTTTATATTC TGGATGAACC CAGTATCGGA
CTGCATCAAA GAGATAACGA AAAGCTTCTT AAAACCTTAA ACCGACTTAG GAACCTTGGA
AATACGCTAA TTGTTGTAGA GCATGACGAA GATACCATGA ATGCAGCAGA TCATATCATA
GATATGGGTC CGGGTGCTGG AATCCATGGC GGACACGTTG TTGCAGAGGG AACCCTCGAT
AAAATATTAA AAAGTGAAAA ATCACTTACA GGACAATACC TTAGCGGACG AAAAAAAATA
GAGGTTCCAG AGATAAGAAG AAAACCAAAC GGAAAATGGC TGGAAATTGT GGGAGCAAGG
CAGAATAATC TTAAAAACAT CAACGCAAAG ATTCCTCTCG GGGTGTTAAC ATCAGTTACG
GGTGTTTCCG GCTCGGGAAA AAGTACATTG ATAAATGAGA TTTTGTATAC CAGTCTTGCC
AGCCAGTTAT ATAGAGCCAA GGCAAGACCG GGTAATCATG ATACAATAAA AGGTATTAAA
AATATTGACA AGGTTATAAG TATTGACCAA TCACCAATCG GAAAAACCCC GAGATCAAAC
CCGGCAACCT ACACAGGTGT TTTTGATCTT ATCAGAGAAG TTTTTGCTTC AACTACCGAA
GCAAAAATGA AAGGATACAA AAACGGACGG TTCAGCTTTA ATATAAAGGG TGGAAGGTGT
GAAGCCTGCT CCGGGGATGG AATTATAAAA ATTGAAATGC ATTTCCTTCC TGACGTATAT
GTTCCTTGTG AAGTTTGTAA GGGAAAAAGG TACAACAGGG AGACTCTGGA AGTTAAGTAC
AAGGGGAAAA GCATATCGGA TGTTCTAAAT ATGACAATTG ACGATGCCCT GGAGTTTTTC
AAGAACATCC CCAAGATACA AAGAAAATTC CAAACATTGT ATGATGTCGG CTTGGGGTAT
GTCAAGGTTG GTCAGCCATC TACAACCCTG TCGGGAGGCG AGGCCCAGAG AGTAAAGCTT
GCTACCGAGT TATCCAAGCG AAGTACAGGA AAGACACTGT ACATTTTGGA CGAGCCTACA
ACCGGACTTC ATGTAGCTGA TGTTCACAGA TTGATAGATA TTCTTCAAAG GCTTGTGGAT
GCGGGTAATT CCATAGTAGT TATTGAACAC AATCTTGACG TCATAAAAAC TTCTGATTAT
ATAATTGATC TTGGACCCGA AGGTGGTAAC AAGGGAGGAA CGATTATTGC ACAGGGAACG
CCCGAAGAAG TTGCAAAAGT GAAAGAATCC TATACAGGTC AATATCTAAG TAAAATTTTA
GAAAAATAA
 
Protein sequence
MIRDNIFIKG AREHNLKNVD VEIPRDKFVV ITGLSGSGKS SLAFDTIYAE GQRRYVESLS 
SYARQFLGQM DKPDVDYIEG LSPAISIDQK TTTRNPRSTV GTVTEIHDYL RLLYSRIGIP
HCPKCGKEIA QQTIDQMVDQ IVSFEEGTRI QLLAPVVRGR KGEYHKLIEN AKKDGFVRLR
VDGQIVDVNE EIKLDKNKKH NIEIVVDRLV VRGDIQKRLV DSLETVLRLS GGIVIVDLVG
KEEILFSQNF ACSDCGISIE ELVPRMFSFN NPFGACQTCT GLGNLMKVDP ELVIPDRSLS
LTNGAISVTG WNIGSEDAYI RMIFNALAKH YKFDLDTPFN KLSGEIIDII LYGTRGEKIK
VDYEREYGSG SYMAGFEGVI NAIERRHNET QSESSKQYYE QFMSNNPCPD CKGARLKPES
LAVTVGGKNI HKVSSMSVAD TKDFFDNIEL SERDKMIANQ ILKEIAARIG FLVDVGLDYL
TLSRPAGTLS GGEAQRIRLA TQIGSGLMGV LYILDEPSIG LHQRDNEKLL KTLNRLRNLG
NTLIVVEHDE DTMNAADHII DMGPGAGIHG GHVVAEGTLD KILKSEKSLT GQYLSGRKKI
EVPEIRRKPN GKWLEIVGAR QNNLKNINAK IPLGVLTSVT GVSGSGKSTL INEILYTSLA
SQLYRAKARP GNHDTIKGIK NIDKVISIDQ SPIGKTPRSN PATYTGVFDL IREVFASTTE
AKMKGYKNGR FSFNIKGGRC EACSGDGIIK IEMHFLPDVY VPCEVCKGKR YNRETLEVKY
KGKSISDVLN MTIDDALEFF KNIPKIQRKF QTLYDVGLGY VKVGQPSTTL SGGEAQRVKL
ATELSKRSTG KTLYILDEPT TGLHVADVHR LIDILQRLVD AGNSIVVIEH NLDVIKTSDY
IIDLGPEGGN KGGTIIAQGT PEEVAKVKES YTGQYLSKIL EK