Gene Ccel_2057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2057 
Symbol 
ID7310760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2415617 
End bp2416741 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content39% 
IMG OID643608991 
ProductDNA protecting protein DprA 
Protein accessionYP_002506383 
Protein GI220929474 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.448378 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGAAA TTGAATATTG GATTTGGTTA ACGTCTTTAG AGGGGTTAAG TTCAAAAAAA 
GCTTTAAACT TACTGGAAAC GTATAGAAAT CCTGAGGTTA TATACGGTCT TTCTGAAAGT
GAGCTTCAAA ACACAAGGGG TTTAACTGAA AAAAATGTAA AGGAGCTATT AAACTCAAAT
AAAAGGGAGA GGGTTGGCTC CATATATCAG ATGTTAGTCC GGTATAACAT AAAAATGGTA
AATATTTTTG AAGAAAACTA CCCTCAAAAG CTGAAAAATA TTTATGATCC GCCCATTGCT
TTGTATTATA GGGGAAACCT TGACTCAGAC AGCTTTTCAA TAGCGGTTGT GGGATCAAGA
AGGACCACCG GGTATGGTGC GAATACCGCC AGAAAATTGT CATATGACCT GGCAATGAGG
GGTGTAACAA TAGTAAGTGG TCTGGCCAGG GGGATAGACA GTATTGCCCA TAAAGGCTGT
CTGGACGCAG GAGGAAAAAC CATAGCCGTT CTTGGTTCGG GGCTTGACAA TATATATCCC
CCGGAAAATG CAGGACTGTT TAAGGATATA ATTGATTCCG GGGGCTTGGC ATTATCTGAA
TACCCTCCGG GAATGCCGCC GCTTCAGCAT AATTTCCCGG CACGAAATAG AATAATAAGC
GGAATTTCGG GCGGTGTCGT TGTGATTGAG GCAGCTAAGA GGAGCGGTTC CTTAATTACG
GCAGGCTGTG CTTTAGAGCA GGGGAGAGAG GTTTTTGCTG TTCCGGGAAA TATCGACTGT
GCGTACAGCA TGGGAACAAA CCAATTAATT AAAGAAGGAG CTAAACTGGT ATTAAATGCC
ACAGATGTTC TGGAAGAATT TGAATACAAC GGAATACAGA ATTTTACACC CGTTCAGGGG
GATATAGATG AGAAAATCAG TAAAAAATAT CTTAATCTAT TTAAAGGGCT TTCAGCAGGT
GAAATAAAAA TATTAAAGGT AATTTTTAAC GGTGCAAATA ATATTGATGA AATTCTTGAG
AGAAGTAATT TTTCTGCGAA AGATGCAAGC AGTATACTGT TTATGCTTGA AATGAAGGGT
GTAATCAAAC AGAATCCGGG TAAATTGTTT GAAGTAATAA TTTAG
 
Protein sequence
MREIEYWIWL TSLEGLSSKK ALNLLETYRN PEVIYGLSES ELQNTRGLTE KNVKELLNSN 
KRERVGSIYQ MLVRYNIKMV NIFEENYPQK LKNIYDPPIA LYYRGNLDSD SFSIAVVGSR
RTTGYGANTA RKLSYDLAMR GVTIVSGLAR GIDSIAHKGC LDAGGKTIAV LGSGLDNIYP
PENAGLFKDI IDSGGLALSE YPPGMPPLQH NFPARNRIIS GISGGVVVIE AAKRSGSLIT
AGCALEQGRE VFAVPGNIDC AYSMGTNQLI KEGAKLVLNA TDVLEEFEYN GIQNFTPVQG
DIDEKISKKY LNLFKGLSAG EIKILKVIFN GANNIDEILE RSNFSAKDAS SILFMLEMKG
VIKQNPGKLF EVII