Gene Ccel_3421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3421 
Symbol 
ID7312491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3975550 
End bp3976770 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content35% 
IMG OID643610326 
Productpeptidase M14 carboxypeptidase A 
Protein accessionYP_002507689 
Protein GI220930780 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2866] Predicted carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000158618 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAAAA GAATAAAGGT TTTTATACTT GCTGTGATCT ACTCACTAAC AGCCTTCAGT 
ACATTTTCTG TACAAGCATC AAAAGTTACA TATTCAAAAT CTCTACAACA AATATATAAG
AGTAGGGATG TATATAAAGA CACACAGAAA CGTCTTTCAG AGTTTAATAA AAACTACAGT
AATATTACAT ATCTCTTTTC GGCAGGAAAA AGTGTTCAAA AAAGAGATTT ATCAGTATTA
AAAATAGGTA ATGGGTCCAA GAAGATTTTT ATAAATGCAG CTCACCATCC AAGAGAATAC
ATTGGTACAA TCCTTACTCT AAATCAGATT CAGAATTTAC TGGAATCGTA TGCCAATAAC
GGTAGTATTG ACGGCCAGAA AATCAGGAAT CTGCTCGATA AACAGGTAAC CTTTTATTTC
ATGCCACTGG TTAATCCTGA TGGAGTGCAG ATATGTATCA ATGGAACTCA ATCCTATTAT
TTCAACGCCA ACAAGGTAGA TCTCAATCAT AATTACGATG CACTCTGGAG TAAAAAAATT
ACCTCTACAT ATTCTACCGG AGCCAAACCC TTTTCTGAGC CGGAAACTCA GGCAGTAAGA
GATTTATGCC TTAATATAGA ATTTGATCTG ACAATTGCGT ATCATGCTGC AGGAGATATA
ATCTATTGGT ATTTTGGGCA GCAAGGTGCG GATAGAACCA GGGACCTTGC ATATGCAAAT
ATCCTAAAAA CTACAACCGG CTATAGTTTG GTGAGTTCTG CTAACTATAA ATCATCCACT
TCAGGTTTTA AAGATTGGTG TGTTCAGAAA TTAAAGATAC CGTCATTTAC AATTGAAATA
GGAGGCAAAC GGGGTATCAT CAAACCGGTA GAATGGTCAT ATTACAATAC TATTTGGAAG
CAAAATAAAC TTGTCCCTGT TAGAGTTGCA AAACAACTGA TGAAGCAGAC AAAATTTAAT
GGTGATAAAA CGACTGCATT AATATATAAG AACAGTTTAT TCCGCCAAGG ACAGATACTT
TCAGTCAACG GGAAGCAATA TATTGCTGAA AAAGCTGTTC CTATGCTGGC CGGAAAATTA
TCTATAAATC AACAGAACAA GCTTAATAAA ACAAAAATTT CTATTAAAAA GACACCCTAT
TTGAGGCTGG AAACCTTGGC AGAATGTTTG AATCTAAAAT TTAAGTTTGA AAAAACCTCA
AATACGGTTT ATATTTCGTA A
 
Protein sequence
MSKRIKVFIL AVIYSLTAFS TFSVQASKVT YSKSLQQIYK SRDVYKDTQK RLSEFNKNYS 
NITYLFSAGK SVQKRDLSVL KIGNGSKKIF INAAHHPREY IGTILTLNQI QNLLESYANN
GSIDGQKIRN LLDKQVTFYF MPLVNPDGVQ ICINGTQSYY FNANKVDLNH NYDALWSKKI
TSTYSTGAKP FSEPETQAVR DLCLNIEFDL TIAYHAAGDI IYWYFGQQGA DRTRDLAYAN
ILKTTTGYSL VSSANYKSST SGFKDWCVQK LKIPSFTIEI GGKRGIIKPV EWSYYNTIWK
QNKLVPVRVA KQLMKQTKFN GDKTTALIYK NSLFRQGQIL SVNGKQYIAE KAVPMLAGKL
SINQQNKLNK TKISIKKTPY LRLETLAECL NLKFKFEKTS NTVYIS