Gene Cfla_2016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_2016 
Symbol 
ID9145911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2243974 
End bp2246067 
Gene Length2094 bp 
Protein Length697 aa 
Translation table11 
GC content75% 
IMG OID 
ProductPeptidyl-dipeptidase Dcp 
Protein accessionYP_003637110 
Protein GI296129860 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0435226 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACCCA TGACCTTCGA CGCGCCGCTG CTCGACCCTG CCCACCCCTT CGCCCGTGCG 
TCCGACCTGC CCTACGGGCT GCCCGACTTC CGGGTGGTGC ACGAGGAGCA CTACCTGCCC
GCGCTGGTCG CCGGCATGGC GGAGCAGCGT GCCGAGGTCG AGGCGATCGC GACCGACCCG
GCGCCGCCCA CGGTGGCGAA CACCCTCGAG GCGCTCGAGC GCTCGGGCAG GCTGCTGTCG
CGCGCCGCGT CGGCGTTCTA CGTGCAGTCC GGCGCGGAGT CGACACCGGG CCTGCAGGCC
GTCGAGGAGC AGGTCGCGCC GCTGCTGGCC GCCCACTCCG ACGCGATCTG GCTGGACGCC
CGGCTGCACG CCCGCGTCGA GGCGCTCGCG GCGTCCCTCG AGGGGACGGA GCTCGCCCCC
GACACGGCGT GGCTGCTGCA CCGCACGCGT GAGCGGTTCA CGCGCGCCGG CGTCGGCCTG
CCGGAGGCCG ACCAGGAGCG GCTGCGTGCG ATCAACGCGG AGATCACGTC CCTCGACGCG
GCGTTCGGCC GGCTGCTGCT CGCGGCGACC AATGCCGCCG CGGTCCTCGT GACCGACGAG
GCCGAGCTCG ACGGCCTGCC CGACGACGCC CGCGCCGCCG CCGCCCAGGC GGCCACCGCG
GCCGGCCACG AGGGCGCGTG GCTGCTGGAG CTGCAGCTGC CCACGCAGCA GTCCGTGCTC
TCCCTGCTGC GCGACCGCGG GCTGCGCGAG CGCGTCATGC GGGCGTCCCT CACGCGCGGC
GCCGGCGGCC AGCACGACAC CCGGACGGCG CTGCTCGGCC TGGTGCGCCT GCGTGCCGAG
CGCGCGCAGC TCCTCGGGTT CGAGCACCAC GCGGCCTACG TCGCGGCGGA CGCGACGGCC
GGCACGGCGC AGGCCGTCGA GGAGATGCTC GCGCGCCTCG CACCGGCCGC CGTCGCGAAC
GCGCGCACCG AGGCGGTGGA CCTCGAGCGG GCCCTGCAGG CCGACCACCC GGGCGCGACG
CTCGAGCCGT GGGACTGGGC GTACTACGCC GGCCGCGTCC GTCAGGAGCG CCGCTCGCTC
GACGAGGCCG CGCTGCGTCC GTTCCTGGAG CTCGAGCGCG TCCTCACGCG GGGTGTGTTC
CACGCGGCGA ACCGGCTCTA CGGCCTGACG TTCTCCGAGC GCCACGACCT CGTCGGCTAC
CACCCCGACG TCCGGGTCTA CGAGGTCTTC GACGCCGACG GGGCCGGCCT CGGGCTGTTC
CTCGGCGACT TCTGGACACG CCCGGCCAAG CGCGGCGGGG CGTGGATGAA CTCGCTCGTC
GACCAGTCGA TGCTGCTGGG GGAGCAGCCC GTGGTCGTCA ACAACCTCAA CGTGCCCAAG
CCCCCGCCGG GCCAGCCCAC GCTGCTGACG TGGGACGAGG TCATCACGCT GTTCCACGAG
TTCGGCCACG CGCTGCACGG GCTGCTGTCG GCCGTGCGGT ACCCGTCGCA GTCCGGCACG
AGCGTGCCGC GCGACTTCGT CGAGTACCCG TCGCAGGTCA ACGAGATGTG GGCGTGGGAC
ACCGAGGTGC TGCGCTCGTA CGCCGTGCAC CACGCGACAG GTGAGCCGCT GCCGCAGGAA
TGGGTGGACA CGCTGCTCGC CGCCCGCCAG GACGGCGAGG GCTTCGCCAC CACCGAGTAC
CTGGCTGCCG CGCTGCTCGA CCAGGCGTGG CACCGGCTGG CGCCCCAGGA CGTCCCGGCG
GACCCGGACG AGGTCGAGGC GTTCGAGGCG CGGGCGCTGG CCACCGCCGG CGTCGACCTG
CCCACCGTCC CGCCGCGGTA CCGCACGACG TACTTCAACC ACATCTTCTG CAGCGGGTAC
TCCGCGGGCT ACTACGCGTA CATCTGGTCC GAGGTGCTCG ACGCCGACAC CGTCGGGTGG
TTCGCCGAGA ACGGCGGCCT GCGCCGCGAG AACGGCGACG TGTTCCGCGC CCGGCTGCTG
GGTCGCGGCG GGTCGATCGA CCCGCTGCAG TCGTTCCGCG ACCTGCGCGG ACGCGACCCG
CGCATCGAGC CGCTGCTCGA GCGCCGAGGC CTGTCCGGGG CGGTCGCCCG GTGA
 
Protein sequence
MGPMTFDAPL LDPAHPFARA SDLPYGLPDF RVVHEEHYLP ALVAGMAEQR AEVEAIATDP 
APPTVANTLE ALERSGRLLS RAASAFYVQS GAESTPGLQA VEEQVAPLLA AHSDAIWLDA
RLHARVEALA ASLEGTELAP DTAWLLHRTR ERFTRAGVGL PEADQERLRA INAEITSLDA
AFGRLLLAAT NAAAVLVTDE AELDGLPDDA RAAAAQAATA AGHEGAWLLE LQLPTQQSVL
SLLRDRGLRE RVMRASLTRG AGGQHDTRTA LLGLVRLRAE RAQLLGFEHH AAYVAADATA
GTAQAVEEML ARLAPAAVAN ARTEAVDLER ALQADHPGAT LEPWDWAYYA GRVRQERRSL
DEAALRPFLE LERVLTRGVF HAANRLYGLT FSERHDLVGY HPDVRVYEVF DADGAGLGLF
LGDFWTRPAK RGGAWMNSLV DQSMLLGEQP VVVNNLNVPK PPPGQPTLLT WDEVITLFHE
FGHALHGLLS AVRYPSQSGT SVPRDFVEYP SQVNEMWAWD TEVLRSYAVH HATGEPLPQE
WVDTLLAARQ DGEGFATTEY LAAALLDQAW HRLAPQDVPA DPDEVEAFEA RALATAGVDL
PTVPPRYRTT YFNHIFCSGY SAGYYAYIWS EVLDADTVGW FAENGGLRRE NGDVFRARLL
GRGGSIDPLQ SFRDLRGRDP RIEPLLERRG LSGAVAR