Gene Cfla_1872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1872 
Symbol 
ID9145765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2083663 
End bp2086521 
Gene Length2859 bp 
Protein Length952 aa 
Translation table11 
GC content75% 
IMG OID 
ProductDSH domain protein 
Protein accessionYP_003636968 
Protein GI296129718 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0864183 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGTCCC GCCGCCGCAG CACCCGCTCC GCCCCCACGA CCCCCGCCGC GCGCCCCGGT 
GCCGTCGAGC CCTCGCCCGC GGAGCGCTAC GCCGCGTCCC GCCGCCGCGC CGCCGCCGAG
CACGGGGAGC TCGCCGTCTT CCGCGAGCGC CTCGACTTCC CGCTCGACGA CTTCCAGGTC
GAGGCGTGCG CCGCCCTCGA ACGTGGCAGC GGCGTCCTCG TCGCCGCACC CACCGGCGCA
GGAAAGACCG TCGTCGGCGA GTTCGCGGTG CACCTGGCCC TCGTCTCGGG TCGCAAGGCG
TTCTACACGA CGCCCATCAA GGCGCTGTCC AACCAGAAGT ACGCCGACCT CGCCCGCGTC
CACGGCACCG AACGCGTCGG GCTGCTCACC GGCGACACCT CCGTCAACGG CGACGCCGAC
GTCGTCGTCA TGACCACCGA GGTGCTGCGC AACATGCTCT ACGCCGGCTC GTCGGCGCTC
GACGGGCTCG GGTACGTCGT CATGGACGAG GTCCACTACC TCGCCGACCG CTTCCGCGGC
CCCGTCTGGG AGGAGGTCAT CATCCACCTC CCCGACGACG TGCAGCTCGT GTCGCTGTCC
GCCACCGTCT CCAACGCCGA GGAGTTCGGC GACTGGCTCG CGACCGTGCG CGGCGACACC
ACCGTCGTCG TCAGCGAGCA CCGCCCGGTC CCGCTGGGTC AGCACGTGCT CGTCGGCGAC
CAGCTCCTCG ACCTGTACGC CGGGCACGTC GACCCCACCG ATCCCGGCGT CGACCCACCG
ATCAACCCCG ACCTCACCCA TCTGCTGCGC GGGCGCACCG GCGGCGACCG CGACCGCCGC
GCACCGCGCG GCCGCCCCGG GCGAGCCCGC GACCACCGCC CTGCCGGCGG CGTGCGGCCC
GTCCCGCGCT TCGTCATGGT CGACGAGCTC GCCGAGGCGC GCCTGCTACC CGCGATCGTC
TTCATCTTCT CCCGGGTGGG CTGCGAGGCT GCTGTGCAGC AGTGCCTGTC GGCCGGTGTG
CGGCTCACGA CACCGGCCGA ACGCGCCGAC ATCCGCCGCG TCGCCGAGGA GCGCTGCGCC
GCGATCCCGC CGGAGGACCT CGAGGTGCTC GGCTACGACG CCTTCGTCGA GGGCCTGGTG
CGTGGCGTCG CCGCCCACCA CGCGGGCATG CTGCCGCTGT TCAAGGAGAC CGTGGAGGAC
CTGTTCTCCC GTGGCTGGGT CAAGGTCGTG TTCGCCACCG AGACGCTCGC GCTCGGCATC
AACATGCCCG CGCGCTGCGT GGTCCTCGAG AAGCTCGTCA AGTGGGACGG CTCCGCGCAC
GTCGACGTCA CGCCGGGGGA GTACACCCAG CTGACCGGCC GGGCGGGGCG CCGCGGCATC
GACACCGAGG GGCATGCCGT CGTCGTCGCG CACGGCGCTC TCGACCCCGT GCAGCTCGCC
GGGCTGGCGT CCCGCCGCCT CTACCCGCTG CGCTCGTCGT TCCGTCCCAC GTACAACATG
GCGGTCAACC TCGTGGCCCA GGTAGGCCGC GAGCGCGCCC GTGACGTCCT CGAGACGTCG
TTCGCGCAGT TCCAGGCGGA CCGGGGGGTG GTGGGTCTGG CCCGCCAGGC GCAGACCCAC
GCGGAGGCCC TGGAGGGCTA CGCCCAGGCC ATGACGTGCC ACCTCGGGGA CTTCGCCCAG
TACATGGGGC TGCGGCGGGC GATCACGGAC CGCGAGCGCG GTCTCGCCAA GGAGCAGGCC
GCCACACGCC GCGCCGACGT CGCCCGCACG CTCGAGGGGC TGCACGTCGG CGACGTCGTC
GAGATCCCCG GGGGCCGACG CGCCGAGCAC GCGGTCGTCG TGGATCCCGG CGGTCCGGGC
GGGTTCGACG GGCCACGCCC CACCGTGCTG ACCACCGACC GCCAGGTGCG CCGGCTCACG
GTGGCCGACG CCGGCTCGGG GCTGCGCACG GTCGGGCGCC TGGCCGTGCC GGCGAGGTTC
GAGCTGCGGG TCCCGGCGGC CCGCCGCGAC CTGGCTGCAC GGCTGCGGGC GCGGCTCGAC
GAGCTCGACG TCGTCCCGGC GCCTGCGCGC GCCCCGGGCC GGCGCGCCGA GCGGCGTTCG
GCTGCCGCCG ACGACGCCGA GCTCGCCGCA CTTCGCCGTG AGCTGCGTTC GCACCCCTGC
CACCGGTGCC CGGAGCGGGA GGACCACGCG CGCTGGGCCG AGCGCTGGGA GCGGCTGCGC
TCGGAGCACG CCGCCCTCGT GCGGCGCATC GCGGGGCGCA CGGGCTCGAT CGCGGCGGTC
TTCGACCGGA TCTGCGACGT GCTCGGGACG CTCGGGTACC TCACGAGCGA CGACTCCGGT
GCGCTACGCG TCACCGACGA CGGCCGCTGG TTGCGGCGGC TGTACGCCGA GAACGACCTC
GTGCTCGCCG AGTGCCTGCG TCGGGGGGTG TGGGACGAGC TCGACGCCCC CGGGCTCGCG
GCCGCGGTCT CGACCCTCGT GTACCGCTCA CGGCGCGACG ACGAGGGCGA CGCACGGGTG
CCCGGCGGGC CCGACGGCCG GCTCGGCCGG GCGCTCGACG CGGCGGTGCG GGCGTGGTCG
CAGCTCGACG ACCTCGAGCG GGAGGCACGC CTCGAGACGA TCCAGCCGCT CGACCTCGGT
CTCGTGCAGC CCGTCCACCG GTGGGCTGCC GGCCGCAGCC TGGACGCCGT GCTGCGGGGC
TCGGACCTGG CCGCCGGCGA CTTCGTCCGC TGGTGCAAGC AGGTCATCGA CGTGCTCGAC
CAGGTCTCCG GCGCCGCCCC GACCGCGCGC CTGCGCACCA CCGCGGCGAA GGCGGTCACG
GCGATGCGCC GCGGTGTCGT GGCGTACTCG ACCGTCTGA
 
Protein sequence
MPSRRRSTRS APTTPAARPG AVEPSPAERY AASRRRAAAE HGELAVFRER LDFPLDDFQV 
EACAALERGS GVLVAAPTGA GKTVVGEFAV HLALVSGRKA FYTTPIKALS NQKYADLARV
HGTERVGLLT GDTSVNGDAD VVVMTTEVLR NMLYAGSSAL DGLGYVVMDE VHYLADRFRG
PVWEEVIIHL PDDVQLVSLS ATVSNAEEFG DWLATVRGDT TVVVSEHRPV PLGQHVLVGD
QLLDLYAGHV DPTDPGVDPP INPDLTHLLR GRTGGDRDRR APRGRPGRAR DHRPAGGVRP
VPRFVMVDEL AEARLLPAIV FIFSRVGCEA AVQQCLSAGV RLTTPAERAD IRRVAEERCA
AIPPEDLEVL GYDAFVEGLV RGVAAHHAGM LPLFKETVED LFSRGWVKVV FATETLALGI
NMPARCVVLE KLVKWDGSAH VDVTPGEYTQ LTGRAGRRGI DTEGHAVVVA HGALDPVQLA
GLASRRLYPL RSSFRPTYNM AVNLVAQVGR ERARDVLETS FAQFQADRGV VGLARQAQTH
AEALEGYAQA MTCHLGDFAQ YMGLRRAITD RERGLAKEQA ATRRADVART LEGLHVGDVV
EIPGGRRAEH AVVVDPGGPG GFDGPRPTVL TTDRQVRRLT VADAGSGLRT VGRLAVPARF
ELRVPAARRD LAARLRARLD ELDVVPAPAR APGRRAERRS AAADDAELAA LRRELRSHPC
HRCPEREDHA RWAERWERLR SEHAALVRRI AGRTGSIAAV FDRICDVLGT LGYLTSDDSG
ALRVTDDGRW LRRLYAENDL VLAECLRRGV WDELDAPGLA AAVSTLVYRS RRDDEGDARV
PGGPDGRLGR ALDAAVRAWS QLDDLEREAR LETIQPLDLG LVQPVHRWAA GRSLDAVLRG
SDLAAGDFVR WCKQVIDVLD QVSGAAPTAR LRTTAAKAVT AMRRGVVAYS TV