Gene Cfla_1572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1572 
Symbol 
ID9145458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1746177 
End bp1748285 
Gene Length2109 bp 
Protein Length702 aa 
Translation table11 
GC content74% 
IMG OID 
Productcellulose-binding family II 
Protein accessionYP_003636669 
Protein GI296129419 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0280577 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00883315 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCGGAC GGTCCGACCG CGCACGCACG GCCGGAGCTC TGGCCGCCGC ACTCGCCCTC 
GCCACGCTCG CCCCCACGTC CGCCATGGCG GCTCCCCCGG GCACCGCGGG CGCCGTCGGG
GAGGAGGTCG GGGTCGCGTT CTCGACGACG GGCCGGGTCT TCAGCGGGTC GCTGCAGATC
GGCCTGTCGA CGTCCGTGCC GGGTGCCGAG GTCCGGTACA CCACGGACGG CACCACGCCG
ACCCTCTCGT CGCCGGTCGC CTCGGGGCCG CTGACCCTCA CGCGCAGCAC CGAGGTCCGC
GCGCAGGCGT TCGTGGCCGG GGCTCCCACC GGCGAGCCCA CGTCGCAGCG GTACGTCGCG
AGCAACGTCA CGACGCGTCA CGACCTGCCC GTGCTCGTGC TCGACTCGCT CGGCAAGGGC
GTCGTCGGCG ACGACGCCCA CGCGGCGGCG GTCGTCGAGC TGCAGCCGCG GGGGGGCACG
ACGAGCCTGA CCGACGAGCC CGCGCTCGTC ACGCGTGCCG GGTACCGCCT GCGCGGGCAG
TCGAGCCGCA TGTTCGACAA GAAGCCGTAC CGCCTCGAGC TGTGGGACGA CGAGGGCGAC
GACCTCGACC AGCCGTTCTT CGGCATGCCC GCGGAGTCCG ACTGGGTGCT GCGCGGGCCG
TTCTCCGACA AGTCGCTCGT GCGCGAGGCG CTCACCCTGG ACCTGGGCCG CGAGCTCGGC
CTGCACGCAC CGCGCCACCG CCTCGTCGAG GTCTACGTGA ACGACGACGC GCAACCGGTC
GCGGCGAACG ACTACCGCGG CGTCTACCTG CTCGAGGAGA CGATCAAGAA CCAGAAGGAC
CGTCTCGACC TCAAGAAGCT CGACCCCGAG GACGTGACGT CGCCGCGCAT CGAGGGTGGC
TACATCATCA AGGCCGAGTG GCTCGCTGCC GAGCAGCCGC TCATCCCCTG CAGGGGCACG
TCGCGCTGCT GGAGCGACCT CGAGGTGCAC GACCCGGACG ACCTGGTGCC CGCACAGCTC
GACTGGATCG CCGGGTACGT CGGCCGCGTG AACGACGCCC TGCACTCGTC GAACCCCGCG
GACCCGCAGA CCGGCTACCC CGCGCTGATC GACGTCGAGT CGTTCGTCGA CCAGGTGATC
GTCAACGAGC TCAGCCGTGA CATGGACGCC TACTTCCGCA GCCAGTACTT CTACAAGGAC
CGCGGCGGGC TGCTCACCGC GGGGCCGCTG TGGGACTTCG ACCTCACGTA CGGCGTGGGC
GGCTTCTTCG GCAACGACCA GGTCTCGGGG TGGCAGTACC AGCAGTCGCG CCAGAGCCCC
GCGCCGCTCG ACTGGTTCTC GGTCCTGATG TCGGACCCCG CCTTCGTCAA CCACGTCAAG
GTGCGCTGGC AGGAGGCGCG CCGCGGACCG CTGTCCGACG CGGCCCTCCG CTCGCGGATC
GACGACCTCA CCGCGCCCCT CGGCGGCGCC GCCGCGCGCA ACTTCCAGCG CTGGCCGAAC
CTCACGACCC GGCAGATCGG CCCGTTCGTC ACGCCGACCG CCGGCACCTG GGAGGGGCAG
GTCGCACACC TCGAGGACTG GCTGCTGCGC CGCGCCGCGT GGCTCGACTC GACCGCCGCG
TGGGGCGGGC CGACGGACCC GCTGCCGACG CCGAGCGCGA CGCCGGCGCC CAGCGCCACA
CCCACGCCGA CGCCGACCCC GACGCCCAGC ACCACGCCCA CGCCCACGCC CACGCCCACG
GTGAGCCCGA CCCCGAGCCC CACGCCGACG CGCAGCGTCA CACCGTCGCC GACGCCCGTC
CAGGGCGGTC AGGGCTGCAC CGCGACACTG CGCACCGTGT CGTCCTGGCC CGGCGGGATC
CAGGGCGAGG TCACGGTGAC CGCGGGCGCC GCGGCGCTGC GCGGGTGGGC CGTGACGCTG
ACGCTGCCCG CGGGCGTCTC CGTCGCGCAG GTGTGGAACG CCGGCCTCAC GGGTTCGTCG
TCGACCGTCA CGGCACGCAA CGTCGACTGG AACGGGACGC TCGGGGCGGG CGCCTCGACG
ACGTTCGGGT TCCTCGGCTC CGTGACCGGG TCGCTCGAGG GCGTCACGCT CGCCTGCACC
GCGGCCTGA
 
Protein sequence
MPGRSDRART AGALAAALAL ATLAPTSAMA APPGTAGAVG EEVGVAFSTT GRVFSGSLQI 
GLSTSVPGAE VRYTTDGTTP TLSSPVASGP LTLTRSTEVR AQAFVAGAPT GEPTSQRYVA
SNVTTRHDLP VLVLDSLGKG VVGDDAHAAA VVELQPRGGT TSLTDEPALV TRAGYRLRGQ
SSRMFDKKPY RLELWDDEGD DLDQPFFGMP AESDWVLRGP FSDKSLVREA LTLDLGRELG
LHAPRHRLVE VYVNDDAQPV AANDYRGVYL LEETIKNQKD RLDLKKLDPE DVTSPRIEGG
YIIKAEWLAA EQPLIPCRGT SRCWSDLEVH DPDDLVPAQL DWIAGYVGRV NDALHSSNPA
DPQTGYPALI DVESFVDQVI VNELSRDMDA YFRSQYFYKD RGGLLTAGPL WDFDLTYGVG
GFFGNDQVSG WQYQQSRQSP APLDWFSVLM SDPAFVNHVK VRWQEARRGP LSDAALRSRI
DDLTAPLGGA AARNFQRWPN LTTRQIGPFV TPTAGTWEGQ VAHLEDWLLR RAAWLDSTAA
WGGPTDPLPT PSATPAPSAT PTPTPTPTPS TTPTPTPTPT VSPTPSPTPT RSVTPSPTPV
QGGQGCTATL RTVSSWPGGI QGEVTVTAGA AALRGWAVTL TLPAGVSVAQ VWNAGLTGSS
STVTARNVDW NGTLGAGAST TFGFLGSVTG SLEGVTLACT AA