Gene Cfla_1029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1029 
Symbol 
ID9144904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1138949 
End bp1141327 
Gene Length2379 bp 
Protein Length792 aa 
Translation table11 
GC content78% 
IMG OID 
Productcellulose-binding family II 
Protein accessionYP_003636134 
Protein GI296128884 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.746729 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCGGT TCAGGCGCGG CACGCTCGTC GTGGCGCTCG TCGTCGCGTT CGTGCTGGGA 
GCGGTCCTCG CGACGGCGCT CGGCACCGGC GTCGCCGGCC CCAAGGACCT GGTCGTGCCC
GACGGGTGGC GCACCGCGTG GCGGAGCGCG CACGTGGTCG AGGGCGACGA CGTCGCGCTG
GCGTGGGGGG ACCGTGCCGG TGAGGACCCG ACCGCGGCAC CCGAGGGCCT GAGGTTCGAC
CCCGACGTCG TGCTCGCCCA GCTCGAGGCG CTGCACGCGC TCGACGTCGA CGCGCTCGGG
CTCGGTGCCC CCGGTGGCCC GCTCGCGACG CGCAAGCTCC TCGTGGTCGT CGACGGCACG
TGGAGCGCGG GCCCGGGCGC GACGGCGGAC ATGGCCCCCG TCGTCGGCGG CGGCCTCTCG
GTGGGGGCCG ACGCCGTCCC CCTGACGCAG GGCGCCGTGG TCGACGGCGT GGCGCTGCTG
CGGGTGCGGC CCGAGGTGCT GGCGGGCTCG GTGGGTGACG ACGCGCCGGC GTCGGACGCC
GGTGGCGTGG TGGCCGCGAC GAGGGACGCC ACCCCGCACG CCACCCCGGA CGCCACCGCC
GGCGCGCGGG CGCGTGCCCC GCAGGGCACG CCCTGGGAGC TGGCACGCGG CGTCGCCGAG
ACCCTGCAGC ACCTCACCGA GGCCGCCCAC CCGGGCCACG GCCTGACGCC CGAGGCGGCC
GACGTGCTGC GTCCGGCCGC GTCGGCCTAC CTCGCGACGT ACGCGGTGCG CGGCGAGCAC
GCCGACGTCT CCGACCACGT CCTGGCACCA CAGCTAGCGT GGGGCAGCCC CCGCCACGGG
GCCGCCGGGT GGCTGCTCCT GCAGCACCTG GCCGACCGCG AGTCGCCGAC GCTGGTGCAG
CGGCTCTGGA CCGAGTCCCT CGAGACCGAG CACGTGCTCG CGGCGTACGC GCGGCTCACG
CAGTCGGACG CCTCCGGTCT CAACCGGCGC GTCGCCCAGT ACGCGATGCG CGCCGCCGTG
GCGGACGTGT CCGGCGCGGG CGGCCCCGGG GACCTCCTGG AGCGGCTGGA CCCCGTCCTC
GTCGCCCACC GCACGACGCC CACCGAGGCC GTGCCCGACG ACCCGGGGCA CCACCGTGTC
ACCGGGGCCT TCGCCCCGGC GGCCTACGGC TACACGGTGG TGCGGCTCAC GCCCGACGGA
TCGGGCGCGG ACGTGCGCGT GCGGGTGCGC GGGCACGCCG AGGAGCTCGC CGGCAAGGAC
CCCGGCTGGA GCTTCGGGCT CGTCGCCGTC GGCGCGAGCG GGCCGCGGTA CGGCCCGGTG
ACCGAAGCCG TCGACGGCGA GCTGCGCCTT GCGCTGCACC CGGGCGAGGA CGAGCTGTAC
CTCGTGGTCG CCGCGACGCC GACGCGGGTC GTGGCGCCCA CCGCCGAGGG CTTCGCCCGC
ACGACGCGCT ACCCGTACGA GTTCCGGGTC GCCGGGGCCG CCGTCGCCGA GCCCGACGTC
GCGGACGTCG CAGGCGGGCA CGCCCACCCG AACGGCGGCG GCTGGGTCGC GGACGACGCG
GACGTCGACC CGGCGGCGTA CGTCGCCGCC GGGGCCGTGG TGCGCGGCGA CGCGACGGTC
GGCCCCGGCG TCCGGCTCGA GGGACGCGCG TGGGTCGAGG CCGGTGCGGA GCTCACGGGC
GACGTGGTGG TCCGGGACGC CGCCGTCGTG CGGGGCACGG CGCGCCTGAC CGGTCACGTC
CTCGTCGGCG GTGACGCGGT CGTCGGTTTC GCGTGCGACG CGGGCGCCTA CACGTCCTAC
CGGCCCACCG CGACGTGCGA CCCGGGCGCG GTCGACACCG ACGTCAACAC GGTCGTCATG
CCGTTCGCGC CGAGCGACAC CCGGCTGAGC ACCGCCGCCG CCACCATCGC ACCGTCCCCC
GAGCCGGCAC CCGGGCAGAC TCCCACGCCC GCCCCGGCGT CCGCCGACGC ACCGTCCCCG
TCGGGGCCCG CCACGACGCC CGACACCGTC CCGCCGCCCG CCGCCGCCAC GTCGCCCGGG
GTGGCCGCGC CGCCTGCCGC CGTGCCTGCG GGCGCCTGCA CCGCCTCGTA CCAGGTGGTG
ACCTCGTGGC CCGGCGGGTT GCAGGTCCAG CTCGTCGTCA CCGCGACCAC GTCGGGCGTC
AACGGCTGGG TGCTGACGTG GACGCAGCCG CTCGGCCTGG AGATGGCCGA CTCGTGGGGC
GCGGAGATCA CGCGCAGCGG GCGCACCGTC ACGGCCGAGA ACCTGTCGTG GAACGGGTCG
ATCGCGAACG GCGGCAGCGT GACGCTCGGG TTCAACGCCG CCGCGGAGGG CGAGAGCGCC
CTGGAGGTCC CGCAGGTGCG CTGCGAGCAC ACCGGCTGA
 
Protein sequence
MGRFRRGTLV VALVVAFVLG AVLATALGTG VAGPKDLVVP DGWRTAWRSA HVVEGDDVAL 
AWGDRAGEDP TAAPEGLRFD PDVVLAQLEA LHALDVDALG LGAPGGPLAT RKLLVVVDGT
WSAGPGATAD MAPVVGGGLS VGADAVPLTQ GAVVDGVALL RVRPEVLAGS VGDDAPASDA
GGVVAATRDA TPHATPDATA GARARAPQGT PWELARGVAE TLQHLTEAAH PGHGLTPEAA
DVLRPAASAY LATYAVRGEH ADVSDHVLAP QLAWGSPRHG AAGWLLLQHL ADRESPTLVQ
RLWTESLETE HVLAAYARLT QSDASGLNRR VAQYAMRAAV ADVSGAGGPG DLLERLDPVL
VAHRTTPTEA VPDDPGHHRV TGAFAPAAYG YTVVRLTPDG SGADVRVRVR GHAEELAGKD
PGWSFGLVAV GASGPRYGPV TEAVDGELRL ALHPGEDELY LVVAATPTRV VAPTAEGFAR
TTRYPYEFRV AGAAVAEPDV ADVAGGHAHP NGGGWVADDA DVDPAAYVAA GAVVRGDATV
GPGVRLEGRA WVEAGAELTG DVVVRDAAVV RGTARLTGHV LVGGDAVVGF ACDAGAYTSY
RPTATCDPGA VDTDVNTVVM PFAPSDTRLS TAAATIAPSP EPAPGQTPTP APASADAPSP
SGPATTPDTV PPPAAATSPG VAAPPAAVPA GACTASYQVV TSWPGGLQVQ LVVTATTSGV
NGWVLTWTQP LGLEMADSWG AEITRSGRTV TAENLSWNGS IANGGSVTLG FNAAAEGESA
LEVPQVRCEH TG