Gene Cfla_3022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_3022 
Symbol 
ID9146934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3350967 
End bp3352928 
Gene Length1962 bp 
Protein Length653 aa 
Translation table11 
GC content75% 
IMG OID 
ProductBeta-galactosidase 
Protein accessionYP_003638104 
Protein GI296130854 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCGGTG ACGCCCGCAT CCGCTACGGC GCCGACTACA ACCCGGAGCA GTGGCCGCAG 
GACGTGTGGC ACGAGGACGT GCGGCTGATG CGCGCCGCGC ACGTCACCAC GGCGACGGTC
GGGGTCTTCT CGTGGTCGCG CCTCGAGCCC GCACCGGGGG AGTACGACCT CGGGTGGCTC
GACGACGTGC TCGCGCGGCT GCACGACGGC GGGGTGCGTG TGGTGCTGGC GACGGCGACC
GCGTCGCCAC CCGCGTGGTT CGTGGCGCGC TACCCGGACG CGCTGCCCGT GACGCAGGAC
GGTGTCCGGC TGGGGTTCGG GTCGCGGCAG CACTACTCGC CGTCGTCACC GGACTACCGG
CGGCACGCGC TCGCCCTCGT CGAGCAGCTC GCGGCCCGCT ACGCGGACCA CCCCGCGCTG
GAGATGTGGC ACGTCAACAA CGAGTACGGC TGCCACGTCG CGCGGTCGTA CGACGCGCAC
ACCGTGGCCG CGTTCCGGGC GTGGCTCGAG ACGCGGTACG GCACGGTCGA CGAGCTCAAC
CGCGCGTGGG GTACGGACTT CTGGTCGCAG CGCTACTCGT CGTTCGACGA GGTGGGCGCG
CCCGCCGCCG CGCCGACGTT CCGCAACCCC ACGCAGGAGC TCGACTTCCG CCGCTTCACG
TCCGACGCGC TGCTCGCGCT GCACCGCGCC GAGTCCGAGG TCATCCGCCG CCACAGCCCC
GACGTGCCGA TCACGACGAA CTTCATGGGC TTCTTCCCCG ACGCGGACTA CTGGGCGTGG
GCGCCCTACG TCGACGTCGT CAGCGACGAC GCCTACCCCG ACCCGGGTGA CCCGCAGGCG
CACGTGCGGC TCGCCGCGCA GCGCGACCTC ATGCGCGGCC TCGGGGAGGG GCGGCCGTGG
CTGCTCATGG AGCAGTCGCC GAGCGCCGTG AACTGGCGGC CGCGCAACGC CCCGCGGCCG
CACGGCCAGC ACCGCGCGCA CTCGCTGCAG GCCGTCGCTC GCGGTGCCGA CGGCATCCTG
CACTTCCAGT GGCGGCAGTC CGCCGCGGGC GCCGAGAAGT TCCACTCGGC GCTGGTGCCG
CACGCCGGCC CGGACTCGCG GGCGTTCCAC GACGTGTGCG CGCTGGGCGA CGAGCTCGCG
CGGCTCGGCG ACCTCGTCGG CGCGCGCGTC GCCGCGCACG TCGCGATCGT CCTCGACTGG
GACTCCTGGT GGGCGCTCGA GCAGGACGCC ACGCCGACGC GCGTGGCGTA CGTCGAGCGG
TTCCTCGACT GGTACGCGCC GTTCCTGCGC CGCGGCGTCA CCGTCGACGT CGTGCCCGCC
GGGGCCGACG TCGTGGGGTA CGACCTGGTC GTCGTGCCCC TGCTGCACGT CGCGCGCGCC
GCGCACCTCG ACGCCCTCGA CGCGTACGTC CGCGCCGGCG GGAACCTCGT CGTGACGTAC
GCGACGGCGG TGCTGGACGA GGACCTGCAC GTCTACCTCG GCGGCTACCT GGGCCCGCTG
CGCGCGACGC TCGGTGTGCG CGTCGAGGAG CTCGCACCCA CCGCGGGACC CGACGGGTCG
CCCGGTGGTC CGCTGCGCCT GACCGGCGAG CTCGCCGGCG AGGCGTCGCT GTGGCAGGAC
GTCCTCGTCG TCGACGACGC CGAGGTGGTC GCCACGTTCG ACGACGGGTA CGCCGCGGGC
GGACCGGCCG TGACCCGGCG CGAGCACGGG GACGGCGTCG CCTGGTACGT CGGCACGCAG
CCGTCCGCGC GCGTCCTCGA CGCGCTCGTC GACCGCCTGC TCACCGACGC CGACGTTCCC
GCGCTCTTCC CCTCACCGGT CGAGGGCGTC GAGGCCGTGC GGCGCGGGGA CCGGCTCGTC
GTCGTCAACC ACACCGGCGC GCCGCGCACG CTGGCGCTCG CGGGGCGCGT GCTGCACCTC
GGGCCGCACG ACGCGCAGGT GCTCACGGAC CTGCCGGCCT GA
 
Protein sequence
MPGDARIRYG ADYNPEQWPQ DVWHEDVRLM RAAHVTTATV GVFSWSRLEP APGEYDLGWL 
DDVLARLHDG GVRVVLATAT ASPPAWFVAR YPDALPVTQD GVRLGFGSRQ HYSPSSPDYR
RHALALVEQL AARYADHPAL EMWHVNNEYG CHVARSYDAH TVAAFRAWLE TRYGTVDELN
RAWGTDFWSQ RYSSFDEVGA PAAAPTFRNP TQELDFRRFT SDALLALHRA ESEVIRRHSP
DVPITTNFMG FFPDADYWAW APYVDVVSDD AYPDPGDPQA HVRLAAQRDL MRGLGEGRPW
LLMEQSPSAV NWRPRNAPRP HGQHRAHSLQ AVARGADGIL HFQWRQSAAG AEKFHSALVP
HAGPDSRAFH DVCALGDELA RLGDLVGARV AAHVAIVLDW DSWWALEQDA TPTRVAYVER
FLDWYAPFLR RGVTVDVVPA GADVVGYDLV VVPLLHVARA AHLDALDAYV RAGGNLVVTY
ATAVLDEDLH VYLGGYLGPL RATLGVRVEE LAPTAGPDGS PGGPLRLTGE LAGEASLWQD
VLVVDDAEVV ATFDDGYAAG GPAVTRREHG DGVAWYVGTQ PSARVLDALV DRLLTDADVP
ALFPSPVEGV EAVRRGDRLV VVNHTGAPRT LALAGRVLHL GPHDAQVLTD LPA