Gene Cfla_2667 details


Gene Information

Locus tag: Cfla_2667
Symbol:
ID: 9146571
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 2967776
End bp: 2971282
Gene Length: 3507 bp
Protein Length: 1168 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: DNA-directed RNA polymerase, beta subunit
Protein accession: YP_003637753
Protein GI: 296130503
COG category:
COG ID:
TIGRFAM ID:
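
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the source record.

```python
# Coordinates copied from the "Start bp" and "End bp" fields of this record.
start_bp = 2967776
end_bp = 2971282


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3507 1168
```

Both values agree with the Gene Length and Protein Length fields.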


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCTGCCT CGCGCATCCC TACTGCACCG TCCGCCGACG CCATCGCGAA CCGCACCGCA 
TCCCGTCGCA TCTCCTTCGC CAAGATCCAC GAGCCGCTCG AGGTCCCCGA CCTGCTCGGT
CTGCAGACCG AGAGCTTCGA CTGGCTGCTG GGCAACGAGC GCTGGCAGGC CCGCGTGGCC
GCCGCGCTCG AGGTCGGTCG CAACGACGTC CCGGAGACCG CCGGCCTGGA GGAGATCTTC
GAGGAGATCT CCCCGATCGA GGACTTCGGC GGGACGATGT CGCTCTCCTT CCGCGAGCAC
CGCTTCGAGC CGCCCAAGTA CACGGCCGAG GAGTGCAAGG AGAAGGACTT CACCTTCGCC
GCGCCGCTGT TCGTCACGGC CGAGTTCGTC AACTACACGA CCGGCGAGAT CAAGTCGCAG
ACCGTCTTCA TGGGTGACTT CCCCCTGATG ACCGAGCGCG GCACCTTCAT CATCAACGGC
ACCGAGCGCG TCGTCGTCTC GCAGCTCGTC CGCTCGCCCG GCGTGTACTT CGAGCGCACG
GCCGACAAGA CGTCCGACAA GGACGTCCTC ACGGCCAAGG TCATCCCGAG CCGTGGCGCG
TGGCTCGAGT TCGAGATCGA CAAGCGCGAC AACGTCGGCG TGCGCGTCGA CCGCAAGCGC
AAGCAGAACG CCACCGTGCT CCTCAAGGCG CTCGGCATGA CGGAGAGCGA GATCCGCGAG
GAGTTCGCGG AGTACCCCGC CGTCATCGAC ACGCTCGAGA AGGACCACGT CCAGACGCAG
GACGAGGCGC TGCTCGACCT CTACCGCAAG ATCCGCCCGG GTGAGCCGCC GACCGTCGAG
GCCGGCCGTG CGCTGCTCGA GAACTTCTAC TTCAACCCCA AGCGCTACGA CCTCGCCAAG
GTCGGCCGCT ACAAGCTGAA CAAGAAGCTC GGCCAGGACG CGCCGCTGTC CGACTCGGTC
CTCGCGCTCT CGGACGTCGT CGCGACGATC AAGTACCTCG CGGCCCTGCA CATCGACAAG
CCGACGCTGC CCGGCACGCG CGGCGGTCAG GCGGTCGAGA TCCGGGTCGA GCCCGACGAC
ATCGACCACT TCGGCAACCG GCGCATCCGC GCCGTCGGCG AGCTCATCCA GAACCAGGTC
CGCACGGGCC TGTCGCGGAT GGAGCGCGTC GTGCGCGAGC GCATGACGAC GCAGGACGTC
GAGGCCATCA CGCCGCAGAC GCTCATCAAC ATCCGCCCCG TCGTGGCCTC CATCAAGGAG
TTCTTCGGGA CGAGCCAGCT GTCGCAGTTC ATGGACCAGA ACAACCCGCT CGCGGGCCTG
ACGCACAAGC GGCGTCTGTC GGCCCTCGGC CCGGGTGGTC TGTCCCGCGA CCGCGCCGGC
ATGGAGGTCC GTGACGTCCA CACCTCGCAC TACGGCCGCA TGTGCCCGAT CGAGACCCCC
GAGGGGCCGA ACATCGGTCT GATCGGCTCG CTCGCGACGT ACGGACGGAT CAACCCGTTC
GGCTTCGTCG AGACGCCGTA CCGCCGCGTC GTCGACGGCA AGGTCACCGA CGAGGTCGAC
TACCTGACCG CGGACGACGA GGACCGGCAC GTCATCGCCC AGGCGAACGC GCCGCTGAAC
GCGGACGGCT CCTTCACCGA GTCGACCGTC CTCGTGCGGA CCAAGGGCGG CGAGCCGGAC
CTCGTGCCCG GCGCGAACGT CGACTACATG GACGTCTCGC CGCGCCAGAT GGTGTCGGTC
GCCACCGCGC TCATCCCGTT CCTCGAGCAC GACGACGCCA ACCGCGCGCT CATGGGCGCC
AACATGCAGC GCCAGGCGGT GCCGCTGGTC CGTTCCGAGG CGCCGCTCGT CGGTACCGGC
ATGGAGCGTC GTGCGGCCGT CGACGCCGGC GACGTGGTCG TGGCGACCAA GGCCGGCGTG
GTCACCGAGG TGTCGGCCGA CCTGGTCACC GTCGCCAACG ACGACGCGAC CACGTCGACG
TACCGCATCG CGAAGTTCCG CCGCTCGAAC CAGGGCACCT GCTACAACCA GCGCGTGCTG
GTCGAGCACG GCGCCCGCGT GGAGCCCGGC TCGGTGCTCG CGGACGGCCC GGCGACGGAC
GAGGGCGAGC TCGCGCTCGG CCGCAACCTG CTCGTCGCGT TCATGTCGTG GGAGGGCCAC
AACTACGAGG ACGCGATCAT CCTGTCGCAG CGCCTCGTGC AGGACGACGT CCTGTCCTCG
ATCCACATCG AGGAGCACGA GGTCGACGCG CGCGACACCA AGCTCGGCCC CGAGGAGATC
ACGCGGGACA TCCCGAACGT CTCCGAGGAG GTCCTGGGCG ACCTCGACGA GCGCGGGATC
ATCCGCATCG GTGCCGAGGT CGCGGCGGGC GACATCCTCG TCGGCAAGGT CACGCCCAAG
GGCGAGACCG AGCTGACCCC CGAGGAGCGC CTGCTGCGCG CCATCTTCGG CGAGAAGGCG
CGCGAGGTCC GCGACACGTC GCTCAAGGTG CCCCACGGCG AGTCCGGCAC GGTGATCGAG
GTGCGCACGT TCAGCCGCGA CGACGGCGAC GAGCTGCCCG CCGGCGTCAA CGAGCTGGTC
CGCGTGTACA TCGCGCAGCG CCGCAAGATC ACCGACGGCG ACAAGCTCGC CGGCCGTCAC
GGCAACAAGG GCGTCATCTC CAAGATCCTG CCCGTCGAGG ACATGCCGTT CCTCGAGGAC
GGGACGCCGG TCGACGTCGT CCTCAACCCG CTGGGCGTCC CCGGGCGCAT GAACGTCGGC
CAGGTGCTCG AGACCCACCT CGGCTGGGTC GCCAAGCAGG GCTGGGACAT CGAGCTCGCC
GAGGGCGAGG CGACGTGGCG GGACGGCGTG CCTGCCGTCG CGGCGCGCTC GACCCCGGGC
AACCCGGTCG CCACCCCCGT GTTCGACGGT GTGCCCGAGG AGACCCTCAC CGGTCTGCTC
AGCACCACGC TGCCCAACCG GGACGGCGAG CGGACGGTCA AGGGTGACGG CAAGGCGCGG
CTGTTCGACG GACGCTCCGG CGAGCCGTTC CCGGAGCCGG TGTCCGTCGG CTACATGTAC
ATCCTCAAGC TGCACCACCT CGTGGACGAC AAGATCCACG CCCGGTCGAC CGGCCCGTAC
TCGATGATCA CGCAGCAGCC GCTGGGTGGT AAGGCGCAGT TCGGTGGCCA GCGGTTCGGC
GAGATGGAGG TGTGGGCCCT GGAGGCGTAC GGCGCCGCCT ACACGCTGCA GGAGCTGCTC
ACCATCAAGT CCGACGACGT CCCGGGCCGC GTCAAGGTCT ACGAGGCGAT CGTCAAGGGC
GAGAACATCC CGGACTCCGG TATCCCGGAG TCGTTCAAGG TCCTGCTCAA GGAGATGCAG
TCGCTCTGCC TGAACGTCGA GGTGCTGTCG TCCGACGGCG TCTCGATCGA CATGAAGGAG
AACGACGACG AGGTCTACCG CGCCGCGGAA GAGCTCGGCA TCGACCTGTC GCGGCGCCCG
AACGCCAGCA GCATCGAGGA GATCTGA
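
The GC content field can be recomputed from the gene sequence as the fraction of bases that are G or C. The sketch below is a minimal illustration applied only to the first line of the sequence above, where 42 of the 60 bases are G or C. The helper name `gc_fraction` is an assumption made for this example; running it on the full 3507 bp sequence would be the way to compare against the reported 69%.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence shown above (60 bases).
first_line = (
    "TTGGCTGCCT CGCGCATCCC TACTGCACCG "
    "TCCGCCGACG CCATCGCGAA CCGCACCGCA"
)
gc_first_line = gc_fraction(first_line)
print(f"{gc_first_line:.0%}")  # 70%
```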
 
Protein sequence
MAASRIPTAP SADAIANRTA SRRISFAKIH EPLEVPDLLG LQTESFDWLL GNERWQARVA 
AALEVGRNDV PETAGLEEIF EEISPIEDFG GTMSLSFREH RFEPPKYTAE ECKEKDFTFA
APLFVTAEFV NYTTGEIKSQ TVFMGDFPLM TERGTFIING TERVVVSQLV RSPGVYFERT
ADKTSDKDVL TAKVIPSRGA WLEFEIDKRD NVGVRVDRKR KQNATVLLKA LGMTESEIRE
EFAEYPAVID TLEKDHVQTQ DEALLDLYRK IRPGEPPTVE AGRALLENFY FNPKRYDLAK
VGRYKLNKKL GQDAPLSDSV LALSDVVATI KYLAALHIDK PTLPGTRGGQ AVEIRVEPDD
IDHFGNRRIR AVGELIQNQV RTGLSRMERV VRERMTTQDV EAITPQTLIN IRPVVASIKE
FFGTSQLSQF MDQNNPLAGL THKRRLSALG PGGLSRDRAG MEVRDVHTSH YGRMCPIETP
EGPNIGLIGS LATYGRINPF GFVETPYRRV VDGKVTDEVD YLTADDEDRH VIAQANAPLN
ADGSFTESTV LVRTKGGEPD LVPGANVDYM DVSPRQMVSV ATALIPFLEH DDANRALMGA
NMQRQAVPLV RSEAPLVGTG MERRAAVDAG DVVVATKAGV VTEVSADLVT VANDDATTST
YRIAKFRRSN QGTCYNQRVL VEHGARVEPG SVLADGPATD EGELALGRNL LVAFMSWEGH
NYEDAIILSQ RLVQDDVLSS IHIEEHEVDA RDTKLGPEEI TRDIPNVSEE VLGDLDERGI
IRIGAEVAAG DILVGKVTPK GETELTPEER LLRAIFGEKA REVRDTSLKV PHGESGTVIE
VRTFSRDDGD ELPAGVNELV RVYIAQRRKI TDGDKLAGRH GNKGVISKIL PVEDMPFLED
GTPVDVVLNP LGVPGRMNVG QVLETHLGWV AKQGWDIELA EGEATWRDGV PAVAARSTPG
NPVATPVFDG VPEETLTGLL STTLPNRDGE RTVKGDGKAR LFDGRSGEPF PEPVSVGYMY
ILKLHHLVDD KIHARSTGPY SMITQQPLGG KAQFGGQRFG EMEVWALEAY GAAYTLQELL
TIKSDDVPGR VKVYEAIVKG ENIPDSGIPE SFKVLLKEMQ SLCLNVEVLS SDGVSIDMKE
NDDEVYRAAE ELGIDLSRRP NASSIEEI
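
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below translates the first 60 bases and compares the result with the start of the protein. It makes two assumptions: that the codon assignments match the standard genetic code (which translation table 11 shares for elongation codons), and that the first codon, here TTG, is an initiator read as methionine. The names `CODON_TABLE` and `translate_cds` are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the first codon is a start codon and is read as 'M',
    even when it is an alternative start such as TTG or GTG.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = "M" if i == 0 else CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
cds_start = (
    "TTGGCTGCCT CGCGCATCCC TACTGCACCG "
    "TCCGCCGACG CCATCGCGAA CCGCACCGCA"
)
peptide = translate_cds(cds_start)
print(peptide)  # MAASRIPTAPSADAIANRTA
```

The output matches the first 20 residues of the protein sequence listed above (MAASRIPTAP SADAIANRTA).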