Gene Cfla_1947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1947 
Symbol 
ID9145841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2164458 
End bp2166470 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content73% 
IMG OID 
Productexcinuclease ABC, C subunit 
Protein accessionYP_003637041 
Protein GI296129791 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.122446 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGATC CCGCGACCTA CCGTCCCGCC CCGGGGGAGA TCCCGGACGC CCCGGGCGTC 
TACCGCTTCC GTGACGAGCA CGGCCGCGTG GTCTACGTCG GCAAGGCCAA GAGCCTGCGC
AACCGGCTCA ACAGCTACTT CCAGGATCTC GCCAACCTGC ACCCGCGGAC GCAGCAGATG
GTGACGACGG CCGCCGCCGT CCAGTGGACG GTCGTCGGCA CGGAGGTCGA GGCCCTCGCG
CTCGAGTACT CGTGGATCAA GGAGTTCGAC CCGCGGTTCA ACGTCAAGTA CCGCGACGAC
AAGTCCTACC CCTACCTCGC GGTGACCATG GCCGACGAGG TCCCGCGCGT CCAGGTGATG
CGCGGGGCCA AGCGGCGCGG CACGCGGTAC TTCGGCCCGT ACGCCCACGC GTGGGCCATC
CGCGAGACCG TGGACCTGCT GCTGCGCGTG TTCCCGGTGC GCACGTGCTC CGCCGGGGTG
TTCCGGCGTG CGCGCCAGCA GGGCCGCCCG TGTCTCCTCG GGTACATCGA CAAGTGCTCG
GCGCCGTGCG TGGGACGCAT CGGCATCGAG GAGCACCACC AGCTCGCGCA GGACTTCTGC
GACTTCATGG CGGGGGACAC CGGCCGATTC ACGCGGCGCC TGACCAGGGC GATGAAGGAC
GCTGCCGCCG AGATGGACTA CGAGCGTGCG GCCCGACTGC GCGACGACGT CCGCACCCTC
GAGAAGGCGA CGGAGCGCAA CGCCGTCGTG CTCTCGGACG GCACCGACGC CGACGTGTTC
GCCCTGGCCG GGGACGAGCT CGAGGCCGCC GTCCAGGTGT TCCACGTCCG CGACGGGCGC
ATCCGGGGCC AGCGCGGCTG GGTCGTCGAG AAGGTCGAGG ATCTCGACGA CGCCGAGCTC
GTCGAGCACC TGCTGCAGCA GGTCTACGGC GCCGAGGACC CCGAGGCCGC GACCGCCTCG
GTGCCGCGCG AGGTCCTGGT GCCGGTGCTG CCGCCCGACG TGGAGCAGGT GCAGGCATGG
CTGACCGGCC TGCGCGGCAG CCGCGTCCAG GTGCGCGTCC CGCAGCGCGG CGACAAGCGC
GAGCTCGCCG CGACCGTGCT GCGCAACGCC GAGCACGCGC TCGCGCTGCA CCGCACCCGC
CGCGCCGGTG ACCTCACCAG CCGCAGCATG GCGCTGCGGG AGATCCAGGA GGCCCTCGAC
CTGCCGAGCG CCCCGCTGCG GATCGAGTGC TACGACGTCT CCCACAACCA GGGCACGTAC
CAGTCGGCCT CGATGGTCGT CTTCGAGGAC GGTCTGGCGC GCAAGAGCGA GTACCGGCTG
TTCACCGTGC GCGGCCCCGA GGGGCAGGGT GCTCGCGACG ACACCGCGGC GATGCACGAG
GTCATCACGC GACGCTTCCG GCGCTACCTC GCCGAACGTG CCGACTCGCA CGACCCCGAG
CTGGGCGACG CGGAGGAGGA CGACGCACCG CGCACGGGGC CCGTGGACGA GCGCACGGGT
CGGCCCGCGC GCTTCGCCTA CCCGCCGAAC CTCGTCGTCG TCGACGGCGG CCCCCCGCAG
GTCGCGGCGG CGGCCGCCGC GCTGGCCGAG CTCGGCATCG ACGACGTCGC CCTGTGCGGG
CTCGCCAAGC GGCTCGAGGA GGTCTGGCTG CCGGGGGAGG AGTACCCCGT CATCCTGCGG
CGCGCCTCCG AGGGGCTCTA CCTGCTGCAG CGTGTGCGCG ACGAGGCCCA CCGCTTCGCG
ATCACCGCGC ATCGCAAGCG GCGCAGCAAG GGCATGACCG CCTCCGTGCT CGACGACGTG
CCCGGCCTCG GCCCGGCCCG CAAGGCGGCG CTGCTGCGGC ACTTCGGCTC CGTCAAGAGG
CTGCGTGCGG CGACCGCGGA GGAGATCGCG ACGATCCCGG GCATGGGAGC ACGCACGGCC
GAGGCGGTCG TCGCGGCGCT GGCGGTACCG CCCGCGACGC CCGGCGGGTC CGGTGACACG
CCCGCGCAGG AGCCTGGCAT CCTGGGGGCA TGA
 
Protein sequence
MADPATYRPA PGEIPDAPGV YRFRDEHGRV VYVGKAKSLR NRLNSYFQDL ANLHPRTQQM 
VTTAAAVQWT VVGTEVEALA LEYSWIKEFD PRFNVKYRDD KSYPYLAVTM ADEVPRVQVM
RGAKRRGTRY FGPYAHAWAI RETVDLLLRV FPVRTCSAGV FRRARQQGRP CLLGYIDKCS
APCVGRIGIE EHHQLAQDFC DFMAGDTGRF TRRLTRAMKD AAAEMDYERA ARLRDDVRTL
EKATERNAVV LSDGTDADVF ALAGDELEAA VQVFHVRDGR IRGQRGWVVE KVEDLDDAEL
VEHLLQQVYG AEDPEAATAS VPREVLVPVL PPDVEQVQAW LTGLRGSRVQ VRVPQRGDKR
ELAATVLRNA EHALALHRTR RAGDLTSRSM ALREIQEALD LPSAPLRIEC YDVSHNQGTY
QSASMVVFED GLARKSEYRL FTVRGPEGQG ARDDTAAMHE VITRRFRRYL AERADSHDPE
LGDAEEDDAP RTGPVDERTG RPARFAYPPN LVVVDGGPPQ VAAAAAALAE LGIDDVALCG
LAKRLEEVWL PGEEYPVILR RASEGLYLLQ RVRDEAHRFA ITAHRKRRSK GMTASVLDDV
PGLGPARKAA LLRHFGSVKR LRAATAEEIA TIPGMGARTA EAVVAALAVP PATPGGSGDT
PAQEPGILGA