Gene Cfla_3394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_3394 
Symbol 
ID9147310 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3774682 
End bp3777618 
Gene Length2937 bp 
Protein Length978 aa 
Translation table11 
GC content78% 
IMG OID 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003638471 
Protein GI296131221 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000625659 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCCACGTA CCTCAGCGCC GCCGCGGCTC CGGCTCCTCG GTACCCCGTC CGCGCTGGTC 
GCCGACGGTG AGGTGGACCT GGGCGCGCCC AAGCAGCGCG CCGTCCTCGT CGCCCTCGCG
CTGCGCGCCG GGCAGGCGGT CGGGCACGAC ACCCTCGTCG ACGGGACGTG GGGCGAGGGC
GCGCCCGCCA GCGCGCGGGG CAGCCTCCAC ACCTACGTCT CGGGGCTGCG CCGCGTGCTC
GGCCCCGACG TGCTGCGCAG CACGCCCACC GGATACCTGC TCGACGTCCC GGCCGCCGCG
GTCGACGCGC TCGTCGTCGA GCAGCACGCC CGCCGGGCGC GCGAGGCCCA CGAGGCGTCC
GACCTGCACG CCGCGCTGAG CGCGCTCGAC GCGGCGCTCG ACCTGTGGCC GTCGGGCGAC
GTGCTCCTCG GGGTCCCGGG CCCGTTCGCG GCCGACCAGC GCACGCGACT GGCCGGGCTG
CGCGTGCGGA TGCTCGTCGA GCGCACCGAG GTGGCCGTCG CCGCGGGCGC GGACCCGGCG
TCCCTCGCCG ACGCGGCCGA CAGGCTCGCC GCCGAGGTCG CCGGGCACCC GTACGACGAG
CGCCTGCGGT GCGCGCTCAT GGCCGCGCTG CACGGCTCCG GGCGCACCGC CGCGGCCCTC
GCGCAGTACG ACGACCTGCG TCGGGCCCTG CGCGGCGAGC TCGGCATCGA CCCGGGCGCG
GCGACGCGTG CGCTGCACGC GCGGATCCTC GCCGACCCCC GCGAGCGCCC CGCCGCACGG
CCCGCGCCCG CGCACCCGGC GGCGTCCACC TGGCCCCCCG GGTCGCTGTC GCTCGGCGCG
CCGCGCACGG CGCCCGACGG CCCGGCCCCG ATCCCGCCCC CACGTCCCCG CCCCGCCCCG
CCCCCGCGAC CGGCGGTCGA CGCGCACGTG CTCCCCGCCC AGCTGCCGCC CGACCTGACC
GCGTTCGTGG GCCGCGCGCG CGAGCTCGTC GAGGTGCTGC GGGCGGCGGG CCCGGACGGG
CCGCGGGTCG TGACGGTCGT GGGTGTCGGC GGCGTCGGCA AGACGACGCT CGCGGTGCGG
GCCGGCCACA TGCTGCGCGA CCGGTTCGCC GACGGGCAGC TCTACGTGAA CCTGCGCGGC
TTCGACCCCC GGCACCCGCC CGTGGACCCG ACCGCGGCGC TGCGGCAGCT GCTCGCGGGC
CTCGGTGTGC TGTCGGCACC GCAGCAGCAC GACGAGGTCG TCGCGCTGTG GCGCAGCATG
GTGGCGGACC GCCGCCTGCT CGTCGTCCTC GACAACGCCG CGTCGACCGA GCAGGTCGAG
GACCTGCTGC CCGGGTCGGC GTCGTGCTTC GTGGTCGTGA CGAGCCGCGA GCGCCTCGGC
GGGCTCGCGG TGCGGCACGG CGCGCGCAGC GTGCGGCTCG CGCGCTTCGG GCCGGCGGAG
GCCCGTGAGC TGCTGGAGGG TGCGCTCGGC GCCGACCTCG TCGCGCGCGA GACGCACGCC
GCGGGCCGGC TGGTCGAGCT GTGCGACGCG CTGCCGTTCG CGCTGCGCAT CGCGGCCGAG
CAGGTCCACA CCGGGCGCGG GTCGACGATC GGGGCGATGG TCACGCGGCT GGAGGACTCG
CGGCACCGTC TCGACGCGCT CGACCTGGAC GACGGCCCGT CGGCGTCGGT GCGGGGCGTG
CTGGCGACGT CGACCGCCGC ACTGGACCCC GAGCAGCTGC GCACGCTGTG CCTGCTCGCG
GCCCTGCCGT GCCAGAGCAC GACAGTGCGG GCGACCGCCG CGCTCGTCGA CGTCGAGCCC
GAGCGCGCGG TGCGCCTGCT GACCGACCTG TGCGAGCACC ACCTGCTCGA GGTCGCCGAC
GGCCGGTACG TCATGCACGA CCTGACGCGT GCGCACGCCG CCGAGATCGC CGGCCGGATC
CCGGACGACG AGCGCGCCGC GGCCCGCCGC CGCCTGCTCA CCTGGTACGT GTGCGTGCTG
GCCGCGAACA CCCACCACCG GCTCCTGCAG TTCGAGCCGC CGACGCCGCG GCACGAGGTG
CCGGCGCTGC CCGACGGTGC GGCGCTGCTG CGCTGGACCC TCGCCGAGCT CGCCAACCTC
ACGGCGCTGC TCCACGAGGG GCACGCGCAC GGCGACCACG AGCTCGTGTG GCAGGCGGTC
GTGCTGATGT TCGAGACGTA CTACGCGGCC GCGGGCTCCA CGGAGTGGCT GGCCGTGCTG
CGGGTCGCGG CGCGCTCGGC GCGTGCGCTG GGCGACACGC GCGCGCTCGC GGTGCTGCTG
AACCACGAGA GCGTCGCGTG CTCGCGCCTC GGCCGCAACG ACGCGGCCGT CGCGCGCCTG
CGCGAGGCGC TCGACCTGCT CGACGGCGAC CGCTGGTGGT ACCGCGTGAG CGTCGTCAGC
AACCTCGCCT CGACGCTGCG CGAGGCCAAG GAGTACGACG CGGCGCTCGC GGCCGCGCAC
GACGGGCACG CGCTCGCGGT CGAGCTGGGC GACGGCTACT ACCAGGTCGC GTCGGGCGAC
GTGCTGTGCG AGCTGTACGC CGAGCTCGGC GACTGGCGGG CGGCGGCGCT GCACGGGGAG
CGCGCGCTGG CGGTCGCGCA GGCCGAGGGG CACCAGGTGC TGGAGGCGAA CCTGCTGGTC
AACCTGGGTG TCGCGGCCGC CGGGCTGGGG CAGCACGACG CCGCGCAGGA CCGGTTCTCC
CAGGCGCTCG CCCTGTGCGC GCAGCTCGGC GACCGGTACC ACGAGGGGCT CGCGCTGTTC
GGGCTGGCGC GCCTGCGCGC GGCACGCGAC GGCCCGGCGG GCGAGGCGGC GGCGCGGCGG
GACGCGCAGG CCGCGATGGA CCGCTTCCGG CAGCTCGGTG CCGAGGAGGC GGGCTCGGTG
GCGATCTTCC TCGCCGGGCT GTCGGTCGGC GTCGACGACG TGCTGCGCCA CGGCTAG
 
Protein sequence
MPRTSAPPRL RLLGTPSALV ADGEVDLGAP KQRAVLVALA LRAGQAVGHD TLVDGTWGEG 
APASARGSLH TYVSGLRRVL GPDVLRSTPT GYLLDVPAAA VDALVVEQHA RRAREAHEAS
DLHAALSALD AALDLWPSGD VLLGVPGPFA ADQRTRLAGL RVRMLVERTE VAVAAGADPA
SLADAADRLA AEVAGHPYDE RLRCALMAAL HGSGRTAAAL AQYDDLRRAL RGELGIDPGA
ATRALHARIL ADPRERPAAR PAPAHPAAST WPPGSLSLGA PRTAPDGPAP IPPPRPRPAP
PPRPAVDAHV LPAQLPPDLT AFVGRARELV EVLRAAGPDG PRVVTVVGVG GVGKTTLAVR
AGHMLRDRFA DGQLYVNLRG FDPRHPPVDP TAALRQLLAG LGVLSAPQQH DEVVALWRSM
VADRRLLVVL DNAASTEQVE DLLPGSASCF VVVTSRERLG GLAVRHGARS VRLARFGPAE
ARELLEGALG ADLVARETHA AGRLVELCDA LPFALRIAAE QVHTGRGSTI GAMVTRLEDS
RHRLDALDLD DGPSASVRGV LATSTAALDP EQLRTLCLLA ALPCQSTTVR ATAALVDVEP
ERAVRLLTDL CEHHLLEVAD GRYVMHDLTR AHAAEIAGRI PDDERAAARR RLLTWYVCVL
AANTHHRLLQ FEPPTPRHEV PALPDGAALL RWTLAELANL TALLHEGHAH GDHELVWQAV
VLMFETYYAA AGSTEWLAVL RVAARSARAL GDTRALAVLL NHESVACSRL GRNDAAVARL
REALDLLDGD RWWYRVSVVS NLASTLREAK EYDAALAAAH DGHALAVELG DGYYQVASGD
VLCELYAELG DWRAAALHGE RALAVAQAEG HQVLEANLLV NLGVAAAGLG QHDAAQDRFS
QALALCAQLG DRYHEGLALF GLARLRAARD GPAGEAAARR DAQAAMDRFR QLGAEEAGSV
AIFLAGLSVG VDDVLRHG