Gene Information |
Locus tag | Cfla_3394 |
Symbol | |
ID | 9147310 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 3774682 |
End bp | 3777618 |
Gene Length | 2937 bp |
Protein Length | 978 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003638471 |
Protein GI | 296131221 |
COG category | |
COG ID | |
TIGRFAM ID | |
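The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS length includes the terminal stop codon (the gene sequence in this record ends in TAG). The function names are illustrative and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3774682, 3777618)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values can be compared directly with the Gene Length and Protein Length fields.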
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000625659 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCCACGTA CCTCAGCGCC GCCGCGGCTC CGGCTCCTCG GTACCCCGTC CGCGCTGGTC GCCGACGGTG AGGTGGACCT GGGCGCGCCC AAGCAGCGCG CCGTCCTCGT CGCCCTCGCG CTGCGCGCCG GGCAGGCGGT CGGGCACGAC ACCCTCGTCG ACGGGACGTG GGGCGAGGGC GCGCCCGCCA GCGCGCGGGG CAGCCTCCAC ACCTACGTCT CGGGGCTGCG CCGCGTGCTC GGCCCCGACG TGCTGCGCAG CACGCCCACC GGATACCTGC TCGACGTCCC GGCCGCCGCG GTCGACGCGC TCGTCGTCGA GCAGCACGCC CGCCGGGCGC GCGAGGCCCA CGAGGCGTCC GACCTGCACG CCGCGCTGAG CGCGCTCGAC GCGGCGCTCG ACCTGTGGCC GTCGGGCGAC GTGCTCCTCG GGGTCCCGGG CCCGTTCGCG GCCGACCAGC GCACGCGACT GGCCGGGCTG CGCGTGCGGA TGCTCGTCGA GCGCACCGAG GTGGCCGTCG CCGCGGGCGC GGACCCGGCG TCCCTCGCCG ACGCGGCCGA CAGGCTCGCC GCCGAGGTCG CCGGGCACCC GTACGACGAG CGCCTGCGGT GCGCGCTCAT GGCCGCGCTG CACGGCTCCG GGCGCACCGC CGCGGCCCTC GCGCAGTACG ACGACCTGCG TCGGGCCCTG CGCGGCGAGC TCGGCATCGA CCCGGGCGCG GCGACGCGTG CGCTGCACGC GCGGATCCTC GCCGACCCCC GCGAGCGCCC CGCCGCACGG CCCGCGCCCG CGCACCCGGC GGCGTCCACC TGGCCCCCCG GGTCGCTGTC GCTCGGCGCG CCGCGCACGG CGCCCGACGG CCCGGCCCCG ATCCCGCCCC CACGTCCCCG CCCCGCCCCG CCCCCGCGAC CGGCGGTCGA CGCGCACGTG CTCCCCGCCC AGCTGCCGCC CGACCTGACC GCGTTCGTGG GCCGCGCGCG CGAGCTCGTC GAGGTGCTGC GGGCGGCGGG CCCGGACGGG CCGCGGGTCG TGACGGTCGT GGGTGTCGGC GGCGTCGGCA AGACGACGCT CGCGGTGCGG GCCGGCCACA TGCTGCGCGA CCGGTTCGCC GACGGGCAGC TCTACGTGAA CCTGCGCGGC TTCGACCCCC GGCACCCGCC CGTGGACCCG ACCGCGGCGC TGCGGCAGCT GCTCGCGGGC CTCGGTGTGC TGTCGGCACC GCAGCAGCAC GACGAGGTCG TCGCGCTGTG GCGCAGCATG GTGGCGGACC GCCGCCTGCT CGTCGTCCTC GACAACGCCG CGTCGACCGA GCAGGTCGAG GACCTGCTGC CCGGGTCGGC GTCGTGCTTC GTGGTCGTGA CGAGCCGCGA GCGCCTCGGC GGGCTCGCGG TGCGGCACGG CGCGCGCAGC GTGCGGCTCG CGCGCTTCGG GCCGGCGGAG GCCCGTGAGC TGCTGGAGGG TGCGCTCGGC GCCGACCTCG TCGCGCGCGA GACGCACGCC GCGGGCCGGC TGGTCGAGCT GTGCGACGCG CTGCCGTTCG CGCTGCGCAT CGCGGCCGAG CAGGTCCACA CCGGGCGCGG GTCGACGATC GGGGCGATGG TCACGCGGCT GGAGGACTCG CGGCACCGTC TCGACGCGCT CGACCTGGAC GACGGCCCGT CGGCGTCGGT GCGGGGCGTG CTGGCGACGT CGACCGCCGC ACTGGACCCC GAGCAGCTGC GCACGCTGTG CCTGCTCGCG GCCCTGCCGT GCCAGAGCAC GACAGTGCGG GCGACCGCCG CGCTCGTCGA CGTCGAGCCC 
GAGCGCGCGG TGCGCCTGCT GACCGACCTG TGCGAGCACC ACCTGCTCGA GGTCGCCGAC GGCCGGTACG TCATGCACGA CCTGACGCGT GCGCACGCCG CCGAGATCGC CGGCCGGATC CCGGACGACG AGCGCGCCGC GGCCCGCCGC CGCCTGCTCA CCTGGTACGT GTGCGTGCTG GCCGCGAACA CCCACCACCG GCTCCTGCAG TTCGAGCCGC CGACGCCGCG GCACGAGGTG CCGGCGCTGC CCGACGGTGC GGCGCTGCTG CGCTGGACCC TCGCCGAGCT CGCCAACCTC ACGGCGCTGC TCCACGAGGG GCACGCGCAC GGCGACCACG AGCTCGTGTG GCAGGCGGTC GTGCTGATGT TCGAGACGTA CTACGCGGCC GCGGGCTCCA CGGAGTGGCT GGCCGTGCTG CGGGTCGCGG CGCGCTCGGC GCGTGCGCTG GGCGACACGC GCGCGCTCGC GGTGCTGCTG AACCACGAGA GCGTCGCGTG CTCGCGCCTC GGCCGCAACG ACGCGGCCGT CGCGCGCCTG CGCGAGGCGC TCGACCTGCT CGACGGCGAC CGCTGGTGGT ACCGCGTGAG CGTCGTCAGC AACCTCGCCT CGACGCTGCG CGAGGCCAAG GAGTACGACG CGGCGCTCGC GGCCGCGCAC GACGGGCACG CGCTCGCGGT CGAGCTGGGC GACGGCTACT ACCAGGTCGC GTCGGGCGAC GTGCTGTGCG AGCTGTACGC CGAGCTCGGC GACTGGCGGG CGGCGGCGCT GCACGGGGAG CGCGCGCTGG CGGTCGCGCA GGCCGAGGGG CACCAGGTGC TGGAGGCGAA CCTGCTGGTC AACCTGGGTG TCGCGGCCGC CGGGCTGGGG CAGCACGACG CCGCGCAGGA CCGGTTCTCC CAGGCGCTCG CCCTGTGCGC GCAGCTCGGC GACCGGTACC ACGAGGGGCT CGCGCTGTTC GGGCTGGCGC GCCTGCGCGC GGCACGCGAC GGCCCGGCGG GCGAGGCGGC GGCGCGGCGG GACGCGCAGG CCGCGATGGA CCGCTTCCGG CAGCTCGGTG CCGAGGAGGC GGGCTCGGTG GCGATCTTCC TCGCCGGGCT GTCGGTCGGC GTCGACGACG TGCTGCGCCA CGGCTAG
|
Protein sequence | MPRTSAPPRL RLLGTPSALV ADGEVDLGAP KQRAVLVALA LRAGQAVGHD TLVDGTWGEG APASARGSLH TYVSGLRRVL GPDVLRSTPT GYLLDVPAAA VDALVVEQHA RRAREAHEAS DLHAALSALD AALDLWPSGD VLLGVPGPFA ADQRTRLAGL RVRMLVERTE VAVAAGADPA SLADAADRLA AEVAGHPYDE RLRCALMAAL HGSGRTAAAL AQYDDLRRAL RGELGIDPGA ATRALHARIL ADPRERPAAR PAPAHPAAST WPPGSLSLGA PRTAPDGPAP IPPPRPRPAP PPRPAVDAHV LPAQLPPDLT AFVGRARELV EVLRAAGPDG PRVVTVVGVG GVGKTTLAVR AGHMLRDRFA DGQLYVNLRG FDPRHPPVDP TAALRQLLAG LGVLSAPQQH DEVVALWRSM VADRRLLVVL DNAASTEQVE DLLPGSASCF VVVTSRERLG GLAVRHGARS VRLARFGPAE ARELLEGALG ADLVARETHA AGRLVELCDA LPFALRIAAE QVHTGRGSTI GAMVTRLEDS RHRLDALDLD DGPSASVRGV LATSTAALDP EQLRTLCLLA ALPCQSTTVR ATAALVDVEP ERAVRLLTDL CEHHLLEVAD GRYVMHDLTR AHAAEIAGRI PDDERAAARR RLLTWYVCVL AANTHHRLLQ FEPPTPRHEV PALPDGAALL RWTLAELANL TALLHEGHAH GDHELVWQAV VLMFETYYAA AGSTEWLAVL RVAARSARAL GDTRALAVLL NHESVACSRL GRNDAAVARL REALDLLDGD RWWYRVSVVS NLASTLREAK EYDAALAAAH DGHALAVELG DGYYQVASGD VLCELYAELG DWRAAALHGE RALAVAQAEG HQVLEANLLV NLGVAAAGLG QHDAAQDRFS QALALCAQLG DRYHEGLALF GLARLRAARD GPAGEAAARR DAQAAMDRFR QLGAEEAGSV AIFLAGLSVG VDDVLRHG
|
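The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a blocked string might be normalized and its GC fraction computed is given here. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence, so its result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence from this record (illustration only).
first_blocks = "GTGCCACGTA CCTCAGCGCC GCCGCGGCTC"
seq = clean_sequence(first_blocks)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the complete gene sequence string through the same two functions would give a value to compare with the GC content field.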