Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_0165 |
Symbol | |
ID | 9144031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 197910 |
End bp | 200456 |
Gene Length | 2547 bp |
Protein Length | 848 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | protein of unknown function DUF404 |
Protein accession | YP_003635283 |
Protein GI | 296128033 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.089788 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00349746 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGACC TGCTCTCGTC GCCGCGCCCC GTGGGCGACG GGACGGTCGT GCACCGCCCG GACTGGACGT GGCTGCCGCC CGGCACGGCC CCCACGGAGG CCGACCTCGC GCGCGCCGCC CGCCAGGCCG AGCAGCTCCT CGCCGCGCAC GGGGTCACCT ACGGTGCCGA GGCGGTGGAC GGCGACCACC CCTGGCGCCT GGACCCCGAG CCCGTCGTCG TCGACGAGCC CGAGTGGACG CGCCTCGAGG CCGCGCTCAC GCAGCGTGCG GAGCTGCTCG ACGCGGTCCT GCACGACCTG TACGGCCCGC GTCGCCTGCT CGACGACCGC ATCCTGCCGC CGACGACCGT GCTCGCCCAC CCCGGGTTCC TGCGCGCCGT CGACGGGCTG CGGCTGCCCG GGGGGCGCGA GCTGGTCCTG TCCGCCACCG ACCTCGTGCG CGACCGCGCC GGGGAGTGGT GCGTCGTCGC CGACCGCACC CAGGCGCCGT CGGGCGCCGG GTACGCGATG GAGGACCGGC GCATCACCGC GCAGGTGCTC GCGCCCGTCT ACCGGCAGGC GCCCATCGCG CGGCTCGGGC CGTTCTTCCA CGCGCTGCGC AAGGCCCTGC GCGAGGTCGC GCCGCCCACC GCGGGCGACG AGCCGCGTGC GGTCCTGCTC TCCCCCGGCC CCGCGAGCGA GACGGCGTTC GACCAGGCGT ACCTCGCCTC GATGCTCGGT CTCCCGCTCG TCGAGGGCAG CGACCTCGTG GTGCGGGCGG GCCGCGTGTA CCTGCAGGGC ATCGACGGGC TCGAGCACGT CGACGTCGTG CTGCGGCGCG TCGACGGCGA CTGGTGCGAC CCGCTCGACC TGCGCGCCGG CTCCCGCCTC GGCGTGCCCG GGCTCGTGCA CGCCGTGCGG CAGGGCACCG TGAGCGTCGT CAACCCGCTC GGCACGTCCG TGCTGGACAA CCCCGCGCTG CTCGCGTACC TGCCGCGGCT CGCGCGCGCC GTGCTCGACC AGGACCTCAC GCTGGCGTCC GCGCCCACGT GGTGGTGCGG CGAGGAGCGC GCGCTGCGGC ACGTGCTCGG GCGCCTCGAC CGGCTCGTGC TCAAGCCGGT CGTGCACGGT GCGGAGAGCA CGACCGTGGT CGGCGAGCGG CTGAGCGCGG CCGAGCGCGA GGACCTCGCG GCGCGCATCA CCGCCGAGCC GTGGCGGTGG GTCGGCCAGG AGCGCGTCGG ACCGGAGGAG CCCGGGTCCC GCGCCGCGGT CCTGCGGACC TTCGCGGTCG CGCACGCCGG GTCGTACACC GTGATGTCCG GGGGGCTTGC GCGCGTCGCC GACGACCCCG TGGTCACGTC GTCCGCCCCG GGCGCGGTGG CCAAGGACGT CTGGGTGCTC ACCTCCCGGC CCGCCGCGAC GGGCGCGGTG CTGCGGGAGG ACGACGCCGC GGCCGGTGGC CGCACGCTCG CCTACGGCAT CTCGCCGCGC GCCGCCGAGA ACCTCTACTG GATGGGCCGC TACGCCGAGC GCGCCGAGGA CGGTGTGCGC GTGCTGCGGG CCGTCGCCGA CCGGTGGGAC GACTACCACC GCACGCCCGG CACGGCGGGC GGCCAGGCCC TGGCGGTGCT CCTGCAGGCG CTCACCCCCG CGGCCCTGCC CGACGGCGGC GAGGCCGCGG TCCCCGCGCC CGAGCACGTC GGACCGCGCG TGCCGGCGCT GCGCGACCTG CTGCTCGACC GCCGCACCCC CGGCTCCGTC GCACGTGCCG TGCACCGGCT GCGCACCTCG GCCGCGACCG TGCGCGACCA GCTCTCCACC GACACGTTCG GCCCGATCGC GCGCATCGAG AGCACGCTGC GCGACGAGCG CGCGCGGCTG CGGGCGCGCC GGCAGCCCGA CGCCGGTCTC GCGGCGCCCG CGTCCGTCAC CGCGGGTCTG CGCCCCACGC TCGACGGCGT CCTGGAGAGC CTGCTCGCGA TCTCCGGCAT CGCCGCCGAG GGCCTCACAC GGGACGTCGG GTGGCACCTG CTCGACGCGG GCCGCCGCAT CGAGCGCGCG CAGCGGCTCG TGGCCATGCT CCGGGCGACC CTCGTCGAGC ACCGGCCCGC CGAGGTCGAG GACCTGCTGC TCGAGTCCGT GCTGCTGGCC ACCGAGTCCG CGATCACCTA CCGCCGTCGG CACCAGTCGC GCACCGACGT CGCGCGCGTC CTCGACCTCC TCGTCCACGA CCGCACCAAC CCGCGCTCGC TGGCGTTCGC GCTCGACCGG CTGCTCGCGG ACCTCGAGGC GGTGCCCGCG CCGCGGTCCG CCACGCAGCG CGACCACCTG CTGCACGGCG TCGCCGGGCT CGTCGCCGAG CTCGACACCG TCGTCGTCGG CAACGAGGTG TCCGACGACG GGCGGCGCGT ACGGCTCGCC GACGCGCTGG ACTCGATGCT GTGGCGGCTG CGCGAGGCGT CCGACGAGAT CGAGCGCGTG CACTTCGTGC GGCCCGCGCC GAGCCGGGCG CTCGACGACG TGTGGGGCGC GGGCACGGAC GACCGGACCG GGGAGGAGGA GCGGTGA
|
Protein sequence | MTDLLSSPRP VGDGTVVHRP DWTWLPPGTA PTEADLARAA RQAEQLLAAH GVTYGAEAVD GDHPWRLDPE PVVVDEPEWT RLEAALTQRA ELLDAVLHDL YGPRRLLDDR ILPPTTVLAH PGFLRAVDGL RLPGGRELVL SATDLVRDRA GEWCVVADRT QAPSGAGYAM EDRRITAQVL APVYRQAPIA RLGPFFHALR KALREVAPPT AGDEPRAVLL SPGPASETAF DQAYLASMLG LPLVEGSDLV VRAGRVYLQG IDGLEHVDVV LRRVDGDWCD PLDLRAGSRL GVPGLVHAVR QGTVSVVNPL GTSVLDNPAL LAYLPRLARA VLDQDLTLAS APTWWCGEER ALRHVLGRLD RLVLKPVVHG AESTTVVGER LSAAEREDLA ARITAEPWRW VGQERVGPEE PGSRAAVLRT FAVAHAGSYT VMSGGLARVA DDPVVTSSAP GAVAKDVWVL TSRPAATGAV LREDDAAAGG RTLAYGISPR AAENLYWMGR YAERAEDGVR VLRAVADRWD DYHRTPGTAG GQALAVLLQA LTPAALPDGG EAAVPAPEHV GPRVPALRDL LLDRRTPGSV ARAVHRLRTS AATVRDQLST DTFGPIARIE STLRDERARL RARRQPDAGL AAPASVTAGL RPTLDGVLES LLAISGIAAE GLTRDVGWHL LDAGRRIERA QRLVAMLRAT LVEHRPAEVE DLLLESVLLA TESAITYRRR HQSRTDVARV LDLLVHDRTN PRSLAFALDR LLADLEAVPA PRSATQRDHL LHGVAGLVAE LDTVVVGNEV SDDGRRVRLA DALDSMLWRL REASDEIERV HFVRPAPSRA LDDVWGAGTD DRTGEEER
|
| |