Gene Information |
Locus tag | Cfla_0729 |
Symbol | |
ID | 9144600 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 790786 |
End bp | 792366 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
Protein accession | YP_003635839 |
Protein GI | 296128589 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00280675 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00149377 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGCAG AGCACGGGGC CACGACGCAC GCGCGGGCGG CCGCCCGGCG CCGCACGGGA GCACCGCGGG ACGTCCCCGT GACGACGCCC GCCGCCCCAC CCGCACCGGA GGCCCCACCC CCGCACGCGC TGCGGTGGGA CCCCGCCCGC ATGCACACCC GCGGCCCGCG ACGGCCCGTG TGGCTCGTGC GGTTCCACGC GCTGCTCATC GCGAACGACA CCGCCGTGGT GGTCGCCGCG ACCGCCCTCG GCGCGTGGCT GTGGGGCGGC CCGCGGCCCG TGACGTTCTT CGGCGCGCCG GTGCCCACCG TGTCGTGGCT CGCCGCCGTG GTCGCGATCT GGCTGGTCGC GCTCGCGGCC GTGCGCTCGC GGTCCGAGCT GATCCTCGCC GTCGGCGTCA CCGAGCTGCA GCGCGTCCTC AACGCGTCGG TGTTCGCGCT CGCGGCCGTC ATGAGCACCG CGTACCTGGG CGACGCGCAG ATCCCGCGCG GCACGCTCGC CGGAGCGTTC GGCTCGGGTC TGCTCGGGCT CATGGTGACG CGCCTGGCGT GGCGGCACCG GCTCATCGCG TGGCGCGCCG GCGGGCGGTG CAAGCGCAAC GCGCTGCTCG TGGGGCCGCA CCGGGACGTC GTGCGGCTGC TCGGGGACCT GCGGCGCAAC CACCGCGCCG GGTTCCGGGT GGTCGGCATC GCGCTCACCG ACGTCGACCC GGCACCCGAC GCGCGCATCG ACGACGTCGA GACCTTCGGG CTCGAGGAGC TGGTCGACCG CGCGCACCAC CCGCGCGTCA CCAGCGTCGT GCTCGCCGGC GACCTTCCCG GCGGGCGCGC CGCGATCCGC CGGCTCGGGT GGTCCCTGGA GGGCGCCGCG ACCGAGCTCG TCCTGCCGAG CCGCCTGACG TACGTCGCCG GGCCGCGCAT CCACCTGCGG CCCGTCGAGG GCATGCCGCT GGTGCACCTG TCGCTGCCCA CGTACACGGG CGTCGCGCAC GTCGCCAAGC GCGGCGTCGA CGTGGTCGTC GCGTCGCTCG CGCTCGTCGT GCTCCTCCCC GCGCTGCTCG CCGTGGCCGT CGCGATCAAG CTCGACGACG GCGGGCCCGT GCTGTTCCGT CAGGAGCGCG TCGGCAACCG TGAGCAGCTG TTCACCATGT ACAAGTTCCG CACGATGGTG GTCGACGCCG AGGCGCGGCT CGCGGCGCTG CAGGAGCGCA ACCAGGGCGC GGGCGTGCTG TTCAAGATGA CCGACGACCC GCGCGTCACG CGCGTGGGCC GCGTGCTGCG CGCCTGGTCG CTCGACGAGC TGCCGCAGTT CCTCAACGCG CTGCTCGGGA CCATGTCGGT CGTCGGGCCG CGGCCCCCGC TGCCGCGCGA GGTCGCGCTC TACGACGGCG ACGTCCACCG GCGCCTGCTG TCCAAGCCGG GGATCACCGG CCTGTGGCAG GTCAGCGGGC GGTCCGACCT GACGTGGGAG GAGAGCGTGC AGCTCGACCT GTCCTACGTC GAGAACTGGT CGCTGTCCGG GGACCTCATG ATCATCCTGC GCACGTTCCG CAGCGTGCTG GCGCGTGCCG GGGCGTACTG A
|
Protein sequence | MTAEHGATTH ARAAARRRTG APRDVPVTTP AAPPAPEAPP PHALRWDPAR MHTRGPRRPV WLVRFHALLI ANDTAVVVAA TALGAWLWGG PRPVTFFGAP VPTVSWLAAV VAIWLVALAA VRSRSELILA VGVTELQRVL NASVFALAAV MSTAYLGDAQ IPRGTLAGAF GSGLLGLMVT RLAWRHRLIA WRAGGRCKRN ALLVGPHRDV VRLLGDLRRN HRAGFRVVGI ALTDVDPAPD ARIDDVETFG LEELVDRAHH PRVTSVVLAG DLPGGRAAIR RLGWSLEGAA TELVLPSRLT YVAGPRIHLR PVEGMPLVHL SLPTYTGVAH VAKRGVDVVV ASLALVVLLP ALLAVAVAIK LDDGGPVLFR QERVGNREQL FTMYKFRTMV VDAEARLAAL QERNQGAGVL FKMTDDPRVT RVGRVLRAWS LDELPQFLNA LLGTMSVVGP RPPLPREVAL YDGDVHRRLL SKPGITGLWQ VSGRSDLTWE ESVQLDLSYV ENWSLSGDLM IILRTFRSVL ARAGAY
|
| |
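The coordinate, length, and GC fields in the record above are related by simple arithmetic, which the following minimal sketch illustrates. It assumes that the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; neither convention is stated in the record. The helper names (`clean`, `span_length`, `coding_length_aa`, `gc_percent`) are hypothetical and do not come from any database API.

```python
# Values copied from the record above.
START_BP = 790786
END_BP = 792366
GENE_LENGTH_BP = 1581


def clean(seq: str) -> str:
    """Remove the whitespace that splits the sequence into 10-character blocks."""
    return "".join(seq.split()).upper()


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))     # 1581, matches "Gene Length"
    print(coding_length_aa(GENE_LENGTH_BP))  # 526, matches "Protein Length"
```

Passing the full "Gene sequence" field to `gc_percent` would give a value to compare with the rounded "GC content" entry; the blocks of ten bases can be pasted as they appear, since `clean` strips the spaces.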