Gene Information |
Locus tag | Cfla_3099 |
Symbol | |
ID | 9147011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 3442653 |
End bp | 3445169 |
Gene Length | 2517 bp |
Protein Length | 838 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | glycosyl transferase family 51 |
Protein accession | YP_003638181 |
Protein GI | 296130931 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
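The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the gene length includes the terminal stop codon, which is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3442653, 3445169)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

Under these assumptions the two printed values should agree with the Gene Length and Protein Length fields of the record.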
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.113974 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000163958 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCGACAC CTGCACGCGC GCGCGGACGC CGGATCAGCG CCGTCCAGGC GCTCGCCCTG CTGCTCTCCT TCAGTCTCGT CGCCGGGCTC GGCGGTGTCC TCGCCGCCGG CCTCGTCCTG CCCGGCGTGG CCGTGGCCAA CGGCATCACG GGCATGACCG TCACGGCGTT CGACGACCTG CCGAGCGAGC TCGAGCAGCG GCCGCTGCCG GAGAAGTCGG AGATCCTCGC GGCCGACGGC ACGCTGCTCG CGACGTTCTA CGTCCAGAAC CGCATCGTCG TGCCGTTGTC CGAGATCGCG CCGATCATGC AGCAGGCCGT CATCGCGGTC GAGGACCGCC GGTTCTACGA GCACTCCGGC GTCGACCCGG CCGGCATGCT GCGGGCCGCG ATCTCCAGCG CGGCCGGCGC CCAGCAGGGC GCGTCCACGC TCACGCAGCA GTACGTGAAG AACGTCTTCC TCGACGCCGC GGAGCGCGCG GAGGACGACG CCGAGCGCGA CCGGCTGCGT GCCGAGGCCA AGGTCAGCAA GGGCGCCGAG GGCATCGCCC GCAAGCTGCG CGAGGCGAAG ATCGCCATCA CGCTCGAGAA GACGATGACG AAGGACGAGA TCCTCGAGAA GTACCTCAAC ATCGCCGCCT TCGGGGCGTC GGTGTACGGC GTGGAGTCGG CGGCCCGCTA CTACTTCAGC AAGTCCGCGA AGGACCTGAA CTACCTCGAG GCCGCGACGA TCGCGGGCAT CACGCAGTCC CCCAGCGTCT GGGACCCGGT CGGCCGGGCC GACGAGACGC CCGAGCAGGA CGCGGAGCGA TACGCGAACT CCAAGACGCG TCGCGACAAG GTGCTCAGGG ACATGGAGCA GGAGGGCTAC ATCACGCCGG AGGAGCTGGA GACGGGCCTC GCGACGCCCG TGGAGGCCAC CCTCAACGTC TCCAAGCTCA ACCAGGGCTG CATGTCCGCC GACACCACCG TCCCCGGCTC CGGCTTCTTC TGCGACTACG TCACCAAGGT CATCGCGAAC GACCCCGCCT TCGGGGAGAC CGCGCGCGAC CGGACCAACC TGCTCTACAC GGGCGGCCTG ACGATCACGA CGACGCTGGT GCCCGGTGAG CAGGCGATCG CCGACGCCGA GGTCAAGAAC GGCGTCCCCG TGAACGACCC GTCGGGTGTC GCGAGCTCGA TCGTCTCGGT GGAGCCCGGG ACCGGCAAGA TCACCGCGAT GGCGCAGAAC CGCGTCTACT CGGCGCTCAA GGAGCAGAAC CCCGGCGAGA CCGCGGTCAA CTTCAACACG AGCTTCCAGT ACGGCGGGTC GGGCGGGTTC GCCCCCGGCT CGACGTTCAA GGTCTTCACG CTGCTCGAGT GGCTCAAGCG GGGTCACGCG CTCAACGAGA CCGTCAACGG CTCGCGCCTG ACGTACAACA CCAACGAGTT CACGGCCTCG TGCGTCGGGC GGCTGGGCAA CGAGAAGTTC CCGTTCGGCA ACTCCGAGGG CGGCAAGGCG ATCAACCAGT CCGTCATCGA CGCCACGCGC AACTCGGTGA ACTCGGCGTA CATCGCGATG GCGGCGCAGC TCGACCTCTG CGCGATCATG CAGGGCGCCG CCGACCTCGG TGTGACCAAG GCCGGCAACC CCAACCAGAT CAACCCGCTC ACCAACACCC CCATGGGCAA CGTGCCGTTC GACCCGTTTC CCTCCGTCGT GCTCGGCACC GACTCGACCT CGCCGCTGCA GATGGCCGCG GCGTACGCGA CGTTCGCGTC CAACGGCACC CACTGCAAGC CGATCGCGAT CACGCAGGTG 
CTCGACGCCC AGGGCAACGA GCTGCCGATC CCGACCGCCG ACTGCCGGGT CGGCGCGATC GACCCGCGGT ACGCGTCGGC CATGAACTTC GCGCTGAGCA ACGTGTGGAC CGGCACGGCC AAGGACGTGG GCAAGCCGCC GTTCCCGGCC GCCGGCAAGA CGGGGACCAC GACGAAGAAC GAGTACAACT GGTTCGTCGG GTACACGCCG CTGCGCGCGA CGGCCGTCTG GGTCGGCTAC AGCGAGAACA TGCGCACCAT GAACCGCGAG ACGATCAACG GGAAGTACTA CGGGTACGGC CCGTACGGCT CGTCGATCGC CGCCCCCACG TGGAAGCGGT TCATGGTCCA GGTGCTCAAC GGCGCCGACA ACCAGGACTT CGCGAAGCCC GCCGACCGTG AGCTCAACGG CGAGCGCGTG GGCGTGCCGT CCGTGGTCGG CCAGAACGAG CAGCGGGCCC GCGAGATCCT CGAGGGTGCC GGGTTCCGGG TCTCCGTGTC CGGCGAGCAG GTGCCGTCCT CCTACCCGGC CGGGACGGTC GCGGAGCAGT CCGCGACGCT GGCTCCCCGC GGCTCGTCCA TCTCGCTGAA GATCTCGAAC GGCCAGCAGC CGGGCGGTGG CGGGCAGGGT GGGTTCCCGG GCGGCGGCGG CCAGGGAGGG TTCCCGGGTG GCCCACCGGC GCCGCCCAAC CCGGGCGAAG GGCGTCGGGA CCGGTGA
|
Protein sequence | MPTPARARGR RISAVQALAL LLSFSLVAGL GGVLAAGLVL PGVAVANGIT GMTVTAFDDL PSELEQRPLP EKSEILAADG TLLATFYVQN RIVVPLSEIA PIMQQAVIAV EDRRFYEHSG VDPAGMLRAA ISSAAGAQQG ASTLTQQYVK NVFLDAAERA EDDAERDRLR AEAKVSKGAE GIARKLREAK IAITLEKTMT KDEILEKYLN IAAFGASVYG VESAARYYFS KSAKDLNYLE AATIAGITQS PSVWDPVGRA DETPEQDAER YANSKTRRDK VLRDMEQEGY ITPEELETGL ATPVEATLNV SKLNQGCMSA DTTVPGSGFF CDYVTKVIAN DPAFGETARD RTNLLYTGGL TITTTLVPGE QAIADAEVKN GVPVNDPSGV ASSIVSVEPG TGKITAMAQN RVYSALKEQN PGETAVNFNT SFQYGGSGGF APGSTFKVFT LLEWLKRGHA LNETVNGSRL TYNTNEFTAS CVGRLGNEKF PFGNSEGGKA INQSVIDATR NSVNSAYIAM AAQLDLCAIM QGAADLGVTK AGNPNQINPL TNTPMGNVPF DPFPSVVLGT DSTSPLQMAA AYATFASNGT HCKPIAITQV LDAQGNELPI PTADCRVGAI DPRYASAMNF ALSNVWTGTA KDVGKPPFPA AGKTGTTTKN EYNWFVGYTP LRATAVWVGY SENMRTMNRE TINGKYYGYG PYGSSIAAPT WKRFMVQVLN GADNQDFAKP ADRELNGERV GVPSVVGQNE QRAREILEGA GFRVSVSGEQ VPSSYPAGTV AEQSATLAPR GSSISLKISN GQQPGGGGQG GFPGGGGQGG FPGGPPAPPN PGEGRRDR
|
| |
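The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below is a minimal example, not part of the original record, of stripping that formatting and computing the GC fraction of a nucleotide string. The function names are hypothetical, and the demo input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence as printed in the record (demo only).
demo = clean_sequence("ATGCCGACAC CTGCACGCGC")
print(len(demo), gc_fraction(demo))
```

Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field, which the record reports rounded to a whole percent.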