Gene Information |
Locus tag | Cfla_3154 |
Symbol | |
ID | 9147069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 3504315 |
End bp | 3506177 |
Gene Length | 1863 bp |
Protein Length | 620 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | glycosyl transferase family 2 |
Protein accession | YP_003638235 |
Protein GI | 296130985 |
COG category | |
COG ID | |
TIGRFAM ID | |
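The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the record states neither assumption explicitly, and the helper names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Cfla_3154.
print(gene_length(3504315, 3506177))  # 1863
print(protein_length(1863))           # 620
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields of the record.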
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00273511 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCCACGA CCTTCGACCG ACTGATCCTC GACCCACGCC GTACCGACGA CACCCGCCTG CGCCCGCAGA GCACCCGCTC GCCCGAGAAC GAGGCCACCC AGAGCCCGTC GCTCGTGCTG CTGGTGCTGC TGGCCGCGGC CGGCATCGTC GTGTACGCCG CGTTCCTGCT GAACCCCGCC AACCGGGGTG ACTTCCTGCC GTACGTGCTG GTCATCGTCG CCGAGACCGT CCTGGTCGCC CACGCCCTGC TGGCGATGTG GACCGTGCTC TCGGCCGGCT GGAACCCCCG CGGGTTCACG TTCCATCACT CGCAGGAGCG GCTGTACGAC CTCGCGGAGA TCATCCGCGA CGGCGCCGAG CACGAGCCGT GGCGCTGGCA GATGTACATG GACGACCGCC CCGTCGAGGT CGACGTCTTC ATCACGACGT ACGGCGAGGA CCTCGAGACC ATCCGCCGGA CGGTCACCGC GGCCCTGCGG ATCCAGGGCA GGCACCACAC GTGGGTGCTC GACGACGGCC GCTCCGACGA CGTGCGCGAC CTGGCCGCCG AGCTCGGTGC GCGCTACGTG CGCCGGCTGT CCAGCGGCGG CGCCAAGGCG GGCAACATCA ACCACGCCCT GTCCCTCGCG CGCGGCGACT ACTTCGCGGT GTTCGACGCG GACTTCGTGC CCCGGCCCGG GTTCCTGCAC GAGACCGTGC CGTTCTTCGC GACGCAGGAC GTCGCGTTCG TCCAGACGCC GCAGACGTAC GGCAACTACG ACAACGTCAT CAGCCGCGGC GCCGGTTACA TGCAGGCGGT CTTCTACCGG TTCGTGCAGC CAGGCCGGAA CAGGTTCAAC GCCGCGTTCT GCGTCGGCAC CAACGTCATC TACCGGCGGT CGGCGGTCGA CGCGATCGGT GGCATCTACA CCGACTCCAA GTCGGAGGAC GTGTGGACGT CGCTCATGCT GCACGAGCGC GGCTGGCGCA CGGTCTACAT CCCCACGACG CTCGCGGTCG GCGACACCCC CGAGACCGTC GAGGCGTACA CCAAGCAGCA GCTGCGCTGG GCGACCGGCG GCTTCGAGAT CATGCTCACG CACAACCCGC TGTCCCGGAA GCGCAACCTC ACGATGGACC AGCGCATCCA GTACCTCGTG ACGGCGACGC ACTACCTGAC GGGCATCGCC CCGCTGCTGC TGCTGCTCGT GCCGCCGCTG GAGATCTACT TCGACCTGTC CCCGATGGAC CTGACCATCA CGCCCGCGAC GTGGGCGCTG TACTACGCCG GGTTCTACGT GCTGCAGATC CTGCTCGCGT TCTACACGCT CGGGTCGTTC CGGTGGGAGG TGCTGCTGCT CGCGTCGGTG TCCTTCCCGA TCTACGTGCG TGCGCTGGTC AACGCGGTGC TGCGCCGCGA GCAGGCGTGG CACGTCACAG GACGCAAGGG CGCGTACCGC TCACCGTTCG CGTTCATGGT GCCGCAGGTG CTGTTCTTCC AGTTCCTGCT CCTGACGACC GTGGTCGCCG CGTGGAAGAC GTACACGTCG GGCGTGTTCA CGCTCGCGCT CGCCTGGAAC GCCACCAACA CGGTCATCCT GGGCGGCTTC ATGGTGACCG CGTGGCGCGA GGGGCGCCGC GGACGCGCCG AGGCGCGCGC CCGGCAGCGG GCGCTCGCGA CCCACGACCG AGCCGTCGAC GACCTCGCGG CCGACGCGGT CCTGCTCGAG CTGGAGGACG CCCGTACGCC CGCGGCCCGT CTGCTGGAGC GCGCCGAGCT CCTGCGGGCC GACAGCCCCG CAGCAGACGC GCCCCGCCCC 
GCCGGCGTCG AGCGGCCCGC CGGGCCTGCC GCCGCCCCCG CCGACCGAGA GGTGCAGTCA TGA
|
Protein sequence | MATTFDRLIL DPRRTDDTRL RPQSTRSPEN EATQSPSLVL LVLLAAAGIV VYAAFLLNPA NRGDFLPYVL VIVAETVLVA HALLAMWTVL SAGWNPRGFT FHHSQERLYD LAEIIRDGAE HEPWRWQMYM DDRPVEVDVF ITTYGEDLET IRRTVTAALR IQGRHHTWVL DDGRSDDVRD LAAELGARYV RRLSSGGAKA GNINHALSLA RGDYFAVFDA DFVPRPGFLH ETVPFFATQD VAFVQTPQTY GNYDNVISRG AGYMQAVFYR FVQPGRNRFN AAFCVGTNVI YRRSAVDAIG GIYTDSKSED VWTSLMLHER GWRTVYIPTT LAVGDTPETV EAYTKQQLRW ATGGFEIMLT HNPLSRKRNL TMDQRIQYLV TATHYLTGIA PLLLLLVPPL EIYFDLSPMD LTITPATWAL YYAGFYVLQI LLAFYTLGSF RWEVLLLASV SFPIYVRALV NAVLRREQAW HVTGRKGAYR SPFAFMVPQV LFFQFLLLTT VVAAWKTYTS GVFTLALAWN ATNTVILGGF MVTAWREGRR GRAEARARQR ALATHDRAVD DLAADAVLLE LEDARTPAAR LLERAELLRA DSPAADAPRP AGVERPAGPA AAPADREVQS
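The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11 (bacterial, archaeal and plant plastid code), in which GTG is an accepted alternative start codon that is read as methionine at the initiation position. The sketch below illustrates this on the first 30 bases of the gene sequence only. The `translate` helper is hypothetical, not the database's own tooling, and it assumes the sequence is given 5' to 3' in coding orientation, as it appears in this record.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
# Table 11 assigns the same amino acids as the standard code; it differs in
# which codons may act as start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons recognised by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(seq):
    """Translate a coding sequence, ignoring the spaces used for 10-base grouping."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiating codon is read as methionine regardless of its usual meaning.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
prefix = "GTGGCCACGA CCTTCGACCG ACTGATCCTC"
print(translate(prefix))  # MATTFDRLIL
```

The output matches the first ten residues of the protein sequence listed above.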