Gene Information |
Locus tag | Cfla_0897 |
Symbol | |
ID | 9144771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 979474 |
End bp | 980964 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | capsular exopolysaccharide family |
Protein accession | YP_003636005 |
Protein GI | 296128755 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
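The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the listed gene sequence ends in TGA). The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the stop codon is
    counted in the gene length and is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Cfla_0897).
length = gene_length(979474, 980964)
print(length)                  # 1491
print(protein_length(length))  # 496
```

Under these assumptions the results agree with the Gene Length (1491 bp) and Protein Length (496 aa) fields in this record.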
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.240779 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAGCTGA CCGACCAACT GCGGGCGATC CGCAAGAACT GGTGGATCGT CGCACTGACC GTGCTGACGA CGGTCGGGGC AGCGCTGCTC GTCACCGTGC GCGCGACGCC GGAGTACGAG AGCACCCTGA CCTTCTTCGT CGCGGCGTCG AGCGACACGG GCACCGCGCT GCAGGCCGAC GAGTTCGCGC AGCGCCGCGT CGCCGCCTAC GCCGGCGTCC TGACGAGCGG CCGCCTGGCC GAGCGGATCG CGGCCAACCG GTCGCTCGGC CTGGACTCCA GGGCGATCGC GTCACGCATC TCCGCGACGC CCCAGGAGGA CGCGATCCTG CTCAGCGCCG TGGTCCGGGA CACCGACCCG GCGCGCGCGC AGCAGATCGG CGAGGCGATC GAGGAGGAGC TGGGCCCGCT CGTCCAGGAG CTCGAGCGCA CCGACGCCGC CAGCCGCGCC AACGTCTCGC TCACCGTCAT CTCCGGCCCG ACCGCCCCCA CCTCGCCCGT CTCGCCGCGC CCCGCGCTGA ACGTCGGCGT CGGCCTGCTC GCCGGCCTCG CGCTGGGCGT GTCCCTGGCC GTCGCGCGGC AGGTGCTCGA CCGCACGGTC CGCACCCCGG AGCACGTGCG CGCCAGCACG TCGCTGCCCG TGCTCTCCAC GATCACCGCC CAGCACCCGC GCAGCCGCCG CCGTGCGCGC CCCGACGGCC TGGTGGCCGC GGCCGACCCG GGCTCGCCGC GCGCCGAGGC CTACCGACGG CTGCGCACCA ACCTCACGTT CAGCGCCGCG ACGCACCGCA TGCAGGTCAT CGTCGTCACG TCACCGCTGG CCGGCGAGGG CAAGACGACC ACGAGCTGCA ACCTCGCGAT CGCGCTCGCG GAGAGCGGGC GGCGCGTGCT GCTCGTCGAG GGTGACCTGC GCCGGCCCCG CGTCTCGCGG GCGCTCGGGC TCGAGGGCGC CGTCGGTCTC ACCAACGTGC TCGTGGGGCA GGTCGAGGAG GCCGACGTCA TCCAGCAGTG GGGTCCCCAC GGGCTGTTCG TGCTGCCGGC CGGCACCCTG CCGCCCAACC CGTCCGAGCT GCTCGGCAGC GACAAGATGC GCGCCTTCGT GCAGCGCATG CGGCAGCGCT TCGACGTCGT CATCCTCGAC ACGCCGCCGA CCCTGCCCGT CACCGACGCG ACGATCGCGG CCGCGCACGC CGACAGCGTC GTGCTCGTCG TCCGCTACGG GCACACGACG CGCGACCAGG CCCGCTCGGC TGTCGAGTCG CTGCGCGTCG TCGACGCACC GCTCGCGGGC GTCATCATCA ACGGCGCCCC GCTGCGATCG GCCGGCGTGC CCTACTCGCA CGACGCGGGC CGCCCCCGGT CGGCGGACGT GGCACCCGCC GCGGTCGCGC CCGCCGGTGG CGCCGCGGGC GCGGGCCGCC CGACCGCGGC CGACCCCGCC GCCGGTCTGC CGAACGACGC CCGCCACGTC GGCGCCCACC CGGACCGCTG A
|
Protein sequence | MELTDQLRAI RKNWWIVALT VLTTVGAALL VTVRATPEYE STLTFFVAAS SDTGTALQAD EFAQRRVAAY AGVLTSGRLA ERIAANRSLG LDSRAIASRI SATPQEDAIL LSAVVRDTDP ARAQQIGEAI EEELGPLVQE LERTDAASRA NVSLTVISGP TAPTSPVSPR PALNVGVGLL AGLALGVSLA VARQVLDRTV RTPEHVRAST SLPVLSTITA QHPRSRRRAR PDGLVAAADP GSPRAEAYRR LRTNLTFSAA THRMQVIVVT SPLAGEGKTT TSCNLAIALA ESGRRVLLVE GDLRRPRVSR ALGLEGAVGL TNVLVGQVEE ADVIQQWGPH GLFVLPAGTL PPNPSELLGS DKMRAFVQRM RQRFDVVILD TPPTLPVTDA TIAAAHADSV VLVVRYGHTT RDQARSAVES LRVVDAPLAG VIINGAPLRS AGVPYSHDAG RPRSADVAPA AVAPAGGAAG AGRPTAADPA AGLPNDARHV GAHPDR
|
| |