Gene Information |
Locus tag | Cfla_1775 |
Symbol | |
ID | 9145664 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 1979035 |
End bp | 1980144 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | Uroporphyrinogen III synthase HEM4 |
Protein accession | YP_003636871 |
Protein GI | 296129621 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.488954 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.025641 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGACC AGACGCTGGC CGGGTGCGTC GTGCTCGTCA CGGCGGACCG GCGGGCTGCC GAGCTGCGCG CTGCGCTCGA GCGGCGTGGC GCGACGGTGC GTCACGCGCC CGCGCTCGGG ATGGTGCCGC ACACCGACGA CGCCCTGCTG CTCGCCCGCA CGCGCGACCT GCTCGCGGAC CCGCCGGACA CCGTGGTCGT CACCACGGGC ATCGGCTTCC GCGGGTGGGT CGAGGCAGCC GACGCCGCCG GGATCGCGGA CCGTCTGCTC GAGGTGCTGG CCACGACCCG CATCGTGGCC CGCGGCCCGA AGGCGCGCGG CGCCATCCAG GCCGCCGGGC TGAGCGCCGA CTGGGTGGCG GAGTCCGAGA CGAGCGCCGA GATCGCCGAG GTGCTGCTCG ACGAGGGCGT CGCCGGCCGC GACGTCGTCA TCCAGCACCA CGGCGCCGGC GCGGACGGCC TCGACGAGGC GTTCGCCGCG GCCGGGGCGC GCGTGCGCAG CCTCGTCGTC TACCGGTGGG GTCCGCCGCC CGACCCGGAC CTCGTCCGCG ACTCCGCCCG CGCGGTGGCC GACGGCGAGA TCGACACCGT CGTGTTCACG TCGGCGCCCG GCGCCGCGGC GTGGGTCGCG GCGGCACGCG ACGCGGACGC GCTCGACGGC GTGCTGCGCC GCCACACGTC CGGGGCGGTG GTGTTCGCGG CCGTCGGACC GGTCACCGCC AAGCCGCTGG TCGACGTCGG CATCGACCCG CTCGTGCCGG ACCGGGGGCG GCTCGGCTCG CTCGTGCGCG CCGTCGTGAC GCACTACGGC GGCCTCGAGG CACTCGACAC CGTCGCGGGC CAGCTGCGCG TGCGCCGCGG CGCCGCCGTC CTCGACGGGC GCGTGCTGCC GCTGTCGCGC ACCGGGCTCG AGGTGCTGCG GCTGCTCGCG CACGCGCGCG GCTCGGTCGT CCCGCGTGAC CGCGTCCTCG ACGTGCTGCC GGGGGTATCG TCCGACCCGC ACGCGGCCGA GGTCGCGATC GCGCGGCTGC GGGACGCCAC CGGCAGCCGT GCGCTCATCC GCACGGTGGT CAAGCGCGGC TACCGGCTCG AGCTCCAGGA GCAGGCATGA
|
Protein sequence | MIDQTLAGCV VLVTADRRAA ELRAALERRG ATVRHAPALG MVPHTDDALL LARTRDLLAD PPDTVVVTTG IGFRGWVEAA DAAGIADRLL EVLATTRIVA RGPKARGAIQ AAGLSADWVA ESETSAEIAE VLLDEGVAGR DVVIQHHGAG ADGLDEAFAA AGARVRSLVV YRWGPPPDPD LVRDSARAVA DGEIDTVVFT SAPGAAAWVA AARDADALDG VLRRHTSGAV VFAAVGPVTA KPLVDVGIDP LVPDRGRLGS LVRAVVTHYG GLEALDTVAG QLRVRRGAAV LDGRVLPLSR TGLEVLRLLA HARGSVVPRD RVLDVLPGVS SDPHAAEVAI ARLRDATGSR ALIRTVVKRG YRLELQEQA
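
The derived fields in this record (Gene Length, Protein Length, GC content) can be cross-checked against the coordinates and the sequence. The following is a minimal sketch, not the database's own method. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is unspliced and ends in a single stop codon. The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Spaces and any other characters (such as the block separators used
    in the Gene sequence field) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    length = gene_length(1979035, 1980144)
    print("Gene length (bp):", length)
    print("Protein length (aa):", protein_length(length))
    # Only the first two 10-base blocks of the Gene sequence field are used
    # here; paste the full field to check the whole-gene GC content.
    print("GC% of first 20 bases:", gc_percent("ATGATCGACC AGACGCTGGC"))
```

With the coordinates above, the first two helpers agree with the Gene Length and Protein Length fields of this record.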