Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_3438 |
Symbol | |
ID | 9147354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 3831374 |
End bp | 3832462 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 80% |
IMG OID | |
Product | Uroporphyrinogen III synthase HEM4 |
Protein accession | YP_003638511 |
Protein GI | 296131261 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000446843 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00117344 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGACCCCCG ACCGGCCGCC CCTCGACGAG CACGTCGCGA CCCCCACCGC CGGCCCCGCG GCGGCGTCGG GCCCCCTCAC CGGGTGGCGC GTCCTGGTCC CCCGCCCACC CCTCGACCCC ACCCCCACCC CTCCCTCCCC CGCGAGCCGT CGATCGACCC CCACCCCCAC CCCTCCCTCC CCCGCGAGCC GTCGATCGAC CCCCACCCCC ACCCCTCCCT CCCCCGCGAG CCGTCGATCG ACCCCCACCC CCACCCCTCC CTCCCCCGCG AGCCGTCGAT CGAGCCCCGC GGGAGGGTTC AGCCCAGCGG CGGTCGCGCT GCTGGCGGCG GGCGGCGAGC CGCTCGTGGT GCCGCTCGTG CGGACCGTGC CCGTCGACGA CCTCTCCCCG CTGGACGACG CCCTGCTCGC CCTCGGCGCC GGGTGGTACT CGTGGCTGAC CGTGACGAGC CAGGCGGCGG TCGCGGTGCT CGCCGAGCAG GCGGCGGCGC ACGACGACGG GCTCGCGGCC CTGGTCGAGC GCGGCGGTGC CCGCGTCGGT GCCGTCGGGC CCGGGACGGC GCGCGCGCTC GAGGGACTCG GCGTCCGGGT CGACGTCGTG CCACCCGTGC GGTCGACGGC CGTGGACCTC GTCGCCGCGC TCGTCGCCGC TGCCCCCGGC ACCCGCCCGG ACGCCGGGCC CCGCACCCTG TTCCCGCGCG GCGACCTCGC GGCCCCCACG CTCGCCGACG GCCTGACCGC CGCCGGCTGG GCCGTCGACG ACCTCGTCGT CTACCGCACG GTCCCGGCCG GTCCGCCCGA TCCCCAGGTC GCCGACGCCT GGGCGGGGGG CGACGTGCAC GCGGCCCTGC TGACGTCCGC GAGCAGCGTC CGCGCCCTGC TCGACCACCT CGGCCCGCCG CCGCCGGCGA CGCGCGTCGT CGTCATCGGC CCCAGCACCG AGACGGAGGC GCGCCGCATG GGTCTGCGCG TCGACGCCGT CGCCGGGCGG CAGACCCTCG CCGGGCTCGT CGACGCGCTC GTCGCGCACG TCGCCGCCAC CGGCGCGCGC CCCGGCTCCC CGCCCACCCC CGTGGAGGAC ACCCCGTGA
|
Protein sequence | MTPDRPPLDE HVATPTAGPA AASGPLTGWR VLVPRPPLDP TPTPPSPASR RSTPTPTPPS PASRRSTPTP TPPSPASRRS TPTPTPPSPA SRRSSPAGGF SPAAVALLAA GGEPLVVPLV RTVPVDDLSP LDDALLALGA GWYSWLTVTS QAAVAVLAEQ AAAHDDGLAA LVERGGARVG AVGPGTARAL EGLGVRVDVV PPVRSTAVDL VAALVAAAPG TRPDAGPRTL FPRGDLAAPT LADGLTAAGW AVDDLVVYRT VPAGPPDPQV ADAWAGGDVH AALLTSASSV RALLDHLGPP PPATRVVVIG PSTETEARRM GLRVDAVAGR QTLAGLVDAL VAHVAATGAR PGSPPTPVED TP
|
| |