Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_2784 |
Symbol | |
ID | 9146692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 3095746 |
End bp | 3097152 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | allantoinase |
Protein accession | YP_003637868 |
Protein GI | 296130618 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.766165 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000087789 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACGACGA CGACGCCGGA CGCGCTGCCC GTCCGCGGCT TCCCGCCCCA CCCCGCACCG GGCGACCTGC TCGCGGCCGT GCGCGGCCGC CAGGTGCTCG TCGACGGCGT GCTGCGGGCG GCGACCGTGC ACGTCGCCGA CGGTCGCGTG GCGGTGGTCG GCCCGTACGA CGACCCCGTC GAGGGCCCGG TCCTCGACGC CCCCGACCAC GCGTACGTCC TGCCCGGGGT GGTGGACTCG CACGTGCACG TCAACGAGCC GGGGCGCACC GCGTGGGAGG GCTTCGCGTC GGCGACGCGC GCGGCGGCCC TCGGCGGCGT CACGACGCTC GTCGACATGC CGCTCAACAG CATCCCGCCC ACGACGACGA CGCGTGCCCT GGCCGCCAAG CGCCGCGCGT CGACCGGTCA GCTCACGGTG GACGTCGCGC TGTGGGGCGG AGCGGTGCCG GGCAACCTCG ACGACCTGCA GCCGCTGTGG GACGCGGGGG TCATGGGCTT CAAGTGCTTC CTGTCCCCGT CGGGCGTCGA CGAGTTCCCG CCGCTGGGGC CGGTGGGCTT CGACGCGGCG CTGCGGACGG TCGCGGCGTT CGACGGCCTG GTGATCGTGC ACGCCGAGGA CCCGGGCGTG CTCGCGGACG CCCCCGTCCG GCCGGGCGGC GTGTACCGGG ACTTCGTGGC GTCGCGCCCG CACGAGGCGG AGACGGCCGC GGTCGCGCGC GTCGTCGCCG GTGCCCGGGA GACCGGCTGC CGTGTGCACG TCCTGCACCT GTCGAGCGCG CGTGCGCTGG ACCTGCTGGC CGACGCGCGC GCGGAGGGTC TGCCGGTGAC GGTCGAGACG TGCCCGCACT ACCTGACGTT CGACGCGGAC GCGATCCCCG ACGGCTCGGC GGCGCACAAG TGCTGCCCGC CGATCCGCGA CGCAGGCAAC CGCGAGGCGC TGTGGGACGG TCTGCGGGAG GGCGTCATCG ACGTCGTCGT CACCGACCAC TCCCCCGCGA CGGCCGACGA GAAGCTCCGC GGCGGCGGTG ACCTGCAGCA GGCGTGGGGC GGTGTCGCGG GCCTGCAGGT GGGTTTCCAG GCGGTCGCGC ACGGCGCCGC GGCCCGCGGG ATCGGCATCG AGGACGTGAG CCGGTGGATG TCGACGGGGA CGGCCGACCT GGTCGGCCTG CCGCACAAGG GCCGGATCGC CGCCGGCGCG GACGCCGACC TGCTCGTGCT CGACCCGCGC GCACCGCTGC ACGTCGACGT GCGCACGCTC GCGCACCGCC ACCCCATCAG CGCGTACGAC GGCGTCGAGC TCTCGGCGTC GGTGACGGCG AGCGTGCTGC GCGGGCGCGT CGTGGCACGT GACGGCGCGG TCACCGCGGG CACGGGTCGG CTGCTGTCCC GCCCTCGGCG GCCGTGA
|
Protein sequence | MTTTTPDALP VRGFPPHPAP GDLLAAVRGR QVLVDGVLRA ATVHVADGRV AVVGPYDDPV EGPVLDAPDH AYVLPGVVDS HVHVNEPGRT AWEGFASATR AAALGGVTTL VDMPLNSIPP TTTTRALAAK RRASTGQLTV DVALWGGAVP GNLDDLQPLW DAGVMGFKCF LSPSGVDEFP PLGPVGFDAA LRTVAAFDGL VIVHAEDPGV LADAPVRPGG VYRDFVASRP HEAETAAVAR VVAGARETGC RVHVLHLSSA RALDLLADAR AEGLPVTVET CPHYLTFDAD AIPDGSAAHK CCPPIRDAGN REALWDGLRE GVIDVVVTDH SPATADEKLR GGGDLQQAWG GVAGLQVGFQ AVAHGAAARG IGIEDVSRWM STGTADLVGL PHKGRIAAGA DADLLVLDPR APLHVDVRTL AHRHPISAYD GVELSASVTA SVLRGRVVAR DGAVTAGTGR LLSRPRRP
|
| |