Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_5618 |
Symbol | |
ID | 8736094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 6014821 |
End bp | 6016089 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 646506248 |
Product | amidohydrolase |
Protein accession | YP_003397397 |
Protein GI | 284047057 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.313113 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.102085 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGTG GGGCGAGCGG GCGGCGCGTG CTGCGCGGCG TCGCCGTGCT CGACGACAGC GGCCGCTTCG ACGGGCCGCT CGACGTCGCC GTCGAGGACG GGACGATCGC CGCCGTCGCG CCGTCGCTGC CCGGCACCGA CGGCGACGAC GCGCGCGGCC TCTTCCTGAT GCCGGGCGTG ATCGACTGCC ATGTCCACTT CGGCTGGCGC ACGATCGACA CGCTGGAGCT GCTCGGCCAG TCGCTCCAGC GCTGGACGCT GGAGGCCGCC GCCTGTGCGC AGCGGACGCT GGAGGCCGGC GTCACGACCG TGCGCGACGC GGGCGGGATC GATGCCGGCT TCCGCGACGC GCTCGCCGAC GGCGTCGCGC GCGGCCCGGC GGCGCAGGTC GCGGTCGTGC TGCTGAGCCA GACCGGCGGC CACGGCGACG GCTTCCTCGC CGGGCCCGGA CTGGAGGCGA CGGCGCAGTA CCTGACGCCC GACTGGCCCG GCCGGCCGCC GGCGCGCGTC GACGGCCCGG ACGAGATGCG CCGCGCCGTG CGCGCGCTGC TGCGTGCGGG CGCCGACTGG ATCAAGCTCT GCACGACCGG CGGCGTGCTC TCTCCGCACG ACCACCCGAC GCAGCCCGAG CTGAACGACG AGGAGGTTGC CGTCGCGGTC GCGGAGGCCG CCCGCCGCGG CCGCGGCGTG ATGGCGCACG CGAACGGCGG CGAAGGGCTC GACGTCGCGA TCCGCTGCGG CGTGCGCTCG GTCGAGCACG GCATCTGGCT GACGGAGGAG CAGGCTGCCG CGATGGCGGC GCGCGGGACG TGGCTGGTGC CGACGCTGCA GGTGATCCGC GACGCGATCG CGTGGGGCGA CGCCGGCAGG CTGCCGGACT ACGCCGTGCC GAAGGCGGCC GCGCTGCGCG AGCGCTGGGG TGAGGCGGTC CGGACCGCGC GCGCCCACGG CGTCCCGATC GCGCTCGGCT CCGACGCGCT CAGCGCCGGC CAGCACGGCG CGAACCTGGA GGAGGTCGCG CTGCTCGGCG ACGCCGGGAT GGAACCGCAC GAGGCGCTGC TCGCCGCGAC CGCTCGGGGC GCCGAGCTGC TCGGGATCGC GCACACGCAC GGCCGCATCG CGCCCGGCTT CGCGTTCGAC GCGATCGTCT TCGACGAGGA CCCGAGCGAC CTCGCCCTCT TCCGCCGCCG CGACGCGGTG CGCGGCGTCT TCCAGCGCGG GCGGACGATC GTCGCGTGCG AACGGCTCGC ACAGGAGGTG CCGGCATGA
|
Protein sequence | MSGGASGRRV LRGVAVLDDS GRFDGPLDVA VEDGTIAAVA PSLPGTDGDD ARGLFLMPGV IDCHVHFGWR TIDTLELLGQ SLQRWTLEAA ACAQRTLEAG VTTVRDAGGI DAGFRDALAD GVARGPAAQV AVVLLSQTGG HGDGFLAGPG LEATAQYLTP DWPGRPPARV DGPDEMRRAV RALLRAGADW IKLCTTGGVL SPHDHPTQPE LNDEEVAVAV AEAARRGRGV MAHANGGEGL DVAIRCGVRS VEHGIWLTEE QAAAMAARGT WLVPTLQVIR DAIAWGDAGR LPDYAVPKAA ALRERWGEAV RTARAHGVPI ALGSDALSAG QHGANLEEVA LLGDAGMEPH EALLAATARG AELLGIAHTH GRIAPGFAFD AIVFDEDPSD LALFRRRDAV RGVFQRGRTI VACERLAQEV PA
|
| |