Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_1874 |
Symbol | |
ID | 8732315 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 1969591 |
End bp | 1970781 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646502491 |
Product | IMP dehydrogenase family protein |
Protein accession | YP_003393675 |
Protein GI | 284043335 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0516] IMP dehydrogenase/GMP reductase |
TIGRFAM ID | [TIGR01304] IMP dehydrogenase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.31682 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00497597 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGAAATAG AGATCGGCCG CGGTAAGAAA GCTCGTCGCG CATACGGGTT CGATGATGTC GCCATCGTGC CCTCGCGACG CACGCGTGAC CCCGACGACG TCGACATCTC TTGGACGCTC GGCCCGTACC GTTTCGAGCA GCCGCTGCTC GCTTCGGCGC TCGACGGCGT CGTCTCGCCT GAGACGGCCG GGATCATCGG CAGACTCGGT GGTCTTGCCG TCCTCAACCT CGAGGGCATC TTCTGTCGCT ACGAAGACGT CGACGCCCAG CTCGAGAAGA TCGCCAGCCT CCCGCAGGAG GAGGCGACGC GCGAGATGCA GGAGATCTAC CGCGAGCCGG TCAAGCCCGA GCTGATCGCA CAGCGGATCC GCGAGATCAA GGACCAGGGC GTCGTCGTCG CCGCCTCGCT CACGCCGCAG CGCGTCCGCA CGCACTACGA GCTGGCGCTC GAGGCCGGAC TCGACATCCT CGTGATCCAG GGCACGGTCG TCTCCGCCGA GCACGTCTCG ACAGTGAGCG AGCCGCTCAA TCTGAAGGAG TTCATCGGCG AGGTGCCGGT GCCGGTCGTC GTCGGCGGCT GCGCCTCCTA TCACACCGGT CTCCACCTGA TGCGCACCGG CGCGGCCGGC GTGCTCGTCG GCGTCGGACC GGGTGCGATC TGCACCACGC GCGGCGTGCT CGGCATCGGC GTCCCGCAGG CGACGGCGAT CGCCGACGTC GCGGCAGCCC GCTCGCAGCA CATGCTCGAG ACCGGCGACT ACGTCCGCGT GATCGCCGAC GGCGGCATGA AGAACGGCGG CGACGTCGCG AAGGCGATCG CCTGCGGCGC CGACGCGGTC ATGCTCGGCT CCGCGCTCGC CAAGGCCGTC GAGGCCCCGG GCCGCGGCTA CAACTGGGGC ATGGCGACGT TCCACCCGAC GCTCCCGCGC GGCACGCGCG TGAAGACGCC GCAGAACGGC ACGCTCGAGG AGATCGTCAA CGGTCCCGCG CGCGAGAACG ACGGCATGTT CAACCTGATG GGCGCGCTGC GGACGTCGAT GGCGACCTGC GGCTATCGCG ACATCGCGGA GTTCAACCGC GCCGAGCTGA TGGTCGCGCC CGCGCTGCAG ACCGAGGGCA AGAGCCTCCA GCGCGACCAG CAGGTCGGCA TGGGTTCCAA CGGCCGGGCC GTCGCCCTTG TCAACGACTG A
|
Protein sequence | MEIEIGRGKK ARRAYGFDDV AIVPSRRTRD PDDVDISWTL GPYRFEQPLL ASALDGVVSP ETAGIIGRLG GLAVLNLEGI FCRYEDVDAQ LEKIASLPQE EATREMQEIY REPVKPELIA QRIREIKDQG VVVAASLTPQ RVRTHYELAL EAGLDILVIQ GTVVSAEHVS TVSEPLNLKE FIGEVPVPVV VGGCASYHTG LHLMRTGAAG VLVGVGPGAI CTTRGVLGIG VPQATAIADV AAARSQHMLE TGDYVRVIAD GGMKNGGDVA KAIACGADAV MLGSALAKAV EAPGRGYNWG MATFHPTLPR GTRVKTPQNG TLEEIVNGPA RENDGMFNLM GALRTSMATC GYRDIAEFNR AELMVAPALQ TEGKSLQRDQ QVGMGSNGRA VALVND
|
| |