Gene Information |
Locus tag | Cwoe_3036 |
Symbol | |
ID | 8733482 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 3242230 |
End bp | 3244101 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646503651 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_003394830 |
Protein GI | 284044490 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
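
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. The record does not state these conventions, but its own numbers are consistent with them. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3242230
end_bp = 3244101

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes the terminal stop codon,
# which is not counted as an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1872
print(gene_length % 3)  # 0 (a whole number of codons)
print(protein_length)   # 623
```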
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0256494 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.08592 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGAC GCGGTGGAGA TCGCACGCCT GAGGAGCGCG AGCGCGCACG TCGCGAGCGC GAGCGGCGCC GCTACGAGCG CGCCGGCGAG CCCGTGCCCG AGCACCTGCT CGAGCCCGAG GAGCGCGCGG CCGCCGCGCC GCCTGAGCCG GTCGCACCGG AGCCTGAGCC GTTCGAAGCG GAGCCTGAGC CGTTCGAAGC GGAGCCCGAT CCGGTCGACG CGGAGCCGTT CGCACACGAG CCGGCGCCGG TCGAGCCGGA GCCCGTCGCC TACGAGCCGC CGCCGGTCGA GCCCGTCGGT CCGGAGCCGG CCGCCTATGA GCCGCCGCTC GAGCCGGTCG AGCCGGAGCC GGCCGCGCCC GTCGAGCCGC CCGCGGCGGA GCGGGGACAG TCGCACGATC CCCAGGAGAC GGTCCAGTGG GACGTCAGCC AGGCGTGGGC CGACGAGCAC CCGCCGCACG CCGACGTCGC CGCTCCCGCG GCCGCAGCGC ACGCGACCCA GGCGCACGAT CACGAGCACG CCGCCGCGGT CGAGCAGCCG CCGCAGGCCG AGCCGCCGCC CGCGCACGAC CCGCAGGCGA CCCAGGCGCA CGACGTGCAG GCCGACTGGG GCGCGGAGTG GACCGACGAG CACGCGGCTG TCGAGCAGCC GTCGGAGGAG CATGAGGCGC CACTCGGCAC CAAGCGCATC AGCGGCAAGG ACAGGATGCA CCTGCCGCAC ATCCACCGGC CGACGCGCGG CGAGCGGACC GGGAAGGGTC GCGGCGTGCG CGTGAAGCGG CCGGGGGAGA CGTTCGGCGG CGGCCCGCGC ACGCGGCGCA GCGTCGGCGG GCGGATCTTC GCAGGCTTGT TCGTGCTGCT CGGGATCGCG CTCGTGTGGT TCCTCGTCTC GCTGTTCCAG CCGTTCGGCG GCGGGGGCGA CGGCAGCGGC AGAGTCGCCG TCACGATCCC GGAGGGCGCG AGCGCCGGTG ACATCGGCAA GCTGCTGGCT AACAGAGGCG TCGTCGACTC CGGCTTCTTC TTCGGCCTGC GGGCGACCGT CTCCGGCGAG CGCAGCAACC TCAAGTCCGG CAGATACACG CTCAGAGAGG ACATGAGCTA CGGCGCGGCG CTCGACGCGC TGACGTCTGA GCCGGAGGTC AGAAGAGTCG CGACCGTCAG CGTCTCGATA CCGGAGGGCC GCAGCCGTCG CGAGACGGCG AGAATCGCCA GGCAGTCCGG CCTCAGGGGC GACTACTTCA CCGCCTCGCG CAGATCGCGC CAGCTCGACC CGCGCAGATA CGGCGCGCCG GCCGGCGCGA CGCTGGAGGG CTTCCTGTTT CCGGCGACAT ACGAGCTGAG ACGCGGCGCG AGAGTCCAGC GGCTCGTCGA CGACCAGCTG AGAGCGTTCA AGCAGAACTT CGCCGGGATC AACCTCAGAT TCGCCAGAAG CAAGCAGCTG ACCGCCTACG ACGTGCTGAC GATCGCCTCG ATGGTCGAGC GCGAGGTCAG CGTCGCGAGA GAGCGGCCGC TCGTCGCCGC CGTGATCTAC AACCGCCTGC GCGACTCGAT CCCGCTTGGG ATCGACGCGA CGCTGCGGTT CGAGCAGAAC GACTGGGTCA ACCCGCTGCG CCAGTCGGTG CTCGACGCCG ACACGCCGTA CAACACCCGC CGCAAGCTCG GCCTGCCGCC CGGGCCGATC GGCAGCCCCG GCCTCGCGTC GATCAGAGCG GCGGCGAACC CGGCCAGAAG CGACGCGCTC TACTACGTCG TCAGACCGGG GACGTGCGGC GAGCACGCGT TCGCGCCGTC CTACGAGCAG 
CACCTGCAGA ACGTCCAGCG CTACGAGCAG GCGCGGCAGG CCGCCGGCGG GAGATCGCCG ACGAGATGCT GA
|
Protein sequence | MSRRGGDRTP EERERARRER ERRRYERAGE PVPEHLLEPE ERAAAAPPEP VAPEPEPFEA EPEPFEAEPD PVDAEPFAHE PAPVEPEPVA YEPPPVEPVG PEPAAYEPPL EPVEPEPAAP VEPPAAERGQ SHDPQETVQW DVSQAWADEH PPHADVAAPA AAAHATQAHD HEHAAAVEQP PQAEPPPAHD PQATQAHDVQ ADWGAEWTDE HAAVEQPSEE HEAPLGTKRI SGKDRMHLPH IHRPTRGERT GKGRGVRVKR PGETFGGGPR TRRSVGGRIF AGLFVLLGIA LVWFLVSLFQ PFGGGGDGSG RVAVTIPEGA SAGDIGKLLA NRGVVDSGFF FGLRATVSGE RSNLKSGRYT LREDMSYGAA LDALTSEPEV RRVATVSVSI PEGRSRRETA RIARQSGLRG DYFTASRRSR QLDPRRYGAP AGATLEGFLF PATYELRRGA RVQRLVDDQL RAFKQNFAGI NLRFARSKQL TAYDVLTIAS MVEREVSVAR ERPLVAAVIY NRLRDSIPLG IDATLRFEQN DWVNPLRQSV LDADTPYNTR RKLGLPPGPI GSPGLASIRA AANPARSDAL YYVVRPGTCG EHAFAPSYEQ HLQNVQRYEQ ARQAAGGRSP TRC
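
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the pipeline that produced the record. It uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it does not handle alternative start codons or ambiguous bases. The helper name `translate` and the variables are hypothetical. Only the first and last few codons of the gene sequence are used as input.

```python
# Codon table in NCBI order: first, second and third base each cycle
# through T, C, A, G. Table 11 assigns the same amino acids as the
# standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Alternative start codons are not treated specially in this sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in the record.
first_codons = "ATGAGCCGAC GCGGTGGAGA TCGCACGCCT"
last_codons = "ACGAGATGCT GA"

print(translate(first_codons))  # MSRRGGDRTP
print(translate(last_codons))   # TRC
```

Both outputs match the start and end of the protein sequence listed above, and the final codon TGA is a stop codon, which is why the protein is one residue shorter than the number of codons.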