Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_2197 |
Symbol | |
ID | 8732640 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 2309585 |
End bp | 2311285 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 646502815 |
Product | phenylacetic acid degradation protein paaN |
Protein accession | YP_003393997 |
Protein GI | 284043657 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02288] phenylacetic acid degradation protein paaN |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTGC CCGCACCTGC AGTGGAGACC AGCGCGGAGC AGCTGGTCGA GCGTCATCGC GCGACGCTCG ACGCGGCCGT CGCCGCGATC GCCGAGCGCG GCTGGTGGTC GCCGTACGCC GAGGTGCCGA AGGCGTACGG GGAGAACGCG CTCGCGGAGG GGAGAGCCGC GTTCGAGGCC TACCTCGGCA CGCCGTTCCC ACTCGACCAG TTGGGCACGC TTGAGCAGGT CGGCGACGAG CGCTCGCCGT ACGGTCCGCG GCTCGGCGTC ACCTACCCGC GTCCCGACGT CGACGTCCTC CTCGCCGCGA TGCGGGAGGC GATCCCCGCC TGGCGCGACG CCGGCGCCGA GCAGCGCGCG GCGGTCGTGC TGGAGATCCT CGCGCGCCTC AACGCGCGTT CGGCCGAGAT CGCCGAGGCG GTCATGCACA CGACCGGCCA GGCGCCGGCG ATGGCGTTCC AGGCCGGTGC TCCGCATGCG CAGGATCGTG CGCTGGAGGC GGTCGCGTGG GCGGTCGCCG AGATGCGGCG GATCCCGTCC GACGCGCTGT GGGTCAAGCC GCAGGGCAAG CGCCCGCCGC TGCGGATGCA CAAGCGCTTC ACGGTCGTCC CGCGCGGCAT CGCGCTCGTG ATCGGCTGCA ACACCTTCCC GACGTGGAAC GGCTACCCGG GCCTGTTCGC CAGCCTCGTC TGCGGCAACC CCGTCGTCGT CAAGCCGAGC AGCCGCGCGA TCCTGCCGCT CGCGATCACC GTCGCGGTCG CCCGCGAGGT GCTCGCCGAG GCCGGCTTCT CGCCCGATCT GGTCGCGCTC GCCGCCGGCC GCGCCGACGA GCGGCTCGCG GCCGAGCTGG CGCTGCGCCC CGAGGTGCGG ATCGTCGACT ACACCGGCTC GACCGGCTTC GGCGAGTGGC TCGAGCGTGA GGCGCGCCAA GCCGCCGTAT TCACCGAGAA GGCGGGCGTC AACACGGTCG TGATCGACTC GACCGACGAC TACGCCGGCA TGCTCGCGAA CCTCGCCTTC ACGCTCTCGC TCTACAGCGG CCAGATGTGC ACGACGACGC AGAACCTGCT CGTGCCGGCC GGTGGGATCG AGACCGACCA GGGGCCGAAG ACGTTCGAGC AGGTCGGCGC CGACCTCGCC GCAGCGATCG ACGGTCTGCT CGGCGACGTC GGCCGCGCGA CGGCGATCCT CGGCGCGATC GTCAGCCCCG CGATCGCCGA GCGGCTCGAG CGCGCCGACA CGCTCGGCGA CGTCGTGCTC GCCTCGCGCC GGATCGAGCA CCCGCAGTTC CCCGACGCGG ACGTCCGCAC GCCGGCGCTC GTCGCCGTCT CCGGGCCGCA GGCGCCCGCG GCGAGCGAGG AGCAGTTCGG CCCGATCGCG CTGCTCGTGC CGACCGGCTC GACCGCGGAG AGCCTCGCGA CGCTGCGGCG GACCGTGCGC GAGCACGGCG CGATCACCGC CGGCGTCTAC AGCACCGACG AGGCGGTCCT CGCGGCGACC GAGGAGGTCG CGCTGGAGGT CGGCGTCGCG CTGTCGGCCA ACCTCACGCA GGGCGTCTAC GTCAACCAGT CGGCGGCGTA CTCGGACTTC CACGGGACGG CGCTGAACCC GGCCGCGAAC GCGTCGCTCG CCGACGCCGC CTTCGTCGCG CCGCGCTTCG GCGTCGTCCA GTCGCGCCGC CACGTGGAGG AGGAGGGGTG A
|
Protein sequence | MTVPAPAVET SAEQLVERHR ATLDAAVAAI AERGWWSPYA EVPKAYGENA LAEGRAAFEA YLGTPFPLDQ LGTLEQVGDE RSPYGPRLGV TYPRPDVDVL LAAMREAIPA WRDAGAEQRA AVVLEILARL NARSAEIAEA VMHTTGQAPA MAFQAGAPHA QDRALEAVAW AVAEMRRIPS DALWVKPQGK RPPLRMHKRF TVVPRGIALV IGCNTFPTWN GYPGLFASLV CGNPVVVKPS SRAILPLAIT VAVAREVLAE AGFSPDLVAL AAGRADERLA AELALRPEVR IVDYTGSTGF GEWLEREARQ AAVFTEKAGV NTVVIDSTDD YAGMLANLAF TLSLYSGQMC TTTQNLLVPA GGIETDQGPK TFEQVGADLA AAIDGLLGDV GRATAILGAI VSPAIAERLE RADTLGDVVL ASRRIEHPQF PDADVRTPAL VAVSGPQAPA ASEEQFGPIA LLVPTGSTAE SLATLRRTVR EHGAITAGVY STDEAVLAAT EEVALEVGVA LSANLTQGVY VNQSAAYSDF HGTALNPAAN ASLADAAFVA PRFGVVQSRR HVEEEG
|
| |