Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_0441 |
Symbol | |
ID | 8730869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 450417 |
End bp | 451466 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 646501055 |
Product | 4-hydroxy-2-oxovalerate aldolase |
Protein accession | YP_003392252 |
Protein GI | 284041912 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR03217] 4-hydroxy-2-oxovalerate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.216235 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00115827 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCGCCG CGGCTCAGGC GCCCGCGCGG CTCGACGTGC GCGTCACCGA CACCTGCCTG CGCGACGGCT CGCACGCCGT CGGCCACCGC TTCACGCGCG AGCAGGTGCG CGACGTCGTC GCCGCGCTCG ACGCGGCCGG CGTGCCGGTG CTGGAGGTCA CGCACGGCGA CGGGCTCGGC GGCAGCTCGT ACAACTACGG CTTCTCCGGC ACGCCCGAGC GCGAGCTGAT CGCGACCGCG GTCGCGACGG CGAGACGCGC GCGGATCGCC GCGCTGATGC TGCCCGGCGT CGGCACCGCG GACGACATCC GCGCCGTCGC CGACCTCGGC GTCGAGGTGA TCCGCGTCGC GACCCACTGC ACCGAGGCCG ACGTCGCGAT CCAGCACTTC GGGCTCGCGC GCGAGCTGGG GCTGGAGACG GTCGGCTTCC TGATGATGTC CCACTCGCAG CCGCCCGACG TGCTCGCCGC GCAGGCGCGC GTGATGGCCG ACGCCGGCTG CCAGTGCGTC TATGTCGTCG ACTCGGCTGG CGCGCTCGTG CTGGAGCAGG TCGCCGAGCG GGTCGAGGCG GTCGGCGCCG AGCTGGGCGA CGACGCGCAG GTCGGCTTCC ACGGCCACGA GAACCTCGGC CTCGCGATCG CGAACACGGT CGCCGCCGTC CGCGCCGGCG CGGCGCAGGT CGACGGCTGC ACGCGGCGGC TCGGCGCCGG CGCCGGCAAC ACGCCGACGG AGGCGCTGGC AGCCGTCTGC GAGAAGCTCG GGATCGAGAC CGGCCTCGAC GTGCTCGCGC TTGCCGACGC GGCGGAGGAG GTCGTGCGCC CGGCGATGGC GGCCGAGTGC ACGCTCGACC GCGGCGCGCT GCTGCTCGGC TACGCCGGCG TCTACTCGTC GTTCCTCAAG CACGCCGAGC GCTCCGCCGA GCGCTACGGC GTGTCGACCG CTCAGATCCT GCTGGCGTGC GGCGAGCGCC GGCTCGTCGG CGGGCAGGAG GACCAGATCA TCGCGATCGC CGCGGACCTC GCGGCGGCGC GCAAGGAGGA GGCCGCATGA
|
Protein sequence | MSAAAQAPAR LDVRVTDTCL RDGSHAVGHR FTREQVRDVV AALDAAGVPV LEVTHGDGLG GSSYNYGFSG TPERELIATA VATARRARIA ALMLPGVGTA DDIRAVADLG VEVIRVATHC TEADVAIQHF GLARELGLET VGFLMMSHSQ PPDVLAAQAR VMADAGCQCV YVVDSAGALV LEQVAERVEA VGAELGDDAQ VGFHGHENLG LAIANTVAAV RAGAAQVDGC TRRLGAGAGN TPTEALAAVC EKLGIETGLD VLALADAAEE VVRPAMAAEC TLDRGALLLG YAGVYSSFLK HAERSAERYG VSTAQILLAC GERRLVGGQE DQIIAIAADL AAARKEEAA
|
| |