Gene Information |
Locus tag | Mlg_2601 |
Symbol | prpD |
ID | 4269234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2946388 |
End bp | 2947842 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638127360 |
Product | 2-methylcitrate dehydratase |
Protein accession | YP_743431 |
Protein GI | 114321748 |
COG category | [R] General function prediction only |
COG ID | [COG2079] Uncharacterized protein involved in propionate catabolism |
TIGRFAM ID | [TIGR02330] 2-methylcitrate dehydratase |
|
|
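The coordinate, gene length, and protein length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information table.
start_bp, end_bp = 2946388, 2947842
length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1455
print(protein_length_aa(length))  # 484
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields listed in the record.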
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000141042 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0000146252 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCGCCA CCGATCGGGG TATCAAGTCC ACCAAGCGTC CCAAACCGGA CAAAGAGCTG GTCGACATCG CCAAGTACGT CGCCAAGAAG GAGATCAAAT CCGACGAGGC CTACGAGACC GCCCGCTACT GCCTGATGGA CACCCTGGCC TGCGGCATGC TGGCCCTGCA ATACCCCGCC TGCACCAAGC TGCTCGGCCC CGTGGTCCCC GGGGCGCAGA TGGCCGACGG CGCCCGGGTG CCCGGCACCC CCTACCAGCT CGACCCCGTC CAGGCCGCCT TCAACATCGG CGCCATGGTC CGCTGGCTCG ACTTCAACGA CACTTGGCTC GCCGCCGAGT GGGGCCACCC CTCCGACAAC CTGGGGGGCA TCCTCGCCGT GGCCGACTAC CTCAGCCGCC AGCGGCTGGC CCAGGGCAAG AAACCCCTGA CCATGAAGGA CGTGCTCACC GCCATGATCA AGGCCCACGA GATCCAGGGG GTGCTCGCGC TGGAGAACTC CTTCAACCGG GTGGGGCTGG ACCACGTCAT GCTGGTGCGC ATCGCCACCA CCGCCGTCGC CACCGACATG CTGGGCGGCA GCCAGGACGA CATCACCAAC GCCCTCTCCC ACGCCTTCAT CGACGGCTCC GCGCTGCGCA CCTACCGCCA CGCCCCCAAC ACCGGCTCCC GCAAGAGCTG GGCCGCGGGC GATGCCAGTG CCCGCGGCGT GCGCCTGGCG CTGATCGCCA TGACCGGCGA GATGGGCTAT CCCACCGCCC TGTCCGCCGA CTTCTGGGGC TTCAACGACG TGCTCTTCAA GGGCAACAAG CTGGAGATCA ACCAGGCCTA CGACAGCTAC GTGATGGAGA ATATCCTGTT CAAGATCTCC TTCCCCGCCG AATTCCACGC CCAGACCGCC GTGGAGGCCG CCATGACCCT GCACGAGCAG GTGCGTGACC GGCTCGACGA GGTGGAGAAG GTGGTCATCG AGACCCAGGA GGCCGGGGTG CGGATCATCG ACAAGGAGGG CCCGCTGGAC AACCCCGCCG ACCGCGACCA CTGCATCCAG TACATGGTCG CCGTGCCGCT GATCCACGGC CGGCTCACCG CCGACGACTA CGAGGACGCC GTCGCCGCCG ACCCGCGTAT CGACGCCCTG CGCGAAAAGA TGGAGGTGGT GGAGCACAAG CCCTTCTCCA AGGACTACCT GGACCCGAAG AAGCGCTCCA TTGGCAATGC CGTGCAGCTC TTCTTCAAGG ACGGGAGCAA GACCGACCGG GTCGCGGTGG AGTACCCGAT CGGCCACCGC CGCCGGCGCA AGGAGGGTAT CCCGGTGCTG GTGGATAAGT TCGAGAAGGC CCTGGCCACC CGCTTCGCCC CGCGGCGCGC CGAGGCCATT CGCAAGGCCT GTGACAAGCA GAAGGGGCTG GAGCGCATGG CGGTCAACGA GTTCATGGCG CTCTGGACCC TCTGA
|
Protein sequence | MSATDRGIKS TKRPKPDKEL VDIAKYVAKK EIKSDEAYET ARYCLMDTLA CGMLALQYPA CTKLLGPVVP GAQMADGARV PGTPYQLDPV QAAFNIGAMV RWLDFNDTWL AAEWGHPSDN LGGILAVADY LSRQRLAQGK KPLTMKDVLT AMIKAHEIQG VLALENSFNR VGLDHVMLVR IATTAVATDM LGGSQDDITN ALSHAFIDGS ALRTYRHAPN TGSRKSWAAG DASARGVRLA LIAMTGEMGY PTALSADFWG FNDVLFKGNK LEINQAYDSY VMENILFKIS FPAEFHAQTA VEAAMTLHEQ VRDRLDEVEK VVIETQEAGV RIIDKEGPLD NPADRDHCIQ YMVAVPLIHG RLTADDYEDA VAADPRIDAL REKMEVVEHK PFSKDYLDPK KRSIGNAVQL FFKDGSKTDR VAVEYPIGHR RRRKEGIPVL VDKFEKALAT RFAPRRAEAI RKACDKQKGL ERMAVNEFMA LWTL
|
| |
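The sequences above are displayed in space-separated blocks of ten characters, so the spaces must be stripped before any analysis. The sketch below, which is illustrative and not part of the original record, shows one way to do that and to compute a GC fraction. The helper names are hypothetical, and the example input is only the first two display blocks of the gene sequence rather than the full gene.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated display blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two display blocks of the gene sequence, used as a small example.
example = clean_sequence("ATGAGCGCCA CCGATCGGGG")
print(len(example))          # 20
print(gc_fraction(example))  # 0.7
```

Applying the same two helpers to the complete gene sequence string would give the value to compare against the GC content field in the record.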