Gene Mlg_2601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2601 
SymbolprpD 
ID4269234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2946388 
End bp2947842 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content67% 
IMG OID638127360 
Product2-methylcitrate dehydratase 
Protein accessionYP_743431 
Protein GI114321748 
COG category[R] General function prediction only 
COG ID[COG2079] Uncharacterized protein involved in propionate catabolism 
TIGRFAM ID[TIGR02330] 2-methylcitrate dehydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000141042 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0000146252 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGCGCCA CCGATCGGGG TATCAAGTCC ACCAAGCGTC CCAAACCGGA CAAAGAGCTG 
GTCGACATCG CCAAGTACGT CGCCAAGAAG GAGATCAAAT CCGACGAGGC CTACGAGACC
GCCCGCTACT GCCTGATGGA CACCCTGGCC TGCGGCATGC TGGCCCTGCA ATACCCCGCC
TGCACCAAGC TGCTCGGCCC CGTGGTCCCC GGGGCGCAGA TGGCCGACGG CGCCCGGGTG
CCCGGCACCC CCTACCAGCT CGACCCCGTC CAGGCCGCCT TCAACATCGG CGCCATGGTC
CGCTGGCTCG ACTTCAACGA CACTTGGCTC GCCGCCGAGT GGGGCCACCC CTCCGACAAC
CTGGGGGGCA TCCTCGCCGT GGCCGACTAC CTCAGCCGCC AGCGGCTGGC CCAGGGCAAG
AAACCCCTGA CCATGAAGGA CGTGCTCACC GCCATGATCA AGGCCCACGA GATCCAGGGG
GTGCTCGCGC TGGAGAACTC CTTCAACCGG GTGGGGCTGG ACCACGTCAT GCTGGTGCGC
ATCGCCACCA CCGCCGTCGC CACCGACATG CTGGGCGGCA GCCAGGACGA CATCACCAAC
GCCCTCTCCC ACGCCTTCAT CGACGGCTCC GCGCTGCGCA CCTACCGCCA CGCCCCCAAC
ACCGGCTCCC GCAAGAGCTG GGCCGCGGGC GATGCCAGTG CCCGCGGCGT GCGCCTGGCG
CTGATCGCCA TGACCGGCGA GATGGGCTAT CCCACCGCCC TGTCCGCCGA CTTCTGGGGC
TTCAACGACG TGCTCTTCAA GGGCAACAAG CTGGAGATCA ACCAGGCCTA CGACAGCTAC
GTGATGGAGA ATATCCTGTT CAAGATCTCC TTCCCCGCCG AATTCCACGC CCAGACCGCC
GTGGAGGCCG CCATGACCCT GCACGAGCAG GTGCGTGACC GGCTCGACGA GGTGGAGAAG
GTGGTCATCG AGACCCAGGA GGCCGGGGTG CGGATCATCG ACAAGGAGGG CCCGCTGGAC
AACCCCGCCG ACCGCGACCA CTGCATCCAG TACATGGTCG CCGTGCCGCT GATCCACGGC
CGGCTCACCG CCGACGACTA CGAGGACGCC GTCGCCGCCG ACCCGCGTAT CGACGCCCTG
CGCGAAAAGA TGGAGGTGGT GGAGCACAAG CCCTTCTCCA AGGACTACCT GGACCCGAAG
AAGCGCTCCA TTGGCAATGC CGTGCAGCTC TTCTTCAAGG ACGGGAGCAA GACCGACCGG
GTCGCGGTGG AGTACCCGAT CGGCCACCGC CGCCGGCGCA AGGAGGGTAT CCCGGTGCTG
GTGGATAAGT TCGAGAAGGC CCTGGCCACC CGCTTCGCCC CGCGGCGCGC CGAGGCCATT
CGCAAGGCCT GTGACAAGCA GAAGGGGCTG GAGCGCATGG CGGTCAACGA GTTCATGGCG
CTCTGGACCC TCTGA
 
Protein sequence
MSATDRGIKS TKRPKPDKEL VDIAKYVAKK EIKSDEAYET ARYCLMDTLA CGMLALQYPA 
CTKLLGPVVP GAQMADGARV PGTPYQLDPV QAAFNIGAMV RWLDFNDTWL AAEWGHPSDN
LGGILAVADY LSRQRLAQGK KPLTMKDVLT AMIKAHEIQG VLALENSFNR VGLDHVMLVR
IATTAVATDM LGGSQDDITN ALSHAFIDGS ALRTYRHAPN TGSRKSWAAG DASARGVRLA
LIAMTGEMGY PTALSADFWG FNDVLFKGNK LEINQAYDSY VMENILFKIS FPAEFHAQTA
VEAAMTLHEQ VRDRLDEVEK VVIETQEAGV RIIDKEGPLD NPADRDHCIQ YMVAVPLIHG
RLTADDYEDA VAADPRIDAL REKMEVVEHK PFSKDYLDPK KRSIGNAVQL FFKDGSKTDR
VAVEYPIGHR RRRKEGIPVL VDKFEKALAT RFAPRRAEAI RKACDKQKGL ERMAVNEFMA
LWTL