Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_2419 |
Symbol | prpD |
ID | 4026939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 2721566 |
End bp | 2723050 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637967626 |
Product | 2-methylcitrate dehydratase |
Protein accession | YP_574465 |
Protein GI | 92114537 |
COG category | [R] General function prediction only |
COG ID | [COG2079] Uncharacterized protein involved in propionate catabolism |
TIGRFAM ID | [TIGR02330] 2-methylcitrate dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00018845 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCCA ACGTCGAACA GAATCAACGT CCCGATTACG ACATAGAACT GCAGCGGATC GCCGATTACG TGCTGGAATA TCGCGTGGAG AGTCGCGAGG CTCTGGACAC TGCGCGCCAT TGCCTGATGG ATACGCTGGG CTGCGGTTTG CTGGCACTGC GCTTCCCCGA GTGCACCAAG CATCTCGGGC CACTGGTCGA AGGCACGGTG GTGCCGCATG GTGCCCGTGT GCCCGGTACC TCGCTGCGCC TCGATCCGGT CAAGGCCGCA TGGGATATCG GTGCCATCAT TCGCTGGCTG GATTACAACG ACACCTGGCT GGCGGCGGAA TGGGGGCACC CGTCGGACAA CCTGGGCGGC ATTCTGGCGG TGGCGGATCA CTTGTCGCAA AAACGGGTGG CCGTGGGCGA GCCGGCGCTG ACGATGCGTG ACGTGCTCGA CGCCATGGTC ATGGCGCACG AGATTCAGGG CGTGCTGGCC CTGGAGAACT CCTTCAATCG CGTGGGGCTC GATCACGTGG TGCTGGTCAA GGTCGCCTCG ACGGCCGTGG TCGCCAAGTT GATGGGCGCG GATCGCGAGC AGTTGCTGGC CGCGTTGTCG CATGCGTTCG TCGATGGTCA GAGTCTGCGC ACCTATCGTC ATGCACCGAA TGCCGGATCA CGCAAGTCAT GGGCGGCGGG AGACGCGACG TCGCGTGCCG TGCGTCTGGC GGATATCGCC ATGCGTGGCG AAATGGGCAT CCCCGGTGTG CTCAGCGCTC CCCAGTGGGG CTTTTACGAC GTGTCGTTCG CCAAGACTAA CAAGGATCAG CAGTTGAAGC CGGAGGCGGA GCGTCATTTC CGTGTCTCGC AGGCGTACGG GTCCTATGTC ATGGAGAACG TCCTGTTCAA GATCAGCTTC CCCGCGGAGT TCCATGCCCA GACCGCGTGC GAGGCGGCCG TTCTCCTGCA CCCTCAGGTC AAGGATCGCC TGGACGACAT CGACCGCATC GTGATCACGA CGCACGAGTC GGCGATTCGC ATCATCTCCA AGCACGGGCG TCTGGCCAAT CCGGCCGATC GTGACCACTG CCTGCAATAC ATGACGGCCG TGCCCCTGGC GTTCGGACAC CTGCAGGCGG AGCACTACGA AGATGCCTTC CACGAGGCAC ACCCGATCAT CGATCGGCTG CGCGACAAGA TGGAGGTCGT GGAGGACGAA CGCTATACCC GTGAATATCT CGAGGCGGAC AAGCGCTCCA TCGCCAATGC GGTCCAGGTC TTCTTCGTCG ATGGCAGCAG CACCGAACAG GTCGCCGTGG AGTATCCCAT CGGACATCGG CGGCGCCGTG CAGAGGGCAT GCCGCTGCTG GAAGAAAAGT TTCAGGCAAA CCTCGCGACA CGCTTTCCGG CGCGTCGCTG CGACGAGATT CTGGCGTTGT GCACGTCGCA GGAGCGCCTG GAAAGCACGC CCGTCCATCG CTTCGTCGAT CATTTCGTGA TTTGA
|
Protein sequence | MSANVEQNQR PDYDIELQRI ADYVLEYRVE SREALDTARH CLMDTLGCGL LALRFPECTK HLGPLVEGTV VPHGARVPGT SLRLDPVKAA WDIGAIIRWL DYNDTWLAAE WGHPSDNLGG ILAVADHLSQ KRVAVGEPAL TMRDVLDAMV MAHEIQGVLA LENSFNRVGL DHVVLVKVAS TAVVAKLMGA DREQLLAALS HAFVDGQSLR TYRHAPNAGS RKSWAAGDAT SRAVRLADIA MRGEMGIPGV LSAPQWGFYD VSFAKTNKDQ QLKPEAERHF RVSQAYGSYV MENVLFKISF PAEFHAQTAC EAAVLLHPQV KDRLDDIDRI VITTHESAIR IISKHGRLAN PADRDHCLQY MTAVPLAFGH LQAEHYEDAF HEAHPIIDRL RDKMEVVEDE RYTREYLEAD KRSIANAVQV FFVDGSSTEQ VAVEYPIGHR RRRAEGMPLL EEKFQANLAT RFPARRCDEI LALCTSQERL ESTPVHRFVD HFVI
|
| |