Gene Information |
Locus tag | Shewmr4_2044 |
Symbol | |
ID | 4252617 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 2431378 |
End bp | 2433204 |
Gene Length | 1827 bp |
Protein Length | 608 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 638118660 |
Product | phosphogluconate dehydratase |
Protein accession | YP_734174 |
Protein GI | 113970381 |
COG category | [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR01196] 6-phosphogluconate dehydratase |
|
|
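The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal sketch under two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that Gene Length includes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the gene length
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(2431378, 2433204)  # Start bp, End bp from the record
    print(length, protein_length_aa(length))
```

With the values in this record, the computed lengths agree with the listed Gene Length (1827 bp) and Protein Length (608 aa).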
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.747422 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.183886 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACTCAG TCGTTCAATC TGTTACCGAC AGAATTATTG CCCGTAGCAA AGCATCTCGT GAAGCGTATT TAGCCGCGTT AAATGATGCT CGTAACCATG GCGTACACCG TAGCTCCTTA AGCTGCGGTA ACTTAGCCCA CGGTTTTGCT GCTTGTAGTC CAGATGACAA AAATTCATTG CGTCAACTGA CCAAAGCTAA CATCGGGATT ATTACCGCAT TCAACGATAT GTTGTCTGCA CACCAACCCT ATGAAACCTA TCCTGAATTG CTGAAAAAAG CTTGTCAGGA AGTGGGCAGT GTTGCACAGG TCGCAGGTGG TGTACCCGCG ATGTGTGACG GTGTGACTCA AGGTCAACCC GGGATGGAAC TGAGCTTACT GAGTCGTGAA GTGATTGCCA TGGCGACGGC GGTGGGCTTA TCCCACAACA TGTTTGATGG CGCCTTATTA CTGGGTATCT GCGACAAAAT CGTGCCGGGC TTATTGATTG GCGCCTTAAG TTTTGGCCAT TTACCTATGC TGTTTGTGCC TGCAGGCCCA ATGAAGTCGG GGATCCCAAA CAAGGAAAAA GCCCGTATTC GCCAGCAATT TGCCCAAGGT AAAGTCGATA GAGCGCAGCT GCTTGAAGCC GAAGCGCAGT CTTACCACAG CGCTGGTACC TGTACCTTCT ACGGTACGGC CAACTCGAAT CAGCTGATGC TTGAAGTGAT GGGGCTGCAA TTGCCGGGTT CATCATTTGT AAATCCTGAC GATCCACTGC GTGAAGCCCT GAATAAAATG GCGGCCAAGC AAGTGTGCCG CTTAACCGAA CTGGGTACTC AATACAGCCC AATCGGTGAA GTGGTTAACG AGAAATCCGT CGTGAACGGC ATAGTGGCGC TACTGGCAAC GGGTGGTTCA ACTAACTTAA CCATGCACAT TGTGGCGGCG GCGCGTGCGG CCGGCATTAT CGTTAACTGG GATGATTTTT CTGAATTATC TGACGCGGTT CCATTATTGG CACGTGTTTA TCCAAACGGT CATGCGGACA TTAACCACTT CCACGCCGCA GGCGGTATGG CTTTCCTTAT CAAGGAATTA CTCGATGCGG GCCTACTGCA CGAGGATGTC AACACAGTTG CAGGTTTTGG TCTACGTCGT TACACCCAAG AGCCCAAATT ACTCGATGGC GAGGTGCGCT GGGTAGATGG TCCAACCGTC AGCCTAGATA CCGAAGTATT AACGTCTGTC GCTACGCCTT TCCAAAACAA CGGTGGCTTA AAACTGCTTA AGGGCAACTT GGGTCGTGCT GTGATTAAAG TGTCAGCCGT GCAAGAAAAG CACCGTGTAG TTGAAGCGCC AGCCGTGGTG ATTGACGATC AAAACAAACT GGATGCGCTG TTTAAATCCG GCGCATTAGA CCGAGATTGT GTGGTGGTAG TAAAAGGTCA AGGGCCGAAA GCGAACGGTA TGCCAGAGCT GCACAAGTTA ACGCCGCTGT TAGGCTCTTT GCAGGATAAA GGCTTTAAAG TGGCACTGAT GACCGACGGT CGTATGTCAG GCGCATCGGG CAAAGTACCA GCAGCGATTC ACTTAACGCC AGAGGCTATC GATGGCGGGC TGATTGCCAA AGTGCAAGAT GGCGATCTTA TTCGTGTCGA CGCGCTGACC GGTGAGCTGA GCTTATTGGT CTCTGATGCC GAACTTGCCG CGAGAACCGC TACAGAAATC GATTTACGCC ACTCACGCTA TGGAATGGGT CGTGAGTTGT TTGGGGCACT GCGTTCAAAC TTAAGCAGTC CAGAAACCGG TGCGCGCAGT 
ACCAGCGCCA TTGACGAACT TTATTAA
|
Protein sequence | MHSVVQSVTD RIIARSKASR EAYLAALNDA RNHGVHRSSL SCGNLAHGFA ACSPDDKNSL RQLTKANIGI ITAFNDMLSA HQPYETYPEL LKKACQEVGS VAQVAGGVPA MCDGVTQGQP GMELSLLSRE VIAMATAVGL SHNMFDGALL LGICDKIVPG LLIGALSFGH LPMLFVPAGP MKSGIPNKEK ARIRQQFAQG KVDRAQLLEA EAQSYHSAGT CTFYGTANSN QLMLEVMGLQ LPGSSFVNPD DPLREALNKM AAKQVCRLTE LGTQYSPIGE VVNEKSVVNG IVALLATGGS TNLTMHIVAA ARAAGIIVNW DDFSELSDAV PLLARVYPNG HADINHFHAA GGMAFLIKEL LDAGLLHEDV NTVAGFGLRR YTQEPKLLDG EVRWVDGPTV SLDTEVLTSV ATPFQNNGGL KLLKGNLGRA VIKVSAVQEK HRVVEAPAVV IDDQNKLDAL FKSGALDRDC VVVVKGQGPK ANGMPELHKL TPLLGSLQDK GFKVALMTDG RMSGASGKVP AAIHLTPEAI DGGLIAKVQD GDLIRVDALT GELSLLVSDA ELAARTATEI DLRHSRYGMG RELFGALRSN LSSPETGARS TSAIDELY
|
| |
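The sequence fields above are printed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch of how the Gene sequence could be cleaned, its GC content recomputed, and its translation compared with the Protein sequence. All function names are hypothetical. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TAA, despite Strand being "-"), and it relies on translation table 11 assigning the same amino acids to codons as the standard code. Table 11 allows additional start codons, which this sketch does not special-case because the gene here starts with ATG.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(raw: str) -> str:
    """Translate a coding sequence up to, but not including, the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = clean_sequence(raw)
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First three display blocks of the Gene sequence field.
    print(translate_cds("ATGCACTCAG TCGTTCAATC TGTTACCGAC"))
```

Passing the full Gene sequence field to `translate_cds` and comparing the result with `clean_sequence` applied to the Protein sequence field would check the two fields against each other. `gc_percent` on the full gene sequence can be compared with the listed GC content after rounding.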