Gene Shewmr4_2044 details

Gene Information

Locus tag: Shewmr4_2044
Symbol:
ID: 4252617
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 2431378
End bp: 2433204
Gene Length: 1827 bp
Protein Length: 608 aa
Translation table: 11
GC content: 51%
IMG OID: 638118660
Product: phosphogluconate dehydratase
Protein accession: YP_734174
Protein GI: 113970381
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR01196] 6-phosphogluconate dehydratase
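
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a coding sequence that ends in a single stop codon; the record itself does not state either convention. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2431378
END_BP = 2433204
GENE_LENGTH_BP = 1827
PROTEIN_LENGTH_AA = 608


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS assumed to end in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions the reported gene length (1827 bp) matches the coordinate span, and 1827 / 3 - 1 gives the reported 608 aa.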


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.747422
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.183886
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACTCAG TCGTTCAATC TGTTACCGAC AGAATTATTG CCCGTAGCAA AGCATCTCGT 
GAAGCGTATT TAGCCGCGTT AAATGATGCT CGTAACCATG GCGTACACCG TAGCTCCTTA
AGCTGCGGTA ACTTAGCCCA CGGTTTTGCT GCTTGTAGTC CAGATGACAA AAATTCATTG
CGTCAACTGA CCAAAGCTAA CATCGGGATT ATTACCGCAT TCAACGATAT GTTGTCTGCA
CACCAACCCT ATGAAACCTA TCCTGAATTG CTGAAAAAAG CTTGTCAGGA AGTGGGCAGT
GTTGCACAGG TCGCAGGTGG TGTACCCGCG ATGTGTGACG GTGTGACTCA AGGTCAACCC
GGGATGGAAC TGAGCTTACT GAGTCGTGAA GTGATTGCCA TGGCGACGGC GGTGGGCTTA
TCCCACAACA TGTTTGATGG CGCCTTATTA CTGGGTATCT GCGACAAAAT CGTGCCGGGC
TTATTGATTG GCGCCTTAAG TTTTGGCCAT TTACCTATGC TGTTTGTGCC TGCAGGCCCA
ATGAAGTCGG GGATCCCAAA CAAGGAAAAA GCCCGTATTC GCCAGCAATT TGCCCAAGGT
AAAGTCGATA GAGCGCAGCT GCTTGAAGCC GAAGCGCAGT CTTACCACAG CGCTGGTACC
TGTACCTTCT ACGGTACGGC CAACTCGAAT CAGCTGATGC TTGAAGTGAT GGGGCTGCAA
TTGCCGGGTT CATCATTTGT AAATCCTGAC GATCCACTGC GTGAAGCCCT GAATAAAATG
GCGGCCAAGC AAGTGTGCCG CTTAACCGAA CTGGGTACTC AATACAGCCC AATCGGTGAA
GTGGTTAACG AGAAATCCGT CGTGAACGGC ATAGTGGCGC TACTGGCAAC GGGTGGTTCA
ACTAACTTAA CCATGCACAT TGTGGCGGCG GCGCGTGCGG CCGGCATTAT CGTTAACTGG
GATGATTTTT CTGAATTATC TGACGCGGTT CCATTATTGG CACGTGTTTA TCCAAACGGT
CATGCGGACA TTAACCACTT CCACGCCGCA GGCGGTATGG CTTTCCTTAT CAAGGAATTA
CTCGATGCGG GCCTACTGCA CGAGGATGTC AACACAGTTG CAGGTTTTGG TCTACGTCGT
TACACCCAAG AGCCCAAATT ACTCGATGGC GAGGTGCGCT GGGTAGATGG TCCAACCGTC
AGCCTAGATA CCGAAGTATT AACGTCTGTC GCTACGCCTT TCCAAAACAA CGGTGGCTTA
AAACTGCTTA AGGGCAACTT GGGTCGTGCT GTGATTAAAG TGTCAGCCGT GCAAGAAAAG
CACCGTGTAG TTGAAGCGCC AGCCGTGGTG ATTGACGATC AAAACAAACT GGATGCGCTG
TTTAAATCCG GCGCATTAGA CCGAGATTGT GTGGTGGTAG TAAAAGGTCA AGGGCCGAAA
GCGAACGGTA TGCCAGAGCT GCACAAGTTA ACGCCGCTGT TAGGCTCTTT GCAGGATAAA
GGCTTTAAAG TGGCACTGAT GACCGACGGT CGTATGTCAG GCGCATCGGG CAAAGTACCA
GCAGCGATTC ACTTAACGCC AGAGGCTATC GATGGCGGGC TGATTGCCAA AGTGCAAGAT
GGCGATCTTA TTCGTGTCGA CGCGCTGACC GGTGAGCTGA GCTTATTGGT CTCTGATGCC
GAACTTGCCG CGAGAACCGC TACAGAAATC GATTTACGCC ACTCACGCTA TGGAATGGGT
CGTGAGTTGT TTGGGGCACT GCGTTCAAAC TTAAGCAGTC CAGAAACCGG TGCGCGCAGT
ACCAGCGCCA TTGACGAACT TTATTAA
 
Protein sequence
MHSVVQSVTD RIIARSKASR EAYLAALNDA RNHGVHRSSL SCGNLAHGFA ACSPDDKNSL 
RQLTKANIGI ITAFNDMLSA HQPYETYPEL LKKACQEVGS VAQVAGGVPA MCDGVTQGQP
GMELSLLSRE VIAMATAVGL SHNMFDGALL LGICDKIVPG LLIGALSFGH LPMLFVPAGP
MKSGIPNKEK ARIRQQFAQG KVDRAQLLEA EAQSYHSAGT CTFYGTANSN QLMLEVMGLQ
LPGSSFVNPD DPLREALNKM AAKQVCRLTE LGTQYSPIGE VVNEKSVVNG IVALLATGGS
TNLTMHIVAA ARAAGIIVNW DDFSELSDAV PLLARVYPNG HADINHFHAA GGMAFLIKEL
LDAGLLHEDV NTVAGFGLRR YTQEPKLLDG EVRWVDGPTV SLDTEVLTSV ATPFQNNGGL
KLLKGNLGRA VIKVSAVQEK HRVVEAPAVV IDDQNKLDAL FKSGALDRDC VVVVKGQGPK
ANGMPELHKL TPLLGSLQDK GFKVALMTDG RMSGASGKVP AAIHLTPEAI DGGLIAKVQD
GDLIRVDALT GELSLLVSDA ELAARTATEI DLRHSRYGMG RELFGALRSN LSSPETGARS
TSAIDELY
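
The relationship between the gene sequence and the protein sequence listed in this record can be reproduced with a short script. This is a minimal sketch in plain Python, not the pipeline used to produce the record. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its set of permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Standard codon assignments in NCBI's compact TCAG ordering (assumed to
# apply here because translation table 11 uses the same assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Raises KeyError if a codon contains anything other than A, C, G, or T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence in this record.
print(translate("ATGCACTCAG TCGTTCAATC TGTTACCGAC"))  # MHSVVQSVTD
print(translate("ACCAGCGCCA TTGACGAACT TTATTAA"))     # TSAIDELY
```

The two printed fragments match the start and end of the protein sequence in this record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content.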