Gene Shewana3_2149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_2149 
Symbol 
ID4478345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp2573951 
End bp2575777 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content51% 
IMG OID639726737 
Productphosphogluconate dehydratase 
Protein accessionYP_869785 
Protein GI117920593 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR01196] 6-phosphogluconate dehydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00662337 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.165329 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACTCAG TCGTTCAATC TGTTACCGAC AGAATTATTG CCCGTAGCAA AGCATCTCGT 
GAAGCGTATT TAGCCGCGTT AAATGATGCT CGTAACCATG GCGTACACCG TAGCTCCTTA
AGCTGCGGTA ACTTAGCCCA CGGTTTTGCT GCTTGTAGTC CAGATGACAA AAATTCATTG
CGTCAACTGA CCAAAGCTAA CATCGGGATT ATTACCGCAT TCAACGATAT GTTGTCTGCA
CACCAACCCT ATGAAACCTA TCCTGAATTG CTGAAAAAAG CTTGTCAGGA AGTGGGCAGT
GTTGCACAGG TCGCAGGTGG TGTACCTGCA ATGTGTGACG GTGTGACTCA AGGTCAACCC
GGGATGGAAC TGAGCTTACT GAGTCGTGAA GTGATTGCCA TGGCGACGGC GGTGGGCTTA
TCCCACAACA TGTTTGATGG CGCCTTATTA CTGGGTATCT GCGACAAAAT CGTGCCGGGC
TTATTGATTG GCGCCTTAAG TTTTGGCCAT TTACCTATGC TGTTTGTGCC TGCAGGCCCA
ATGAAGTCGG GGATCCCAAA CAAGGAAAAA GCCCGTATTC GCCAGCAATT TGCCCAAGGT
AAAGTCGATA GAGCGCAGCT GCTTGAAGCC GAAGCGCAGT CTTATCACAG CGCTGGCACT
TGTACCTTCT ACGGTACGGC TAACTCGAAC CAGCTGATGC TCGAAGTCAT GGGGCTGCAA
TTGCCGGGTT CATCCTTTGT GAATCCTGAC GACCCACTGC GTGAAGCGCT GAATAAAATG
GCGGCCAAGC AGGTGTGCCG CTTAACAGAG CTTGGCACTC AATACAGCCC AATCGGTGAA
GTGGTTAACG AGAAATCCGT GGTGAACGGC ATAGTGGCGC TACTGGCAAC GGGTGGTTCA
ACTAACTTAA CCATGCACAT TGTGGCGGCG GCGCGTGCGG CTGGCATTAT CGTGAACTGG
GATGATTTTT CTGAATTATC TGACGCGGTT CCTTTGCTGG CCCGTGTTTA TCCAAACGGT
CATGCGGACA TTAACCACTT CCACGCCGCG GGCGGTATGG CTTTCCTTAT CAAGGAATTA
CTCGACGCGG GCCTCCTACA TGAGGATGTC AACACAGTTG CCGGTTTTGG TCTACGTCGT
TACACCCAAG AGCCAAAATT ACTCGATGGC GAGCTGCGCT GGGTCGATGG TCCAACCGTC
AGCTTAGATA CTGAAGTATT AACGTCTGTC GCTACGCCTT TCCAAAACAA CGGTGGCTTA
AAACTGCTTA AGGGCAACTT GGGCCGTGCG GTGATTAAAG TGTCAGCCGT GCAAGAAAAA
CACCGTGTAG TTGAAGCGCC TGCCGTGGTG ATTGACGATC AAAACAAACT CGATGCGCTG
TTTAAATCCG GCGCATTAGA CCGAGATTGT GTGGTGGTAG TTAAAGGCCA AGGGCCGAAA
GCGAACGGTA TGCCTGAGCT GCACAAGTTA ACGCCACTGT TAGGCTCTCT GCAGGATAAA
GGCTTTAAAG TGGCTCTGAT GACCGACGGT CGTATGTCGG GCGCATCGGG CAAAGTACCA
GCCGCAATTC ACTTAACGCC AGAGGCTATC GATGGAGGGC TAATTGCCAA AGTGCAAGAT
GGCGATCTGA TTCGTGTCGA CGCACTGACC GGTGAGCTGA GCTTATTGGT CTCCGATGCC
GAACTTGCCG CGAGAACCGC CGCTGAAATC GATTTACGCC ACTCACGCTA TGGAATGGGC
CGTGAGTTGT TTGGGGCACT GCGTTCAAAT TTAAGCAGTC CAGAAACTGG TGCGCGCAGT
ACCAGCGCCA TTGACGAACT TTATTAA
 
Protein sequence
MHSVVQSVTD RIIARSKASR EAYLAALNDA RNHGVHRSSL SCGNLAHGFA ACSPDDKNSL 
RQLTKANIGI ITAFNDMLSA HQPYETYPEL LKKACQEVGS VAQVAGGVPA MCDGVTQGQP
GMELSLLSRE VIAMATAVGL SHNMFDGALL LGICDKIVPG LLIGALSFGH LPMLFVPAGP
MKSGIPNKEK ARIRQQFAQG KVDRAQLLEA EAQSYHSAGT CTFYGTANSN QLMLEVMGLQ
LPGSSFVNPD DPLREALNKM AAKQVCRLTE LGTQYSPIGE VVNEKSVVNG IVALLATGGS
TNLTMHIVAA ARAAGIIVNW DDFSELSDAV PLLARVYPNG HADINHFHAA GGMAFLIKEL
LDAGLLHEDV NTVAGFGLRR YTQEPKLLDG ELRWVDGPTV SLDTEVLTSV ATPFQNNGGL
KLLKGNLGRA VIKVSAVQEK HRVVEAPAVV IDDQNKLDAL FKSGALDRDC VVVVKGQGPK
ANGMPELHKL TPLLGSLQDK GFKVALMTDG RMSGASGKVP AAIHLTPEAI DGGLIAKVQD
GDLIRVDALT GELSLLVSDA ELAARTAAEI DLRHSRYGMG RELFGALRSN LSSPETGARS
TSAIDELY