Gene Sbal223_3932 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_3932 
Symbol 
ID7086695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp4682437 
End bp4684311 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content53% 
IMG OID643462808 
Productdihydroxy-acid dehydratase 
Protein accessionYP_002359829 
Protein GI217975078 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAGT TACGATCAGC TACCAGTACC GAAGGCCGCA ATATGGCGGG TGCACGTGCG 
TTATGGCGCG CCACAGGGGT GAAAGACAAT GATTTTGGTA AGCCAATTAT CGCCATTGCT
AACTCCTTTA CTCAATTTGT ACCGGGCCAC GTGCACTTAA AAGATATGGG CTCACTCGTC
GCGAGCGCCA TTGAAGAAGC GGGCGGTATC GCCAAAGAAT TCAATACGAT CGCCGTCGAT
GACGGTATCG CCATGGGCCA CGGCGGCATG CTGTACAGTC TGCCATCGCG CGAGCTTATC
GCCGACAGTG TGGAATACAT GGTTAACGCC CACTGCGCCG ATGCCTTAGT GTGTATCTCC
AACTGTGACA AGATCACTCC CGGCATGTTG ATGGCGGCGC TGCGCCTTAA TATTCCCGTC
GTGTTTGTCT CTGGCGGACC GATGGAAGCG GGTAAAACTA AGCTGTCGGA TAAGCTCATC
AAGCTCGATT TAGTCGATGC TATGGTAGCG GGCGCCGACT CGAATGTGAG CGATGAAGAC
AGTGCTAAAA TTGAGCGTAG CGCGTGCCCA ACCTGCGGCT CTTGCTCAGG CATGTTTACC
GCCAATTCAA TGAATTGTTT AACCGAAGCA CTAGGATTAT CGCTGCCGGG TAACGGCTCT
ATGCTGGCAA CTCACGCCGA TCGCCGCGAG CTGTTTTTAG AAGCGGGTCG CCGCGTGATG
GCGCTGGCAA AACGTTATTA TCATCAAGAT GATGAATCGG CATTGCCACG TAATATCGCA
AACTTTAAAG CCTTCGAAAA TGCTATGACC TTAGATATCG CCATGGGCGG TTCATCTAAC
ACCGTATTGC ATTTATTGGC CGCCGCGCAG GAAGCCGATG TTGATTTTAC CATGGCGGAT
ATCGACCGTA TGTCGCGCCT TGTGCCGCAC CTTTGTAAGG TTGCGCCATC GACGCCTAAA
TACCATATGG AAGACGTGCA CCGTGCGGGC GGCGTGATGG GGATTTTGGG CGAGCTCGAC
AGAGCCGGAT TATTGCATAC CGATGTGTTC CATGTGGCGG CCGATAATGA CGGCACGCCG
GGCAGCGGGA CCTTGAAATC GGTATTGGCC CAGTATGATG TAATGCAGAC GCAAGATGAA
AAAGTAAAAC ACTTCTTTAT GGCGGGGCCT GCGGGGATTC CGACCACTAA AGCCTTTAGC
CAAGATTGTC GCTGGCCGTC ACTGGATAAC GACAGACAAG AAGGCTGTAT CCGTAGCCGT
GAGTTTGCTT TCAGCCAAGA AGGTGGCCTT GCCGTATTGT CGGGCAACGT GGCCGAAAAC
GGCTGTATTG TTAAAACGGC GGGCGTGGAT GAATCGAATC TGACCTTTGT TGGCTCGGCG
CGCGTTTATG AAAGCCAAGA TGATGCCGTG GCGGGTATCT TAGGCGGCGA AGTGGTGGCG
GGTGATGTGG TTGTTATCCG TTACGAAGGC CCGAAAGGCG GCCCGGGTAT GCAAGAAATG
TTGTACCCAA CCAGTTACTT AAAATCACGT GGCTTAGGCA AGGCCTGTGC GCTGATCACC
GACGGTCGTT TCTCCGGTGG CACTTCAGGT TTATCTATCG GCCACGTTTC ACCCGAAGCG
GCAGCGGGCG GCACGATCGC CTTGATTGAA AATGGCGATC GCATCGAAAT TGATATTCCA
AAGCGCAGCA TCAAGTTGGC AGTAAGTGAT GTTGAACTCA ATGCTCGCCG CGAAAAAATG
CACAGTCTTG GCCCAATGGC GTGGAAACCT ATCGGTCGCC AACGTTATGT ATCACTCGCG
CTTAAGGCCT ACGCCATGCT CGCTACCAGT GCCGACAAGG GCGCGGTGCG CGATCGCAGT
AAACTGGAGG ACTAA
 
Protein sequence
MPKLRSATST EGRNMAGARA LWRATGVKDN DFGKPIIAIA NSFTQFVPGH VHLKDMGSLV 
ASAIEEAGGI AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADALVCIS
NCDKITPGML MAALRLNIPV VFVSGGPMEA GKTKLSDKLI KLDLVDAMVA GADSNVSDED
SAKIERSACP TCGSCSGMFT ANSMNCLTEA LGLSLPGNGS MLATHADRRE LFLEAGRRVM
ALAKRYYHQD DESALPRNIA NFKAFENAMT LDIAMGGSSN TVLHLLAAAQ EADVDFTMAD
IDRMSRLVPH LCKVAPSTPK YHMEDVHRAG GVMGILGELD RAGLLHTDVF HVAADNDGTP
GSGTLKSVLA QYDVMQTQDE KVKHFFMAGP AGIPTTKAFS QDCRWPSLDN DRQEGCIRSR
EFAFSQEGGL AVLSGNVAEN GCIVKTAGVD ESNLTFVGSA RVYESQDDAV AGILGGEVVA
GDVVVIRYEG PKGGPGMQEM LYPTSYLKSR GLGKACALIT DGRFSGGTSG LSIGHVSPEA
AAGGTIALIE NGDRIEIDIP KRSIKLAVSD VELNARREKM HSLGPMAWKP IGRQRYVSLA
LKAYAMLATS ADKGAVRDRS KLED