Gene Sbal_4029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal_4029 
Symbol 
ID4844958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS155 
KingdomBacteria 
Replicon accessionNC_009052 
Strand
Start bp4728856 
End bp4730730 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content54% 
IMG OID640121295 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001052364 
Protein GI126176215 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAAGT TACGATCAGC TACCAGTACC GAAGGCCGCA ATATGGCGGG TGCACGTGCG 
TTATGGCGTG CCACAGGGGT GAAAGACAAT GATTTTGGTA AGCCAATTAT CGCCATTGCT
AACTCCTTTA CTCAATTTGT ACCGGGCCAC GTGCACTTAA AAGACATGGG CTCACTCGTC
GCGAGCGCCA TCGAAGAAGC GGGCGGTATC GCAAAAGAAT TCAATACGAT CGCCGTCGAT
GACGGTATCG CCATGGGCCA CGGCGGCATG CTGTACAGTC TGCCATCGCG TGAACTTATC
GCCGACAGTG TGGAATACAT GGTTAACGCC CACTGCGCCG ATGCCTTAGT GTGTATCTCC
AACTGCGACA AGATCACCCC CGGCATGTTG ATGGCGGCGC TGCGCCTCAA TATTCCCGTA
GTGTTTGTGT CTGGTGGACC GATGGAAGCG GGTAAAACTA AGTTGTCGGA CAAACTCATC
AAACTCGATT TAGTCGATGC TATGGTGGCG GGCGCCGACT CGAACGTGAG CGATGAAGAC
AGTGCCAAAA TCGAGCGTAG CGCGTGCCCA ACCTGCGGTT CTTGCTCTGG TATGTTTACC
GCCAACTCAA TGAACTGTTT AACCGAAGCT CTAGGATTAT CGCTGCCGGG TAACGGCTCT
ATGCTGGCAA CCCACGCCGA TCGCCGCGAG CTGTTTTTAG AAGCGGGTCG TCGCGTGATG
GCGCTGGCAA AACGGTATTA TCATCAAGAT GATGAATCGG CATTGCCACG TAATATCGCC
AACTTTAAAG CCTTCGAAAA TGCTATGACC TTAGATATCG CCATGGGCGG TTCATCTAAC
ACCGTATTGC ATTTATTAGC CGCTGCGCAG GAAGCCGATG TTGATTTTAC CATGGCGGAT
ATCGACCGTA TGTCGCGCCT TGTGCCGCAC CTTTGTAAGG TTGCACCATC GACGCCTAAA
TACCATATGG AAGACGTGCA CCGTGCTGGC GGCGTGATGG GGATTTTGGG CGAGCTCGAC
AGAACCGGAT TACTGCATAC CGATGTGTTC CATGTGGCGG CCGACAATGA CGGCACGCCG
GGCAGCGGGA CCTTGAAATC GGTATTGGCC CAGTACGATG TGATGCAGAC GCAAGATGAA
AAAGTAAAAC ACTTCTTTAT GGCGGGACCT GCGGGCATTC CGACCACTAA AGCCTTTAGC
CAAGATTGTC GCTGGCCGTC ACTGGATAAT GACAGGCAAG AAGGCTGTAT CCGTAGCCGC
GAGTTTGCCT TCAGCCAAGA GGGCGGTCTT GCCGTATTGT CGGGCAACGT GGCCGAAAAT
GGCTGTATCG TTAAAACGGC GGGCGTGGAT GAATCGAATC TGACCTTTGT TGGCTCGGCG
CGCGTCTATG AAAGCCAAGA TGACGCCGTG GCGGGCATTT TAGGCGGCGA AGTGGTGGCG
GGGGATGTGG TTGTTATCCG TTATGAAGGC CCAAAAGGCG GCCCGGGTAT GCAAGAAATG
TTGTACCCAA CCAGTTACTT AAAATCTCGT GGCTTAGGCA AGGCCTGTGC GCTGATCACC
GACGGTCGTT TCTCAGGTGG CACTTCAGGT TTATCTATCG GTCACGTTTC ACCCGAAGCG
GCTGCGGGTG GCACAATCGC CTTGATTGAA AATGGCGATC GCATCGAAAT TGATATACCC
AAGCGCAGCA TCAAGTTGGC AGTAAGTGAT GTTGAACTCA ATGCTCGCCG CGAAAAAATG
CACAGTCTTG GCCCAATGGC GTGGAAACCT ATCGGTCGCC AACGTTATGT ATCACTCGCG
CTTAAGGCCT ACGCCATGCT CGCCACCAGT GCCGACAAGG GCGCGGTGCG CGATCGCAGT
AAACTGGAGG ACTAA
 
Protein sequence
MPKLRSATST EGRNMAGARA LWRATGVKDN DFGKPIIAIA NSFTQFVPGH VHLKDMGSLV 
ASAIEEAGGI AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADALVCIS
NCDKITPGML MAALRLNIPV VFVSGGPMEA GKTKLSDKLI KLDLVDAMVA GADSNVSDED
SAKIERSACP TCGSCSGMFT ANSMNCLTEA LGLSLPGNGS MLATHADRRE LFLEAGRRVM
ALAKRYYHQD DESALPRNIA NFKAFENAMT LDIAMGGSSN TVLHLLAAAQ EADVDFTMAD
IDRMSRLVPH LCKVAPSTPK YHMEDVHRAG GVMGILGELD RTGLLHTDVF HVAADNDGTP
GSGTLKSVLA QYDVMQTQDE KVKHFFMAGP AGIPTTKAFS QDCRWPSLDN DRQEGCIRSR
EFAFSQEGGL AVLSGNVAEN GCIVKTAGVD ESNLTFVGSA RVYESQDDAV AGILGGEVVA
GDVVVIRYEG PKGGPGMQEM LYPTSYLKSR GLGKACALIT DGRFSGGTSG LSIGHVSPEA
AAGGTIALIE NGDRIEIDIP KRSIKLAVSD VELNARREKM HSLGPMAWKP IGRQRYVSLA
LKAYAMLATS ADKGAVRDRS KLED