Gene Sbal223_2144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_2144 
Symbol 
ID7085950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp2546847 
End bp2548580 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content46% 
IMG OID643461046 
Productdihydroxy-acid dehydratase 
Protein accessionYP_002358070 
Protein GI217973319 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00293618 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.802543 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAATA AAAAAACGAA AGCACTTCGT TCTGCGAGTT GGTTTGGGAG TGATGACAAA 
AATGGCTTTA TGTATCGCAG CTGGATGAAA AACCAAGGCA TACCAGACCA TCATTTTCAA
AATAAACCTG TGATCGGCAT CTGTAACACT TGGTCAGAAC TTACACCTTG TAACGGCCAT
CTGCGCGATT TAGCACAAAG GGTCAAAAAC GGCATACGTG AGGCAGGGGG AATACCGGTT
GAATTCCCCG TATTTTCCAA TGGCGAATCC AATTTACGTC CAAGCGCTAT GCTGACACGT
AACTTAGCAG CGATGGATAC TGAAGAAGCG ATTCGCGGTA ATCCGATTGA TGGTGTTGTC
TTATTGGTTG GCTGCGATAA AACCACCCCC GCATTATTGA TGGGCGCCGC CAGTTGTAAC
TTGCCGACGA TAGTCGTCAC TGGCGGACCT ATGCTCAATG GTAAGCACAA AGGTAAAGAC
GTGGGCTCAG GAACTCTAGT CTGGGAACTG CACCAAGAGT ATAAAGCGGG CAATATCAGT
CTTGCAGAGT TTATGAATGC TGAAGCCGAT ATGTCTCGCT CAACGGGCAC CTGTAATACC
ATGGGCACAG CATCAACTAT GGCTTGTATG GCGGAAAGTT TAGGCACAAG TTTACCGCAA
AATGCCGCCA TTCCTGCCGT AGATTCACGG CGTAATGTGT TGGCCCACAT GTCTGGCATG
CGTATTGTTG ACATGGTGCA TGAGGATTTA ACGCTGTCGA AAGTATTGAC CCGCGAGGCT
TTTATCAATG CGATAAAAAC CAATGCGGCG ATTGGCGGCT CGACCAACGC GGTGATCCAT
TTAAAAGCGA TAGCAGGTCG TATTGGTGTT GAATTGTCAT TGGACGATTG GTCACACGGC
TACGATGTGC CCACCATAGT GAATCTTAAA CCTTCAGGTC AGTACTTGAT GGAAGACTTT
TATTATGCAG GAGGTTTACC CGCAGTGTTA AAGCAGCTGT TTAATAAAAA TTTATTGAAT
AAAAACACTT TAACAGTGAA CGGCCAAACC CTGTGGGCAA ATGTAGTGGA TGCGCCTTGC
TACAATAAAG AGGTCATCAT GAACATCGAT GCGCCCTTAG TTGAAAATGG TGGGATTCGG
ATATTAAGGG GAAATCTTGC TCCCCGAGGC GCAGTAATTA AGCCTTCGGC GGCCAGTCCT
CATTTAATGA AACACAGTGG TAAAGCTGTG GTTTTTGAGA GTTTTGATGA CTATAACGCT
CGTATAAACT CTCCAGAATT GGATATTGAT GAAACCAGTA TTATGGTGCT CAAGAATTGC
GGCCCCAAGG GATATCCGGG CATGGCGGAG GTGGGTAATA TGGGATTACC ACCTAAGCTA
TTGAAAAAAG GCATTAAAGA TATGGTCAGG ATTTCCGATG CGCGCATGAG TGGCACCGCA
TTTGGCACTG TAGTCTTGCA TGTTGCACCC GAAGCGCAGG ATTTAGGTCC CTTAGCGGCG
GTGCAAAATG GCGATATGAT CACGCTTGAT ACCTTTGCGG GTATTCTGCA ACTTGAGATC
AGCGCTGACG AATTAGCAAA TCGATTGGCT AAGTTAGCCT CGGTGAAACC CGTTCCCATC
GGCACTGGAT ATTTGTCTCT TTTTAAAGAA AGAGTGCTGC AAGCGGACGA AGGTTGTGAC
TTTGATTTTC TAGTGGGATG TCGAGGTGCT GATATTCCGG CACATTCCCA TTAA
 
Protein sequence
MNNKKTKALR SASWFGSDDK NGFMYRSWMK NQGIPDHHFQ NKPVIGICNT WSELTPCNGH 
LRDLAQRVKN GIREAGGIPV EFPVFSNGES NLRPSAMLTR NLAAMDTEEA IRGNPIDGVV
LLVGCDKTTP ALLMGAASCN LPTIVVTGGP MLNGKHKGKD VGSGTLVWEL HQEYKAGNIS
LAEFMNAEAD MSRSTGTCNT MGTASTMACM AESLGTSLPQ NAAIPAVDSR RNVLAHMSGM
RIVDMVHEDL TLSKVLTREA FINAIKTNAA IGGSTNAVIH LKAIAGRIGV ELSLDDWSHG
YDVPTIVNLK PSGQYLMEDF YYAGGLPAVL KQLFNKNLLN KNTLTVNGQT LWANVVDAPC
YNKEVIMNID APLVENGGIR ILRGNLAPRG AVIKPSAASP HLMKHSGKAV VFESFDDYNA
RINSPELDID ETSIMVLKNC GPKGYPGMAE VGNMGLPPKL LKKGIKDMVR ISDARMSGTA
FGTVVLHVAP EAQDLGPLAA VQNGDMITLD TFAGILQLEI SADELANRLA KLASVKPVPI
GTGYLSLFKE RVLQADEGCD FDFLVGCRGA DIPAHSH