Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_2144 |
Symbol | |
ID | 7085950 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | + |
Start bp | 2546847 |
End bp | 2548580 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 643461046 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_002358070 |
Protein GI | 217973319 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00293618 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.802543 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAATA AAAAAACGAA AGCACTTCGT TCTGCGAGTT GGTTTGGGAG TGATGACAAA AATGGCTTTA TGTATCGCAG CTGGATGAAA AACCAAGGCA TACCAGACCA TCATTTTCAA AATAAACCTG TGATCGGCAT CTGTAACACT TGGTCAGAAC TTACACCTTG TAACGGCCAT CTGCGCGATT TAGCACAAAG GGTCAAAAAC GGCATACGTG AGGCAGGGGG AATACCGGTT GAATTCCCCG TATTTTCCAA TGGCGAATCC AATTTACGTC CAAGCGCTAT GCTGACACGT AACTTAGCAG CGATGGATAC TGAAGAAGCG ATTCGCGGTA ATCCGATTGA TGGTGTTGTC TTATTGGTTG GCTGCGATAA AACCACCCCC GCATTATTGA TGGGCGCCGC CAGTTGTAAC TTGCCGACGA TAGTCGTCAC TGGCGGACCT ATGCTCAATG GTAAGCACAA AGGTAAAGAC GTGGGCTCAG GAACTCTAGT CTGGGAACTG CACCAAGAGT ATAAAGCGGG CAATATCAGT CTTGCAGAGT TTATGAATGC TGAAGCCGAT ATGTCTCGCT CAACGGGCAC CTGTAATACC ATGGGCACAG CATCAACTAT GGCTTGTATG GCGGAAAGTT TAGGCACAAG TTTACCGCAA AATGCCGCCA TTCCTGCCGT AGATTCACGG CGTAATGTGT TGGCCCACAT GTCTGGCATG CGTATTGTTG ACATGGTGCA TGAGGATTTA ACGCTGTCGA AAGTATTGAC CCGCGAGGCT TTTATCAATG CGATAAAAAC CAATGCGGCG ATTGGCGGCT CGACCAACGC GGTGATCCAT TTAAAAGCGA TAGCAGGTCG TATTGGTGTT GAATTGTCAT TGGACGATTG GTCACACGGC TACGATGTGC CCACCATAGT GAATCTTAAA CCTTCAGGTC AGTACTTGAT GGAAGACTTT TATTATGCAG GAGGTTTACC CGCAGTGTTA AAGCAGCTGT TTAATAAAAA TTTATTGAAT AAAAACACTT TAACAGTGAA CGGCCAAACC CTGTGGGCAA ATGTAGTGGA TGCGCCTTGC TACAATAAAG AGGTCATCAT GAACATCGAT GCGCCCTTAG TTGAAAATGG TGGGATTCGG ATATTAAGGG GAAATCTTGC TCCCCGAGGC GCAGTAATTA AGCCTTCGGC GGCCAGTCCT CATTTAATGA AACACAGTGG TAAAGCTGTG GTTTTTGAGA GTTTTGATGA CTATAACGCT CGTATAAACT CTCCAGAATT GGATATTGAT GAAACCAGTA TTATGGTGCT CAAGAATTGC GGCCCCAAGG GATATCCGGG CATGGCGGAG GTGGGTAATA TGGGATTACC ACCTAAGCTA TTGAAAAAAG GCATTAAAGA TATGGTCAGG ATTTCCGATG CGCGCATGAG TGGCACCGCA TTTGGCACTG TAGTCTTGCA TGTTGCACCC GAAGCGCAGG ATTTAGGTCC CTTAGCGGCG GTGCAAAATG GCGATATGAT CACGCTTGAT ACCTTTGCGG GTATTCTGCA ACTTGAGATC AGCGCTGACG AATTAGCAAA TCGATTGGCT AAGTTAGCCT CGGTGAAACC CGTTCCCATC GGCACTGGAT ATTTGTCTCT TTTTAAAGAA AGAGTGCTGC AAGCGGACGA AGGTTGTGAC TTTGATTTTC TAGTGGGATG TCGAGGTGCT GATATTCCGG CACATTCCCA TTAA
|
Protein sequence | MNNKKTKALR SASWFGSDDK NGFMYRSWMK NQGIPDHHFQ NKPVIGICNT WSELTPCNGH LRDLAQRVKN GIREAGGIPV EFPVFSNGES NLRPSAMLTR NLAAMDTEEA IRGNPIDGVV LLVGCDKTTP ALLMGAASCN LPTIVVTGGP MLNGKHKGKD VGSGTLVWEL HQEYKAGNIS LAEFMNAEAD MSRSTGTCNT MGTASTMACM AESLGTSLPQ NAAIPAVDSR RNVLAHMSGM RIVDMVHEDL TLSKVLTREA FINAIKTNAA IGGSTNAVIH LKAIAGRIGV ELSLDDWSHG YDVPTIVNLK PSGQYLMEDF YYAGGLPAVL KQLFNKNLLN KNTLTVNGQT LWANVVDAPC YNKEVIMNID APLVENGGIR ILRGNLAPRG AVIKPSAASP HLMKHSGKAV VFESFDDYNA RINSPELDID ETSIMVLKNC GPKGYPGMAE VGNMGLPPKL LKKGIKDMVR ISDARMSGTA FGTVVLHVAP EAQDLGPLAA VQNGDMITLD TFAGILQLEI SADELANRLA KLASVKPVPI GTGYLSLFKE RVLQADEGCD FDFLVGCRGA DIPAHSH
|
| |