Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sputcn32_2062 |
Symbol | |
ID | 5079488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella putrefaciens CN-32 |
Kingdom | Bacteria |
Replicon accession | NC_009438 |
Strand | - |
Start bp | 2356112 |
End bp | 2357845 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640499224 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001183582 |
Protein GI | 146293158 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000158328 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGAATA AAAAAACAAA AGCACTTCGT TCCGCAAGCT GGTTTGGTAG CGATGATAAA AATGGTTTTA TGTATCGTAG TTGGATGAAA AACCAAGGCA TTCCAGAGCA TCACTTTCAC AACCGGCCTG TTATTGGTAT TTGCAATACG TGGTCGGAGT TAACGCCCTG CAACGGTCAT CTGCGGGAAT TGGCTGAAAG AGTTAAAAAT GGGATCCGTG AAGCAGGGGG GATCCCAGTG GAATTTCCAG TATTTTCCAA TGGTGAATCT AATTTACGCC CCAGTGCCAT GTTAACGCGT AATTTGGCCG CGATGGATAC CGAAGAAGCG ATTCGAGGCA ATCCGATTGA TGGTGTGGTG CTCTTAGTGG GTTGTGATAA AACAACGCCG GCTTTACTCA TGGGGGCGGC CAGTTGCAAT TTACCCACCA TAGTCGTGAC GGGTGGGCCT ATGCTAAATG GTAAACATAA AGGGAAAGAT GTAGGGTCTG GCACTTTAGT CTGGGAACTG CATCAAGAAT ATAAAGCAGG AAATATTAGC TTGGCTGAGT TTATGAATGC CGAAGCCGAT ATGTCACGTT CAACGGGTAC TTGTAATACC ATGGGAACGG CATCGACTAT GGCCTGCATG GCAGAAAGTT TAGGCACCAG TTTGCCACAA AATGCGGCTA TTCCTGCTGT TGATTCAAGA CGTTATGTCT TAGCCCATAT GTCTGGAATG CGTATTGTCG ATATGGTCCA TGAGGATTTA ACGCTTTCAA AAGTGCTCAC GCGTGAAGCA TTTATCAATG CGATAAAAAC CAATGCGGCG ATTGGCGGCT CGACCAATGC GGTTATCCAC TTAAAAGCCA TAGCGGGGCG CATAGGGGTT GATTTGTCAC TGGATGATTG GTCGTATGGC TATGATGTAC CGACGATAGT CAATCTTAAG CCCTCGGGTC AGTATTTGAT GGAGGATTTT TACTATGCAG GAGGTTTACC CGCTGTGCTA AAAGAACTCT TTAACCATCA TTTATTGAGT AAAAATACCT TAACAGTGAA TGGGAAAACG CTCTGGGAAA ATGTGGCAAA TGCACCTTGT TACAACCGAG ACGTGATCAT GAGTATTGAT ACACCCTTAG TTGAAAATGG TGGGATCAGA GTATTAAAGG GGAACTTGGC CCCTCGAGGT GCAGTGATTA AACCTTCTGC AGCCAGTCCG CATTTAATGA AACATCGTGG TAAAGCGGTC GTGTTTGAGA GTTTTGATGA TTACAACGCC CGCATAAATG ATCCCGAACT TGCGATTGAT GAAACGAGCA TCATGGTGCT CAAAAATTGC GGCCCGAAGG GATATCCGGG TATGGCCGAG GTCGGCAATA TGGGATTGCC ACCTAAGCTG TTAAAAAAAG GGATTAAAGA TATGGTTAGG ATTTCGGATG CGCGAATGAG TGGTACTGCA TTTGGGACAG TCGTATTGCA TGTTGTCCCC GAGTCACAGG ACTTAGGCCC ACTAGCTGCG GTACAAAATG GTGATGTGAT CGCACTCGAT ACTTTTGCTG GCGTACTGCA ACTAGAGATT AGTGACGAAG AATTAGCGGA TCGGTTAGCA AAAATCGCCT TGGTTAAGCA ACTTCCTGTC AGCAGTGGTT ATTTATCCCT TTTTAGGGAA AGAGTGTTGC AGGCCGATGA AGGGTGTGAT TTTGATTTTC TAGTGGGATG TCGGGGTGCT GATATTCCGG CACATTCCCA TTAA
|
Protein sequence | MQNKKTKALR SASWFGSDDK NGFMYRSWMK NQGIPEHHFH NRPVIGICNT WSELTPCNGH LRELAERVKN GIREAGGIPV EFPVFSNGES NLRPSAMLTR NLAAMDTEEA IRGNPIDGVV LLVGCDKTTP ALLMGAASCN LPTIVVTGGP MLNGKHKGKD VGSGTLVWEL HQEYKAGNIS LAEFMNAEAD MSRSTGTCNT MGTASTMACM AESLGTSLPQ NAAIPAVDSR RYVLAHMSGM RIVDMVHEDL TLSKVLTREA FINAIKTNAA IGGSTNAVIH LKAIAGRIGV DLSLDDWSYG YDVPTIVNLK PSGQYLMEDF YYAGGLPAVL KELFNHHLLS KNTLTVNGKT LWENVANAPC YNRDVIMSID TPLVENGGIR VLKGNLAPRG AVIKPSAASP HLMKHRGKAV VFESFDDYNA RINDPELAID ETSIMVLKNC GPKGYPGMAE VGNMGLPPKL LKKGIKDMVR ISDARMSGTA FGTVVLHVVP ESQDLGPLAA VQNGDVIALD TFAGVLQLEI SDEELADRLA KIALVKQLPV SSGYLSLFRE RVLQADEGCD FDFLVGCRGA DIPAHSH
|
| |