Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_1983 |
Symbol | |
ID | 4252556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 2357999 |
End bp | 2359732 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 638118596 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_734113 |
Protein GI | 113970320 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000233691 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0422763 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAATA AAAAACCGAA AACACTTCGT TCGGCTAGTT GGTTTGGTAG TGATGACAAA AATGGCTTTA TGTATCGCAG TTGGATGAAA AACCAAGGCA TACCCGAGCA TCACTTTCAA AATAAGCCTG TAATTGGTAT TTGCAATACC TGGTCAGAAT TGACGCCCTG TAATGGTCAT CTACGGGAAT TGGCGCAAAG AGTAAAGAAT GGCATTCGGG AAGCGGGTGG CATTCCAGTG GAGTTTCCAG TGTTTTCGAA TGGCGAGTCC AACTTACGTC CAAGCGCCAT GCTTACCCGT AATCTTGCTG CCATGGACAC GGAAGAAGCC ATTCGAGGCA ACCCCATCGA CGGAGTTGTG CTGTTAGTGG GCTGTGATAA AACGACTCCG GCTTTATTGA TGGGCGCGGC CAGTTGTGAT TTACCGACAA TCGTTGTTAC CGGTGGTCCC ATGCTCAATG GTAAGCATAA GGGCAAGGAT GTCGGTTCGG GCACTCTCGT ATGGGAACTG CATCAAGAAT ATAAAGCGGG CAACATCAGT CTCGCCGCAT TTATGAATGC CGAAGCGGAT ATGTCACGCT CAACGGGCAC CTGCAACACT ATGGGCACAG CATCGACCAT GGCCTGTATG GTGGAAACCC TTGGGGTGAG TTTGCCACAC AATGCAGCCA TTCCTGCGGT GGATTCTCGC CGCCAAGTAT TGGCGCATAT GTCGGGAATG CGAATTGTGG ACATGGTCAA AGAGGATTTG ACCTTAAGTA AAATTTTAAG CCGTGATGCT TTTATCAATG CCATCAAAGT CAATGCTGCC ATTGGTGGTT CAACCAACGC CGTTATCCAT TTAAAGGCGA TTGCCGGCAG GATAGGGGTT GAGCTGTCAC TCGATGACTG GCGCCATGGT TACACAGTAC CGACCATAGT GAATCTTAAG CCTTCGGGTC AGTACTTAAT GGAAGACTTT TACTACGCAG GTGGCCTGCC AGCAGTATTA AGGCAGCTGT TTGAGCATGA TTTACTGAGC AAAAACACGC TTACAGTCAA TGCCGCTAGC CTCTGGGACA ATGTCAAAGA GGCGCCTTGT TATAACCAAG AGGTGATCAT GTCACTTGAA AATCCCTTGG TTGAAAATGG CGGCATTCGC GTACTTCGCG GCAATCTCGC GCCCCGAGGC GCAGTGATCA AAACGTCAGC CGCCAGCGCA CACCTGATGC AACACCGCGG TAAAGCCGTG GTGTTTGAAA GCTTCGACGA TTACAACGCC CGCATCGGCG ATCCTGAATT GGATATCGAT GAAAACAGCA TTATGGTGCT TAAAAACTGT GGCCCGAAGG GATATCCGGG CATGGCTGAG GTCGGCAATA TGGGACTGCC ACCTAAGTTG TTGAAAAAAG GAATTAAGGA CATGGTTAGG ATTTCTGATG CACGCATGAG TGGCACCGCC TTTGGCACAG TTGTGCTGCA TGTTGCCCCA GAAGCACAAG CCCTTGGGCC ACTGGCCGCC GTTCAAAATG GTGACATGAT AGCGCTTGAT ACCTATGCCG GAACGTTACA GCTGGAGATC AGTGACCAAG AGTTACAAGC CCGTCTTGCC AAACTGGCGA CGGTGAAATC CATTCCCGTG AATGGTGGCT ATCTCTCGCT ATTTAAGGAG CATGTTCTCC AGGCGGATGA GGGATGTGAT TTTGATTTTC TCGTGGGATG TCGAGGTGCA GAGATACCAG CACATTCCCA TTAA
|
Protein sequence | MNNKKPKTLR SASWFGSDDK NGFMYRSWMK NQGIPEHHFQ NKPVIGICNT WSELTPCNGH LRELAQRVKN GIREAGGIPV EFPVFSNGES NLRPSAMLTR NLAAMDTEEA IRGNPIDGVV LLVGCDKTTP ALLMGAASCD LPTIVVTGGP MLNGKHKGKD VGSGTLVWEL HQEYKAGNIS LAAFMNAEAD MSRSTGTCNT MGTASTMACM VETLGVSLPH NAAIPAVDSR RQVLAHMSGM RIVDMVKEDL TLSKILSRDA FINAIKVNAA IGGSTNAVIH LKAIAGRIGV ELSLDDWRHG YTVPTIVNLK PSGQYLMEDF YYAGGLPAVL RQLFEHDLLS KNTLTVNAAS LWDNVKEAPC YNQEVIMSLE NPLVENGGIR VLRGNLAPRG AVIKTSAASA HLMQHRGKAV VFESFDDYNA RIGDPELDID ENSIMVLKNC GPKGYPGMAE VGNMGLPPKL LKKGIKDMVR ISDARMSGTA FGTVVLHVAP EAQALGPLAA VQNGDMIALD TYAGTLQLEI SDQELQARLA KLATVKSIPV NGGYLSLFKE HVLQADEGCD FDFLVGCRGA EIPAHSH
|
| |