Gene Shewmr7_1991 details


Gene Information

Locus tag: Shewmr7_1991
Symbol:
ID: 4256619
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-7
Kingdom: Bacteria
Replicon accession: NC_008322
Strand:
Start bp: 2354513
End bp: 2356246
Gene Length: 1734 bp
Protein Length: 577 aa
Translation table: 11
GC content: 50%
IMG OID: 638122657
Product: dihydroxy-acid dehydratase
Protein accession: YP_738037
Protein GI: 114047487
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
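
The start and end coordinates, the gene length, and the protein length in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2354513
end_bp = 2356246
gene_length_bp = 1734
protein_length_aa = 577

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under those assumptions all three checks agree with the values listed in the record.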


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0134932
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000282177
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGAATAATA AAAAACCGAA AACACTTCGT TCGGCTAGTT GGTTTGGTAG TGATGACAAA 
AATGGCTTTA TGTATCGCAG TTGGATGAAA AACCAAGGCA TACCCGAGCA TCACTTTCAA
AATAAGCCTG TGATTGGTAT TTGCAATACC TGGTCAGAAT TGACGCCCTG TAATGGTCAT
CTACGGGAAT TGGCGCAAAG AGTAAAGAAT GGCATTCGGG AAGCGGGTGG CATTCCAGTG
GAGTTTCCAG TGTTTTCGAA TGGCGAGTCC AACTTACGTC CAAGCGCCAT GCTTACCCGT
AATCTTGCTG CCATGGACAC GGAAGAAGCC ATTCGAGGCA ACCCCATCGA CGGAGTTGTG
CTGTTAGTGG GCTGTGATAA AACGACTCCG GCTTTATTGA TGGGCGCGGC CAGTTGTGAT
TTACCGACAA TCGTTGTTAC CGGTGGTCCC ATGCTCAATG GTAAGCATAA GGGCAAGGAT
GTCGGTTCGG GCACACTCGT GTGGGAACTG CATCAAGAAT ATAAAGCGGG CAACATCAGT
CTCGCCGCAT TTATGAATGC CGAAGCGGAT ATGTCACGCT CAACGGGCAC CTGCAACACT
ATGGGCACAG CATCGACCAT GGCCTGTATG GTGGAAACCC TTGGGGTGAG TTTGCCACAC
AATGCAGCCA TTCCTGCGGT GGATTCTCGC CGCCAAGTAT TGGCGCATAT GTCGGGAATG
CGAATTGTGG ACATGGTCAA AGAGGATTTG ACCTTAAGTA AAATTTTAAG CCGTGATGCT
TTTATTAATG CCATCAAAGT CAATGCTGCC ATTGGTGGTT CAACCAACGC CGTTATCCAT
TTAAAGGCGA TTGCCGGCAG GATAGGGGTA GAGCTGTCAC TCGATGACTG GCGCCATGGT
TACACTGTAC CGACCATAGT GAATCTTAAG CCTTCGGGTC AGTACTTAAT GGAAGACTTT
TACTACGCAG GTGGCCTGCC AGCAGTATTA AGGCAGCTGT TTGAACATGA TTTACTGAGC
AAAAACACGC TCACAGTCAA TGCCGCTAGC CTCTGGGACA ATGTCAAAGA GGCGCCTTGT
TATAACCAAG AGGTGATCAT GTCACTTGAA AATCCCTTGG TTGAAAATGG CGGCATTCGC
GTACTTCGCG GCAATCTCGC GCCCCGAGGC GCAGTGATCA AAACTTCAGC CGCCAGCGCA
CACCTGATGC AGCACCGCGG TAAAGCCGTG GTGTTTGAAA GCTTCGACGA TTACAACGCC
CGCATCGGCG ATCCTGAATT GGATATCGAT GAAAACAGCA TTATGGTGCT TAAAAACTGT
GGCCCGAAGG GATATCCGGG CATGGCTGAG GTCGGCAATA TGGGACTGCC ACCTAAGTTG
TTGAAAAAAG GAATTAAGGA CATGGTTAGG ATTTCTGATG CACGCATGAG TGGCACCGCC
TTTGGCACAG TTGTGCTGCA TGTTGCCCCA GAAGCACAAG CCCTTGGGCC ACTGGCCGCC
GTTCAAAATG GTGACATGAT AGCGCTTGAT ACCTATGCCG GAACGTTACA GCTGGAGATC
AGTGACCAAG AGTTACAAGC CCGTCTTGCC AAACTGGCAA CGGTGAAATC CATTCCAGTG
AATGGTGGCT ATCTCTCGCT CTTTAAGGAG CATGTTCTCC AGGCGGATGA GGGATGTGAT
TTTGATTTTC TCGTGGGATG TCGAGGTGCA GAGATACCAG CACATTCCCA TTAA
 
Protein sequence
MNNKKPKTLR SASWFGSDDK NGFMYRSWMK NQGIPEHHFQ NKPVIGICNT WSELTPCNGH 
LRELAQRVKN GIREAGGIPV EFPVFSNGES NLRPSAMLTR NLAAMDTEEA IRGNPIDGVV
LLVGCDKTTP ALLMGAASCD LPTIVVTGGP MLNGKHKGKD VGSGTLVWEL HQEYKAGNIS
LAAFMNAEAD MSRSTGTCNT MGTASTMACM VETLGVSLPH NAAIPAVDSR RQVLAHMSGM
RIVDMVKEDL TLSKILSRDA FINAIKVNAA IGGSTNAVIH LKAIAGRIGV ELSLDDWRHG
YTVPTIVNLK PSGQYLMEDF YYAGGLPAVL RQLFEHDLLS KNTLTVNAAS LWDNVKEAPC
YNQEVIMSLE NPLVENGGIR VLRGNLAPRG AVIKTSAASA HLMQHRGKAV VFESFDDYNA
RIGDPELDID ENSIMVLKNC GPKGYPGMAE VGNMGLPPKL LKKGIKDMVR ISDARMSGTA
FGTVVLHVAP EAQALGPLAA VQNGDMIALD TYAGTLQLEI SDQELQARLA KLATVKSIPV
NGGYLSLFKE HVLQADEGCD FDFLVGCRGA EIPAHSH
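
The gene and protein sequences in this record are printed in space-separated blocks of ten characters. As a rough sketch of how the two relate, the code below strips that spacing from the first 30 bases of the gene sequence and translates them codon by codon. The helper names `clean_sequence` and `translate` are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for codons other than the alternative start codons.

```python
# Build the codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()

def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First three blocks of the gene sequence and first block of the protein
# sequence, copied from this record.
gene_start = clean_sequence("ATGAATAATA AAAAACCGAA AACACTTCGT")
protein_start = clean_sequence("MNNKKPKTLR")

print(translate(gene_start))                   # MNNKKPKTLR
print(translate(gene_start) == protein_start)  # True
```

The same two helpers could be applied to the full gene sequence; the last codon shown in the record, TAA, would then appear as a trailing `*`.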