Gene Shewmr4_1983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1983 
Symbol 
ID4252556 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp2357999 
End bp2359732 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content50% 
IMG OID638118596 
Productdihydroxy-acid dehydratase 
Protein accessionYP_734113 
Protein GI113970320 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000233691 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0422763 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAATA AAAAACCGAA AACACTTCGT TCGGCTAGTT GGTTTGGTAG TGATGACAAA 
AATGGCTTTA TGTATCGCAG TTGGATGAAA AACCAAGGCA TACCCGAGCA TCACTTTCAA
AATAAGCCTG TAATTGGTAT TTGCAATACC TGGTCAGAAT TGACGCCCTG TAATGGTCAT
CTACGGGAAT TGGCGCAAAG AGTAAAGAAT GGCATTCGGG AAGCGGGTGG CATTCCAGTG
GAGTTTCCAG TGTTTTCGAA TGGCGAGTCC AACTTACGTC CAAGCGCCAT GCTTACCCGT
AATCTTGCTG CCATGGACAC GGAAGAAGCC ATTCGAGGCA ACCCCATCGA CGGAGTTGTG
CTGTTAGTGG GCTGTGATAA AACGACTCCG GCTTTATTGA TGGGCGCGGC CAGTTGTGAT
TTACCGACAA TCGTTGTTAC CGGTGGTCCC ATGCTCAATG GTAAGCATAA GGGCAAGGAT
GTCGGTTCGG GCACTCTCGT ATGGGAACTG CATCAAGAAT ATAAAGCGGG CAACATCAGT
CTCGCCGCAT TTATGAATGC CGAAGCGGAT ATGTCACGCT CAACGGGCAC CTGCAACACT
ATGGGCACAG CATCGACCAT GGCCTGTATG GTGGAAACCC TTGGGGTGAG TTTGCCACAC
AATGCAGCCA TTCCTGCGGT GGATTCTCGC CGCCAAGTAT TGGCGCATAT GTCGGGAATG
CGAATTGTGG ACATGGTCAA AGAGGATTTG ACCTTAAGTA AAATTTTAAG CCGTGATGCT
TTTATCAATG CCATCAAAGT CAATGCTGCC ATTGGTGGTT CAACCAACGC CGTTATCCAT
TTAAAGGCGA TTGCCGGCAG GATAGGGGTT GAGCTGTCAC TCGATGACTG GCGCCATGGT
TACACAGTAC CGACCATAGT GAATCTTAAG CCTTCGGGTC AGTACTTAAT GGAAGACTTT
TACTACGCAG GTGGCCTGCC AGCAGTATTA AGGCAGCTGT TTGAGCATGA TTTACTGAGC
AAAAACACGC TTACAGTCAA TGCCGCTAGC CTCTGGGACA ATGTCAAAGA GGCGCCTTGT
TATAACCAAG AGGTGATCAT GTCACTTGAA AATCCCTTGG TTGAAAATGG CGGCATTCGC
GTACTTCGCG GCAATCTCGC GCCCCGAGGC GCAGTGATCA AAACGTCAGC CGCCAGCGCA
CACCTGATGC AACACCGCGG TAAAGCCGTG GTGTTTGAAA GCTTCGACGA TTACAACGCC
CGCATCGGCG ATCCTGAATT GGATATCGAT GAAAACAGCA TTATGGTGCT TAAAAACTGT
GGCCCGAAGG GATATCCGGG CATGGCTGAG GTCGGCAATA TGGGACTGCC ACCTAAGTTG
TTGAAAAAAG GAATTAAGGA CATGGTTAGG ATTTCTGATG CACGCATGAG TGGCACCGCC
TTTGGCACAG TTGTGCTGCA TGTTGCCCCA GAAGCACAAG CCCTTGGGCC ACTGGCCGCC
GTTCAAAATG GTGACATGAT AGCGCTTGAT ACCTATGCCG GAACGTTACA GCTGGAGATC
AGTGACCAAG AGTTACAAGC CCGTCTTGCC AAACTGGCGA CGGTGAAATC CATTCCCGTG
AATGGTGGCT ATCTCTCGCT ATTTAAGGAG CATGTTCTCC AGGCGGATGA GGGATGTGAT
TTTGATTTTC TCGTGGGATG TCGAGGTGCA GAGATACCAG CACATTCCCA TTAA
 
Protein sequence
MNNKKPKTLR SASWFGSDDK NGFMYRSWMK NQGIPEHHFQ NKPVIGICNT WSELTPCNGH 
LRELAQRVKN GIREAGGIPV EFPVFSNGES NLRPSAMLTR NLAAMDTEEA IRGNPIDGVV
LLVGCDKTTP ALLMGAASCD LPTIVVTGGP MLNGKHKGKD VGSGTLVWEL HQEYKAGNIS
LAAFMNAEAD MSRSTGTCNT MGTASTMACM VETLGVSLPH NAAIPAVDSR RQVLAHMSGM
RIVDMVKEDL TLSKILSRDA FINAIKVNAA IGGSTNAVIH LKAIAGRIGV ELSLDDWRHG
YTVPTIVNLK PSGQYLMEDF YYAGGLPAVL RQLFEHDLLS KNTLTVNAAS LWDNVKEAPC
YNQEVIMSLE NPLVENGGIR VLRGNLAPRG AVIKTSAASA HLMQHRGKAV VFESFDDYNA
RIGDPELDID ENSIMVLKNC GPKGYPGMAE VGNMGLPPKL LKKGIKDMVR ISDARMSGTA
FGTVVLHVAP EAQALGPLAA VQNGDMIALD TYAGTLQLEI SDQELQARLA KLATVKSIPV
NGGYLSLFKE HVLQADEGCD FDFLVGCRGA EIPAHSH