Gene Shew_2200 details


Gene Information

Locus tag: Shew_2200
Symbol:
ID: 4923114
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella loihica PV-4
Kingdom: Bacteria
Replicon accession: NC_009092
Strand:
Start bp: 2560961
End bp: 2562031
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 59%
IMG OID: 640163785
Product: imidazole glycerol-phosphate dehydratase/histidinol phosphatase
Protein accession: YP_001094325
Protein GI: 127513128
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0131] Imidazoleglycerol-phosphate dehydratase
        [COG0241] Histidinol phosphatase and related phosphatases
TIGRFAM ID: [TIGR01261] histidinol-phosphatase
            [TIGR01656] histidinol-phosphate phosphatase family domain
            [TIGR01662] HAD-superfamily hydrolase, subfamily IIIA
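
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and not part of the record.

```python
# Values copied from the record above.
start_bp = 2560961
end_bp = 2562031

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1071 bp
print(protein_length, "aa")  # 356 aa
```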


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0671818
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAAAC AGAAAATCCT CTTTATCGAC CGCGATGGCA CACTGATCGA GGAGCCAGTC 
ACCGACAAGC AGGTCGATAG CCTGGCCAAG CTGGTATTTG AGCCTCAGGT GATCCCCGCA
CTGCTCAAGC TACAGGGCGC CGGCTACCGC CTGGTGATGG TGAGCAATCA GGACGGCCTT
GGCACCCCCT CCTTCCCCAA GGATGACTTC GATGCGCCCC AGAATATGAT GATGCAGATC
TTCAACAGCC AGGGGGTCAA GTTCGATGAT GTGCTCATCT GCCCACACTT TGACGATGAA
AACTGTAGCT GCCGCAAACC TAAGCTGGGT CTGGTCAAGG CTTACCTCAC CGAGGGCCGG
GTCGACTTTA CTCAGTCGGC GGTGATCGGC GACAGAGAAA CCGATCTGGG CCTGGCCGAG
GCGATGGGCA TCACAGGCAT ACAGTACAAT CGCGACACCT TGAACTGGGA CGCCATTGCC
GAACAACTGC TTGGGGGCAA CCGCGTGGCG ACTGTGGTGC GTACCACCAA GGAGACCGAC
ATCAAGGTCA CTGTCGATCT CGACAGTCAG CTAAAGAGCA GCATCAATAC CGGCATCGGC
TTCTTCGACC ACATGCTGGA TCAAATCGCC ACCCACGGTA ACTTCAGGCT AGATGTGAGC
GTCGATGGCG ATCTGGAAAT CGACGATCAC CACAGCGTCG AAGACACGGC CCTGGCCATT
GGTGATGCCC TCAGGCAGGC CCTTGGGGAT AAACGCGGCA TCGCCCGCTT CGGCTTTAGC
ATCCCCATGG ATGAAGCCAG CGCCAGCTGC CTGCTGGATC TCTCCGGTCG CCCCTTCATC
AAGTTTGAGG GGCAGTTCGA GCGCGAGATG GTCGGCGAGA TGGCCACTGA GATGGTGCCT
CACTTCTTCC GCTCCCTCGC CGATGGCCTG CGCTGCACCC TGCACCTCTC GACCCAAGGC
GATAACGATC ACCACAAGGT GGAGAGCCTG TTTAAGGTCT TTGGCCGTAC CCTGCGCCAG
GCGGTGAAGG TCGAGGGCGA CGCCCTGCCA TCGAGCAAGG GGGTGCTATG A
 
Protein sequence
MMKQKILFID RDGTLIEEPV TDKQVDSLAK LVFEPQVIPA LLKLQGAGYR LVMVSNQDGL 
GTPSFPKDDF DAPQNMMMQI FNSQGVKFDD VLICPHFDDE NCSCRKPKLG LVKAYLTEGR
VDFTQSAVIG DRETDLGLAE AMGITGIQYN RDTLNWDAIA EQLLGGNRVA TVVRTTKETD
IKVTVDLDSQ LKSSINTGIG FFDHMLDQIA THGNFRLDVS VDGDLEIDDH HSVEDTALAI
GDALRQALGD KRGIARFGFS IPMDEASASC LLDLSGRPFI KFEGQFEREM VGEMATEMVP
HFFRSLADGL RCTLHLSTQG DNDHHKVESL FKVFGRTLRQ AVKVEGDALP SSKGVL
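
The relationship between the gene and protein sequences above can be reproduced with a minimal sketch. It assumes the sequence is read in frame from its first base, strips the spaces that group the bases in blocks of ten, and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for internal codons. The helper names `clean`, `gc_content`, and `translate` are hypothetical and do not come from the source database.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used for block formatting and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, and its in-frame final 21 bases.
first_line = "ATGATGAAAC AGAAAATCCT CTTTATCGAC CGCGATGGCA CACTGATCGA GGAGCCAGTC"
last_codons = "TCGAGCAAGG GGGTGCTATG A"

print(translate(first_line))   # MMKQKILFIDRDGTLIEEPV
print(translate(last_codons))  # SSKGVL
```

Running `gc_content` on the full gene sequence should give a value close to the 59% listed in the record.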