Gene SO_2071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSO_2071 
SymbolhisB 
ID1169816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella oneidensis MR-1 
KingdomBacteria 
Replicon accessionNC_004347 
Strand
Start bp2167220 
End bp2168311 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content51% 
IMG OID637343938 
Productimidazole glycerol-phosphate dehydratase/histidinol phosphatase 
Protein accessionNP_717674 
Protein GI24373631 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0131] Imidazoleglycerol-phosphate dehydratase
[COG0241] Histidinol phosphatase and related phosphatases 
TIGRFAM ID[TIGR01261] histidinol-phosphatase
[TIGR01656] histidinol-phosphate phosphatase family domain
[TIGR01662] HAD-superfamily hydrolase, subfamily IIIA 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCCAG TATTTACCCC TAATGTCGCG CAAAAAATAC TTTTTATCGA TCGCGATGGC 
ACCTTGATTG AAGAGCCGGT TACAGATAAG CAAGTCGATA ACCTTGCCAA GCTGGTATTC
GAGCCCCAAG TGATCCCCGC CTTACTGCGC CTGCAAAAAG CCGGTTTTCG TTTGGTGATG
GTCAGCAATC AGGATGGACT CGGTACCCCG TCCTTCCCGC AGGAAGATTT CGATGCGCCG
CACAATATGA TGATGCAAAT CCTCTCCAGC CAAGGGGTTA AGTTTGAAGA TGTGCTAATT
TGCCCACACT TTAACGATGA GAATTGTAGC TGCCGCAAAC CCAAGCTGGG ACTGGTGAAA
GACTTTTTGA CCCAAGGCTT TATCGATTTT ACCCAGTCCG CGGTGATTGG TGACAGACAC
ACAGATGTGG AACTGGGCAA TGCCATGGGG ATTATCAGCT TTCAATATCA GCGAGGCAGT
CTAGGTTGGA ACGCCATTGC CGATGCATTA CTCAACAAGG GCCGCAGCGC GACTGTGGTG
CGTACCACCA AGGAAACCGA TATTCGCGTG ACAGTCGATC TCGACAATGC CAGCAAAGGC
ACGATTAACA CTGGCATTGG CTTTTTCGAC CATATGCTGG ATCAAATCGC CACCCACGGG
AATTTCAAAA TGGAGGTGAA TGTCGATGGA GATCTCGAGA TAGACGATCA CCACAGCGTT
GAAGATACCG CATTGGCGAT TGGGGATGCA CTGCGCCAAG CGCTTGGCGA TAAACGCGGT
ATTGCCCGTT TTGGTTTTAG TTTGCCTATG GATGAGGCCA AGGGCGAATG CTTACTCGAT
CTTTCCGGCA GGCCTTTTAT TAAATTTGCT GCCCAATTTG AACGGGAAAA AGTCGGTGAA
ATGGCCACCG AAATGGTGCC GCACTTTTTC CGCTCCTTTG CCGATGGTTT GCGCTGCACC
CTGCATGTGG CCGCCGAGGG AGACAACGAT CACCACAAGG TAGAAGCACT CTTTAAGGTG
CTGGGCCGCG CACTGCGTCA AGCAGTAAAA GTGGAAGGTG ATGTATTGCC CTCGAGTAAA
GGCGTTCTCT AA
 
Protein sequence
MNPVFTPNVA QKILFIDRDG TLIEEPVTDK QVDNLAKLVF EPQVIPALLR LQKAGFRLVM 
VSNQDGLGTP SFPQEDFDAP HNMMMQILSS QGVKFEDVLI CPHFNDENCS CRKPKLGLVK
DFLTQGFIDF TQSAVIGDRH TDVELGNAMG IISFQYQRGS LGWNAIADAL LNKGRSATVV
RTTKETDIRV TVDLDNASKG TINTGIGFFD HMLDQIATHG NFKMEVNVDG DLEIDDHHSV
EDTALAIGDA LRQALGDKRG IARFGFSLPM DEAKGECLLD LSGRPFIKFA AQFEREKVGE
MATEMVPHFF RSFADGLRCT LHVAAEGDND HHKVEALFKV LGRALRQAVK VEGDVLPSSK
GVL