Gene Shew185_2421 details

Gene Information

Locus tag: Shew185_2421
Symbol:
ID: 5372534
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS185
Kingdom: Bacteria
Replicon accession: NC_009665
Strand:
Start bp: 2885468
End bp: 2886559
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 51%
IMG OID: 640830634
Product: imidazole glycerol-phosphate dehydratase/histidinol phosphatase
Protein accession: YP_001366621
Protein GI: 153000940
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0131] Imidazoleglycerol-phosphate dehydratase
        [COG0241] Histidinol phosphatase and related phosphatases
TIGRFAM ID: [TIGR01261] histidinol-phosphatase
            [TIGR01656] histidinol-phosphate phosphatase family domain
            [TIGR01662] HAD-superfamily hydrolase, subfamily IIIA
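
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The Python sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes one terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2885468, 2886559)
print(length)                     # 1092
print(protein_length_aa(length))  # 363
```

Both values agree with the Gene Length and Protein Length fields listed above.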


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCCAC AATTTATCCC GAATGTTGCA CAAAAAATTC TGTTTATCGA CCGCGATGGC 
ACCTTGATTG AAGAACCCGT CACCGACAAA CAAGTCGATA GTCTGGCTAA ATTAGTGTTT
GAGCCACAGG TGATCCCCGC TTTGCTCCGT CTGCAAAAGG CGGGGTTTCG CTTAGTCATG
GTCAGCAATC AAGACGGCCT TGGCACACCG TCTTTCCCGC AGGAAGATTT CGACGCCCCC
CACAATATGA TGATGCAAAT CCTCAAAAGC CAAGGGGTGA ACTTTGAAGA TGTGGTGATT
TGCCCGCACT TTAATGATGA AAATTGCAGC TGCCGTAAAC CTAAGCTTGG GCTAATAAAA
GACTATCTCA CCCAAGGCTG TATCGATTTC ACCCAATCAG CTGTGATTGG CGATCGCGAA
ACCGATGTGG AGCTGGGCAA TGCCATGGGC ATTAAAAGCC TGAAATATCA ACGCGACACT
TTGGGCTGGA ACGCCATAGC CGATGCGCTG CTCAATAAAG GCCGCACCGC GACGGTCGTG
CGTACCACCA AAGAAACGGA CATTCGTGTG ACTGTGGATC TCGACAGTTC AGTCAAAGGA
AAAATCGACA CTGGCATTGG CTTTTTCGAC CACATGCTCG ATCAAATTGC CACCCACGGT
AATTTTAGAA TGGATGTGAA AGTCGATGGC GATTTAGAAA TCGACGATCA CCACAGCGTC
GAAGACACTG CGTTGGCCAT AGGTGATGCC CTGCGCCAAG CGCTCGGTGA CAAACGCGGC
ATTGCCCGTT TCGGCTTTAG CCTGCCGATG GACGAAGCCA AGGGCGAATG CCTGCTGGAT
ATCTCTGGCC GTCCTTTTAT CAAATTCTCA GCCGATTTTG AGCGTGAGCA CGTAGGCGAA
ATGGCCACAG AAATGGTGCC ACACTTCTTC CGCTCCTTTG CCGATGGCTT GCGCTGTACG
CTGCATGTGT CGACCGAAGG TGATAACGAT CACCACAAGG TTGAAGCCCT GTTTAAAGTG
CTCGGCCGCG CGCTACGCCA AGCGGTTAAA GTCGAAGGCG ATATTCTGCC ATCGAGCAAA
GGCGTACTTT AA
 
Protein sequence
MNPQFIPNVA QKILFIDRDG TLIEEPVTDK QVDSLAKLVF EPQVIPALLR LQKAGFRLVM 
VSNQDGLGTP SFPQEDFDAP HNMMMQILKS QGVNFEDVVI CPHFNDENCS CRKPKLGLIK
DYLTQGCIDF TQSAVIGDRE TDVELGNAMG IKSLKYQRDT LGWNAIADAL LNKGRTATVV
RTTKETDIRV TVDLDSSVKG KIDTGIGFFD HMLDQIATHG NFRMDVKVDG DLEIDDHHSV
EDTALAIGDA LRQALGDKRG IARFGFSLPM DEAKGECLLD ISGRPFIKFS ADFEREHVGE
MATEMVPHFF RSFADGLRCT LHVSTEGDND HHKVEALFKV LGRALRQAVK VEGDILPSSK
GVL
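
The sequences above are printed in blocks of ten characters, so they need cleaning before use. The following is a minimal Python sketch that strips the whitespace, computes the GC fraction, and translates codons. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (the tables differ in which start codons are permitted). The helper names are hypothetical, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Standard codon-to-residue assignments, in TCAG order (assumed valid for table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGAACCCAC AATTTATCCC GAATGTTGCA CAAAAAATTC TGTTTATCGA CCGCGATGGC"
print(translate(first_line))  # MNPQFIPNVAQKILFIDRDG
print(gc_fraction(first_line))
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same functions would give the whole-gene values.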