Gene Sbal223_1930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_1930 
Symbol 
ID7090097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp2280390 
End bp2281481 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content51% 
IMG OID643460834 
Productimidazole glycerol-phosphate dehydratase/histidinol phosphatase 
Protein accessionYP_002357858 
Protein GI217973107 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0131] Imidazoleglycerol-phosphate dehydratase
[COG0241] Histidinol phosphatase and related phosphatases 
TIGRFAM ID[TIGR01261] histidinol-phosphatase
[TIGR01656] histidinol-phosphate phosphatase family domain
[TIGR01662] HAD-superfamily hydrolase, subfamily IIIA 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.433526 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.190721 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCAC AATTTATCCC GAACGTTGCA CAAAAAATTC TGTTTATCGA CCGCGATGGC 
ACCTTGATTG AAGAACCCGT CACCGACAAA CAGGTCGATA CTCTGGCTAA ATTAGTGTTT
GAGCCACAGG TGATCCCCGC TTTACTCCGT CTGCAAAAGG CAGGATTTCG TTTAGTCATG
GTCAGCAATC AAGACGGCCT TGGCACACCG TCTTTCCCGC AGGAAGATTT CGACGCGCCC
CACAATATGA TGATGCAAAT CCTCAAAAGC CAAGGGGTTA ACTTCGAAGA TGTGGTGATT
TGCCCGCACT TTAACGATGA AAATTGCAGC TGCCGTAAAC CTAAGCTTGG GCTAGTAAAA
GACTATCTCA CCCAAGGCTG CATCGATTTC ACTCAATCGG CGGTGATTGG CGATCGCGAA
ACCGATGTGG AACTGGGCAA TGCCATGGGC ATTAAAAGCC TGAAATATCA ACGCGAAACT
TTGGGCTGGA ACGCCATCGC CGATGCGCTG CTCAATAAAG GCCGCACCGC GACGGTAGTG
CGCACCACCA AAGAAACCGA TATTCGTGTG ACTGTGGATC TCGACAGTTC AGTCAAAGGA
AAAATCGACA CTGGCATTGG CTTTTTCGAC CACATGCTCG ATCAAATTGC CACCCACGGT
AATTTTAGAA TGGATGTGAA AGTCGATGGC GACTTAGAAA TCGACGATCA CCACAGCGTC
GAAGACACGG CGCTAGCCAT AGGTGATGCC CTGCGCCAAG CGCTCGGTGA CAAACGCGGC
ATTGCCCGTT TCGGCTTTAG TCTGCCGATG GACGAAGCCA AGGGCGAATG TCTGCTGGAT
ATCTCTGGCC GTCCTTTTAT CAAATTCTCG GCCGATTTTG AACGTGAGCA CGTAGGCGAA
ATGGCCACAG AAATGGTGCC ACACTTCTTC CGCTCCTTTG CCGATGGCCT GCGCTGTACG
CTGCATGTGT CGACCGAAGG CGATAACGAT CACCACAAGG TCGAAGCCCT GTTTAAAGTG
CTCGGCCGCG CGCTACGCCA AGCGGTTAAA GTCGAAGGCG ATATTCTGCC ATCGAGCAAA
GGCGTGCTTT AA
 
Protein sequence
MNPQFIPNVA QKILFIDRDG TLIEEPVTDK QVDTLAKLVF EPQVIPALLR LQKAGFRLVM 
VSNQDGLGTP SFPQEDFDAP HNMMMQILKS QGVNFEDVVI CPHFNDENCS CRKPKLGLVK
DYLTQGCIDF TQSAVIGDRE TDVELGNAMG IKSLKYQRET LGWNAIADAL LNKGRTATVV
RTTKETDIRV TVDLDSSVKG KIDTGIGFFD HMLDQIATHG NFRMDVKVDG DLEIDDHHSV
EDTALAIGDA LRQALGDKRG IARFGFSLPM DEAKGECLLD ISGRPFIKFS ADFEREHVGE
MATEMVPHFF RSFADGLRCT LHVSTEGDND HHKVEALFKV LGRALRQAVK VEGDILPSSK
GVL