Gene Sbal195_2539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal195_2539 
Symbol 
ID5754298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS195 
KingdomBacteria 
Replicon accessionNC_009997 
Strand
Start bp3002657 
End bp3003748 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content51% 
IMG OID641288833 
Productimidazole glycerol-phosphate dehydratase/histidinol phosphatase 
Protein accessionYP_001554967 
Protein GI160875651 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0131] Imidazoleglycerol-phosphate dehydratase
[COG0241] Histidinol phosphatase and related phosphatases 
TIGRFAM ID[TIGR01261] histidinol-phosphatase
[TIGR01656] histidinol-phosphate phosphatase family domain
[TIGR01662] HAD-superfamily hydrolase, subfamily IIIA 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0279147 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCAC AATTTATCCC GAACGTTGCA CAAAAAATTC TGTTTATCGA CCGCGATGGC 
ACCTTGATTG AAGAGCCCGT CACCGACAAA CAGGTCGATA GTCTGGCTAA ATTAGTGTTC
GAGCCACAGG TGATCCCAGC TTTGCTACGT CTGCAAAAGG CAGGATTTCG TTTGGTCATG
GTCAGCAATC AAGACGGCCT TGGCACACCG TCTTTCCCGC AGGAAGATTT CGACGCGCCC
CACAATATGA TGATGCAAAT CCTCAAAAGC CAAGGGGTGA ACTTCGAAGA TGTGGTGATT
TGCCCGCACT TTAACGATGA AAATTGCAGC TGCCGTAAAC CTAAGCTTGG GCTAGTAAAA
GACTATCTCA CCCAAGGCTG CATCGATTTC ACCCAATCAG CTGTGATTGG CGATCGCGAA
ACGGATGTGG AACTGGGCAA TGCCATGGGC ATTAAAAGCC TTAAATATCA ACGAGACATG
TTGGGCTGGA ACGCCATCGC CGATGCGCTG CTCAATAAAG GCCGCACCGC GACGGTCGTG
CGCACCACCA AAGAAACGGA CATTCGTGTG ACTGTGGATC TCGACAGCCC CGTTAAAGGC
AAAATCGACA CTGGTATTGG CTTTTTCGAC CATATGCTCG ATCAAATTGC CACCCATGGT
AATTTTAGAA TGGATGTGAA AGTCGATGGC GATTTAGAAA TCGACGATCA CCACAGCATC
GAAGACACGG CGCTAGCCAT AGGTGATGCC CTTCGCCAAG CGCTCGGTGA CAAACGCGGC
ATTGCCCGTT TCGGCTTTAG TCTGCCGATG GACGAAGCCA AGGGCGAATG CCTGCTGGAT
ATCTCTGGCC GTCCTTTTAT CAAGTTCTCA GCCGATTTTG AACGTGAGCA CGTGGGTGAA
ATGGCCACAG AAATGGTGCC ACACTTCTTC CGCTCCTTTG CCGATGGCCT GCGCTGTACG
CTGCATGTTT CGACCGAAGG CGATAACGAT CATCACAAGG TTGAAGCCCT GTTTAAAGTG
CTCGGCCGCG CGCTACGCCA AGCGGTTAAA GTTGAAGGCG ATATTCTGCC ATCGAGCAAA
GGCGTACTTT AA
 
Protein sequence
MNPQFIPNVA QKILFIDRDG TLIEEPVTDK QVDSLAKLVF EPQVIPALLR LQKAGFRLVM 
VSNQDGLGTP SFPQEDFDAP HNMMMQILKS QGVNFEDVVI CPHFNDENCS CRKPKLGLVK
DYLTQGCIDF TQSAVIGDRE TDVELGNAMG IKSLKYQRDM LGWNAIADAL LNKGRTATVV
RTTKETDIRV TVDLDSPVKG KIDTGIGFFD HMLDQIATHG NFRMDVKVDG DLEIDDHHSI
EDTALAIGDA LRQALGDKRG IARFGFSLPM DEAKGECLLD ISGRPFIKFS ADFEREHVGE
MATEMVPHFF RSFADGLRCT LHVSTEGDND HHKVEALFKV LGRALRQAVK VEGDILPSSK
GVL