Gene Shewmr4_1797 details


Gene Information

Locus tag: Shewmr4_1797
Symbol:
ID: 4252371
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 2132545
End bp: 2133636
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 51%
IMG OID: 638118408
Product: imidazole glycerol-phosphate dehydratase/histidinol phosphatase
Protein accession: YP_733928
Protein GI: 113970135
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0131] Imidazoleglycerol-phosphate dehydratase
        [COG0241] Histidinol phosphatase and related phosphatases
TIGRFAM ID: [TIGR01261] histidinol-phosphatase
            [TIGR01656] histidinol-phosphate phosphatase family domain
            [TIGR01662] HAD-superfamily hydrolase, subfamily IIIA
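
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2132545
END_BP = 2133636
GENE_LENGTH_BP = 1092
PROTEIN_LENGTH_AA = 363


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```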


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCCAG TATTTACCCC AAATGTCGCC CAGAAAATAC TTTTTATCGA CCGCGATGGC 
ACCTTGATTG AAGAGCCCAT CACCGATAAA CAGGTCGATA GCCTCGCCAA ACTGGTATTC
GAGCCCCAAG TGATCCCCGC CTTACTGCGC CTGCAAAAAG CTGGCTTTAG GTTGGTGATG
GTCAGCAATC AGGATGGACT CGGTACCCCT TCCTTTCCGC AGGAAGATTT CGATGCGCCG
CACAATATGA TGATGCAAAT TCTCTCCAGC CAAGGGGTTA AGTTTGAAGA TGTACTCATC
TGCCCGCACT TTAACGATGA AAATTGCAGC TGCCGCAAAC CAAAACTAGG GCTGGTGAAA
GACTTTTTGA CCCAAGGCAC CATCGATTTC ACTCAGTCCG CCGTGATTGG TGACAGACAC
ACAGATGTGG AACTGGGCAA TGCCATGGGG ATTAAAAGCT TTCAATATCA GCGTGGCAGC
CTAGGCTGGG ACGCGATTGC CGATGCCTTA CTCAACAAGG GCCGCACCGC GACTGTGGTA
CGCACCACTA AGGAAACCGA TATTCGCGTG ACTGTCGATC TCGACAATGC CAGCAAAGGC
ACCATTAACA CTGGCATTGG TTTCTTCGAC CATATGCTGG ATCAAATCGC CACCCACGGA
AACTTCAAAA TGGAGGTGAA TGTCGATGGC GATCTCGAGA TAGACGATCA CCACAGCGTT
GAAGACACGG CATTGGCGAT TGGGGATGCA CTACGCCAAG CCCTTGGCGA TAAACGCGGT
ATTGCTCGTT TTGGTTTTAG TTTGCCGATG GATGAGGCCA AGGGCGAATG CTTACTGGAT
CTCTCTGGCA GACCTTTTAT TAAATTCGCC GCCCAATTTG AACGGGAAAA AGTCGGTGAA
ATGGCCACTG AAATGGTGCC GCACTTTTTC CGATCTTTTG CCGATGGCCT ACGCTGCACC
CTGCACGTCG CAGCCGAAGG AGATAACGAT CACCACAAGG TCGAAGCGCT ATTTAAAGTA
CTGGGCCGCG CGCTGCGTCA TGCAATCAAA GTTGAAGGTG ATGTCTTACC ATCGAGTAAA
GGCGTGCTCT AA
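
The gene sequence is printed in blocks of ten bases separated by spaces and line breaks. A minimal sketch for turning such text into one contiguous string and computing its GC percentage follows. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is shown as sample input; the full sequence would be needed to compare against the GC content field of the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGAACCCAG TATTTACCCC AAATGTCGCC CAGAAAATAC TTTTTATCGA CCGCGATGGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```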
 
Protein sequence
MNPVFTPNVA QKILFIDRDG TLIEEPITDK QVDSLAKLVF EPQVIPALLR LQKAGFRLVM 
VSNQDGLGTP SFPQEDFDAP HNMMMQILSS QGVKFEDVLI CPHFNDENCS CRKPKLGLVK
DFLTQGTIDF TQSAVIGDRH TDVELGNAMG IKSFQYQRGS LGWDAIADAL LNKGRTATVV
RTTKETDIRV TVDLDNASKG TINTGIGFFD HMLDQIATHG NFKMEVNVDG DLEIDDHHSV
EDTALAIGDA LRQALGDKRG IARFGFSLPM DEAKGECLLD LSGRPFIKFA AQFEREKVGE
MATEMVPHFF RSFADGLRCT LHVAAEGDND HHKVEALFKV LGRALRHAIK VEGDVLPSSK
GVL
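
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact amino acid string that NCBI publishes for translation table 11, whose codon assignments match the standard code. The `translate` helper is hypothetical, treats ATG as an ordinary methionine codon, and does not handle alternative start codons.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1 and drop a single trailing stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First and last codons of the gene sequence above.
print(translate("ATGAACCCAG TATTTACC"))  # MNPVFT
print(translate("GGCGTGCTCT AA"))        # GVL
```

Both fragments agree with the start (MNPVFT...) and end (...GVL) of the protein sequence listed above.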