Gene Sama_1944 details


Gene Information

Locus tag: Sama_1944
Symbol:
ID: 4604194
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella amazonensis SB2B
Kingdom: Bacteria
Replicon accession: NC_008700
Strand:
Start bp: 2372250
End bp: 2373317
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 55%
IMG OID: 639781321
Product: imidazole glycerol-phosphate dehydratase/histidinol phosphatase
Protein accession: YP_927819
Protein GI: 119775079
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0131] Imidazoleglycerol-phosphate dehydratase
        [COG0241] Histidinol phosphatase and related phosphatases
TIGRFAM ID: [TIGR01261] histidinol-phosphatase
            [TIGR01656] histidinol-phosphate phosphatase family domain
            [TIGR01662] HAD-superfamily hydrolase, subfamily IIIA
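
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2372250
end_bp = 2373317

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases each.
codon_count = gene_length // 3

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1068 bp) and Protein Length (355 aa).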


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCAGA AAATGCTTTT TATCGACCGC GATGGCACCC TGATTGAGGA GCCGGTAACA 
GATAAGCAGG TCGACAGTCT CAGCAAGCTG GTGTTTGAAC CAACCGCCAT TCCCGCGCTG
CTGCGGCTGC AAAAAGCCGG TTACCGCCTG ATTATGGTCA GTAATCAGGA TGGCCTCGGC
ACCCCATCTT TCCCTCAGGA AGACTTTGAC GCCCCCCACA ACCTGATGAT GCAGGTTTTT
GAAAGTCAGG GCGTTAAGTT TGATGAGGTG CTGATTTGCC CGCACTTCAA CGATGAAAAC
TGCAGCTGCC GGAAGCCCAA ACTCGGTCTG GTAAAATCCT TCCTGACCCA AGGTCTGGTG
GATTTTACCG CCTCAGCAGT GATTGGCGAT CGCGACACCG ATGTCGAACT TGGCAACGCC
ATGGGCATCA AGAGCTTTAA GTATCAGCGT GAAACCCTCG GCTGGAACGC CATTGCCGAT
TCGCTGCTCG CCAAAGGCCG CTGCGCCACC GTTGTGCGCA CGACCCGCGA AACCGACATT
AGGGTCACGG TGGATTTAGA CACCCCGGGC AACAATCAAA TCGACACCGG CATCGGCTTT
TTTGACCATA TGCTTGATCA AATCGCCACC CACGGTAATT TCAGCCTCAA GCTTAACGTC
GATGGCGACC TTGAGATTGA CGATCACCAC AGTGTGGAAG ACACAGCATT GGCCTTGGGT
GATGCCCTGC GTCAGGCCCT TGGAGATAAG CGCGGCATCG GCCGTTTCGG CTTTGCCCTG
CCGATGGATG AAGCCTCGGG CCAGTGTTTG ATGGATATCT CAGGTCGGCC TTTTATCAAG
TTTGAGGCGA GCTTTAGCCG CGATAAAGTG GGCGAAATGG CCACCGAAAT GGTGCCGCAC
TTCTTTCGCT CCTTCGCCGA TGGTCTGCGC TGCACCTTAC ACATCGGCTG CGATGGCGAT
AACGATCACC ACAAGGTAGA AGCCCTGTTC AAGGTGCTTG GCCGCACCCT ACGCCAGGCC
ATTGCCATCG AAGGTGATGC CCTGCCATCG TCAAAAGGAG TGCTTTGA
 
Protein sequence
MPQKMLFIDR DGTLIEEPVT DKQVDSLSKL VFEPTAIPAL LRLQKAGYRL IMVSNQDGLG 
TPSFPQEDFD APHNLMMQVF ESQGVKFDEV LICPHFNDEN CSCRKPKLGL VKSFLTQGLV
DFTASAVIGD RDTDVELGNA MGIKSFKYQR ETLGWNAIAD SLLAKGRCAT VVRTTRETDI
RVTVDLDTPG NNQIDTGIGF FDHMLDQIAT HGNFSLKLNV DGDLEIDDHH SVEDTALALG
DALRQALGDK RGIGRFGFAL PMDEASGQCL MDISGRPFIK FEASFSRDKV GEMATEMVPH
FFRSFADGLR CTLHIGCDGD NDHHKVEALF KVLGRTLRQA IAIEGDALPS SKGVL