Gene Sare_4101 details


Gene Information

Locus tag: Sare_4101
Symbol:
ID: 5706528
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4660480
End bp: 4661298
Gene Length: 819 bp
Protein Length: 272 aa
Translation table: 11
GC content: 72%
IMG OID: 641273527
Product: histidinol-phosphate phosphatase, putative
Protein accession: YP_001538882
Protein GI: 159039629
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family
TIGRFAM ID: [TIGR02067] histidinol-phosphate phosphatase HisN, inositol monophosphatase family
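
The start, end, gene length and protein length values above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state explicitly: that the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 4660480
end_bp = 4661298
reported_gene_length = 819      # bp
reported_protein_length = 272   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 819 0 272
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)  # True True
```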


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.332683
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGGTT ACGCCGCCGA CCTCGCCCTC GCCCACCACC TCGCCGACGC CGCCGACGCG 
GTCTCCGTCG CGCGATTCCG CGCCCTGGAC CTCCGCGTGG ACACGAAGCC CGACCTCAGT
CCGGTCTCCG ACGCGGACAC CGCGGTCGAG CAGGAGATCC GGGCCCTGCT CGCCACCCAC
CGCCCGGACG ACGGCCTGCT CGGCGAGGAG TACGGCGAGC AGCCCGCGAC CGGTTCCGGC
CGGCGGCGCT GGGTGGTCGA CCCGATCGAC GGGACGAAGA ACTTCATTCG CGGCGTGCCG
GTGTGGGCCA CACTCATCGC CCTGCTTGAG GGGGACCGGC CGGTCGCCGG TCTGGTCTCC
GCACCGGCCC TGGGCCGTCG CTGGTGGGCG GCGGTCGGCG AGGGGGCGTA CGCGGGGCCG
GACCTACCGT CCGGTACGCC GATCCGGGTG TCGGCGGTAA CCGATCTGAG CGACGCCAGT
TTCTGCTACT CCTCGCTCGG CGGCTGGGAG GACAACGGTC GGTTGGGAGC TGTCCTGCAG
ATCATGCGGG ACGCCTGGCG CAGCCGGGCG TACGGCGACT TCTACGGTTA CATGCTGCTG
GCCGAGGGCG CGCTGGACAT CATGGTGGAG CCGGAACTGT CATTGTGGGA CATCGCGGCG
CTGGTGCCGA TCGTCACCGA GGCGGGGGGA ATGCTCACCG ACCTGGCTGG CCGGCCCGCC
CCTGGAGACA CCAGCTCCGG CGGTACCAGC GCGGTTGCCA CGAACGGTCC GCTGCACGCC
GGTATCCTCA CCCGCCTGAG CGGAGCCCCT GCACGCTGA
 
Protein sequence
MKGYAADLAL AHHLADAADA VSVARFRALD LRVDTKPDLS PVSDADTAVE QEIRALLATH 
RPDDGLLGEE YGEQPATGSG RRRWVVDPID GTKNFIRGVP VWATLIALLE GDRPVAGLVS
APALGRRWWA AVGEGAYAGP DLPSGTPIRV SAVTDLSDAS FCYSSLGGWE DNGRLGAVLQ
IMRDAWRSRA YGDFYGYMLL AEGALDIMVE PELSLWDIAA LVPIVTEAGG MLTDLAGRPA
PGDTSSGGTS AVATNGPLHA GILTRLSGAP AR
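
The gene and protein sequences above can be cross-checked by translating the DNA codon by codon. The following is a minimal sketch, not the database's own method: it builds the standard codon table in NCBI order (translation table 11 uses the same codon-to-amino-acid assignments and differs only in the permitted start codons), strips the 10-base spacing used in the listing, and stops at the first stop codon. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used to group bases in the listing."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)

# First line of the gene sequence above.
first_line = "ATGAAGGGTT ACGCCGCCGA CCTCGCCCTC GCCCACCACC TCGCCGACGC CGCCGACGCG"
print(translate(first_line))  # MKGYAADLALAHHLADAADA
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` applied to the full sequence can be compared with the reported GC content.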