Gene Sare_1898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1898 
Symbol 
ID5705943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2192271 
End bp2193293 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content71% 
IMG OID641271402 
ProductHAD family hydrolase 
Protein accessionYP_001536774 
Protein GI159037521 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0647] Predicted sugar phosphatases of the HAD superfamily 
TIGRFAM ID[TIGR01459] HAD-superfamily class IIA hydrolase, TIGR01459
[TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000214064 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGACAA CCGGCGGCGC ACGGCTGGTT GACGGGTATG CCCTGGTCGT CTTCGACCTG 
GACGGGGTGA TCTACCTGGT TGACCGGCCG ATTCCTGGTG CGGTCGAAGC GGTGAGCCAG
CTGCACGCCG ACGGGCAGGC GGTCGCATAC GCCACGAACA ACGCATCTCG CCGGTCGAGC
GAGGTGGCCG ATCTGCTTAC CGGGATGGGC ATTGCCGCGC GGCCGGAGGA GGTGCTGACC
TCTGCGGCGG CCGCCGCGCA GTTGCTTCGT GAGCGGTATC CGGAGGGGTC GCAGATCCTG
GTCGTGGGGG CAGAGGCACT GCGCGCCGAG ATCCGCGCCG CCGGGCTCAC CCCGGTCACG
CGGGCTGATG ACGGACCGGT TGCGGTCGTG CAGGGGTACG GTCCGCAGGT CGGCTGGACC
GATCTGGCCG AGGCGGCGGT GGCTGTCCGG GGCGGGGCGA CCTGGGTTGC CACCAACACG
GACCGTACGT TGCCAAGCGG GCGTGGTCCA CTACCCGGCA ACGGTGCCTT GGTTGCCGCG
GTGCGGACCT CGCTCGGTCG GGGGCCGGAT GTGATTGTCG GCAAGCCGGC ACCGGAACTC
TTCGCCGCCG CCGCCCGCCG GGTTCCCGCG GGCCGTGCGT TGGTCGTCGG CGACCGCTTG
GACACCGATA TTGAGGGCGC GGTCCGAGCC GGGCTGGACA GTCTGCTCGT GCTGACCGGT
GTCAGCGACG TGGCCGAGTT GTTGGCCGCC CCGCCGCAGC GCCGGCCAAC GTACGTTTCG
GTGGATCTGG CGGGGCTGTT CGAGCCGGAG GCTGTGGTGC GGGTGCCAGG CCCGATGGAG
GCCGGTGGAT GGTCTGCGGC GGTCCGCGAT GGTCGGCTGG AGCTGTCCGG AGCGGGACGC
ACGCTGAGCG CACTGCCTGT CCTCTGTACG GCGGCGTGGT CGACGGCGCA GCCGTCACCA
GTGCGGGCCG CCTCGTCGGC GGCCGAGCGT GCGCTCGCAA CGTTCGGCTT GCTGTCCGAC
TGA
 
Protein sequence
MTTTGGARLV DGYALVVFDL DGVIYLVDRP IPGAVEAVSQ LHADGQAVAY ATNNASRRSS 
EVADLLTGMG IAARPEEVLT SAAAAAQLLR ERYPEGSQIL VVGAEALRAE IRAAGLTPVT
RADDGPVAVV QGYGPQVGWT DLAEAAVAVR GGATWVATNT DRTLPSGRGP LPGNGALVAA
VRTSLGRGPD VIVGKPAPEL FAAAARRVPA GRALVVGDRL DTDIEGAVRA GLDSLLVLTG
VSDVAELLAA PPQRRPTYVS VDLAGLFEPE AVVRVPGPME AGGWSAAVRD GRLELSGAGR
TLSALPVLCT AAWSTAQPSP VRAASSAAER ALATFGLLSD