Gene Information |
Locus tag | Sare_1898 |
Symbol | |
ID | 5705943 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2192271 |
End bp | 2193293 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641271402 |
Product | HAD family hydrolase |
Protein accession | YP_001536774 |
Protein GI | 159037521 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0647] Predicted sugar phosphatases of the HAD superfamily |
TIGRFAM ID | [TIGR01459] HAD-superfamily class IIA hydrolase, TIGR01459; [TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA |
|
|
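The coordinate and length fields in the record above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (the record states the gene is not spliced). The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 2192271
end_bp = 2193293


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1023 340
```

Both values agree with the Gene Length and Protein Length rows of the record.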
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000214064 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGACAA CCGGCGGCGC ACGGCTGGTT GACGGGTATG CCCTGGTCGT CTTCGACCTG GACGGGGTGA TCTACCTGGT TGACCGGCCG ATTCCTGGTG CGGTCGAAGC GGTGAGCCAG CTGCACGCCG ACGGGCAGGC GGTCGCATAC GCCACGAACA ACGCATCTCG CCGGTCGAGC GAGGTGGCCG ATCTGCTTAC CGGGATGGGC ATTGCCGCGC GGCCGGAGGA GGTGCTGACC TCTGCGGCGG CCGCCGCGCA GTTGCTTCGT GAGCGGTATC CGGAGGGGTC GCAGATCCTG GTCGTGGGGG CAGAGGCACT GCGCGCCGAG ATCCGCGCCG CCGGGCTCAC CCCGGTCACG CGGGCTGATG ACGGACCGGT TGCGGTCGTG CAGGGGTACG GTCCGCAGGT CGGCTGGACC GATCTGGCCG AGGCGGCGGT GGCTGTCCGG GGCGGGGCGA CCTGGGTTGC CACCAACACG GACCGTACGT TGCCAAGCGG GCGTGGTCCA CTACCCGGCA ACGGTGCCTT GGTTGCCGCG GTGCGGACCT CGCTCGGTCG GGGGCCGGAT GTGATTGTCG GCAAGCCGGC ACCGGAACTC TTCGCCGCCG CCGCCCGCCG GGTTCCCGCG GGCCGTGCGT TGGTCGTCGG CGACCGCTTG GACACCGATA TTGAGGGCGC GGTCCGAGCC GGGCTGGACA GTCTGCTCGT GCTGACCGGT GTCAGCGACG TGGCCGAGTT GTTGGCCGCC CCGCCGCAGC GCCGGCCAAC GTACGTTTCG GTGGATCTGG CGGGGCTGTT CGAGCCGGAG GCTGTGGTGC GGGTGCCAGG CCCGATGGAG GCCGGTGGAT GGTCTGCGGC GGTCCGCGAT GGTCGGCTGG AGCTGTCCGG AGCGGGACGC ACGCTGAGCG CACTGCCTGT CCTCTGTACG GCGGCGTGGT CGACGGCGCA GCCGTCACCA GTGCGGGCCG CCTCGTCGGC GGCCGAGCGT GCGCTCGCAA CGTTCGGCTT GCTGTCCGAC TGA
|
Protein sequence | MTTTGGARLV DGYALVVFDL DGVIYLVDRP IPGAVEAVSQ LHADGQAVAY ATNNASRRSS EVADLLTGMG IAARPEEVLT SAAAAAQLLR ERYPEGSQIL VVGAEALRAE IRAAGLTPVT RADDGPVAVV QGYGPQVGWT DLAEAAVAVR GGATWVATNT DRTLPSGRGP LPGNGALVAA VRTSLGRGPD VIVGKPAPEL FAAAARRVPA GRALVVGDRL DTDIEGAVRA GLDSLLVLTG VSDVAELLAA PPQRRPTYVS VDLAGLFEPE AVVRVPGPME AGGWSAAVRD GRLELSGAGR TLSALPVLCT AAWSTAQPSP VRAASSAAER ALATFGLLSD
|
| |
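The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by recomputing the GC fraction. The following is a minimal sketch in plain Python, not the method used by the database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper names `clean`, `gc_content`, and `translate` are illustrative, and only the first 30 bases of the gene are used to keep the example short, so the GC value printed here is for that prefix and not the 71% reported for the whole gene.

```python
# Codon table built from the compact NCBI-style layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in the record.
prefix = "ATGACGACAA CCGGCGGCGC ACGGCTGGTT"
print(translate(prefix))            # MTTTGGARLV
print(f"{gc_content(prefix):.2f}")  # 0.67
```

The translated prefix matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same functions would allow the complete protein sequence and the reported GC content to be checked in the same way.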