Gene Information |
Locus tag | Sare_5035 |
Symbol | |
ID | 5707306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5702899 |
End bp | 5703720 |
Gene Length | 822 bp |
Protein Length | 273 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641274428 |
Product | HAD family hydrolase |
Protein accession | YP_001539769 |
Protein GI | 159040516 |
COG category | [R] General function prediction only |
COG ID | [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily; [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
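The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch makes explicit. It assumes 1-based, inclusive coordinates and a coding sequence that ends in a stop codon (the listed gene sequence ends in TGA). The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not code for an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp 5702899, End bp 5703720.
length = gene_length_bp(5702899, 5703720)
aa = protein_length_aa(length)
print(length, aa)
```

With the record's coordinates this reproduces the listed Gene Length (822 bp) and Protein Length (273 aa).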
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0548232 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCCGCC CGGGTCTGCC CAAGCTGATC GCCACCGATC TGGACGGGAC GCTCGTCCGC AGCGACGACA CCGTCTCCGC GTACACGCAT GAGGTACTCG ACCGGGTACG CGCCGCCGGC ATTCCGGTGG TCGGCGCGAC CGGTCGCGGA CCGCGGCTGA CCCAACTGAC CCGCAACGAC ATTCGTGCCG CCGACTTTCT GGTGATGGCC GGCGGCGGCC GAGTGGTCGA CCAGAGCGAC CCGGACGGGC CGCTGGTGCT ACGCGACGAG TGGCTCTCCG GCGCCGTACT GGCGGAACTG CTGACCGAGC TGGAGGCAGC AGTCGGCCCG CTGACCGTGA TGGTGGAATC GTCGGACGAA CACGACGCAC CGCTGTGGGG GGATTACCAC GCGAGCTGGC CCTACCAGGA CCGGTTCGAG GCGCGCAGCC GAGCCGAGTG CCTCTCCGGC AACGTGATCA AGGCATTCGC CCGTACCGCC GACCACCACG TCGACGAACT GTTGGACGTG GCCCAGCGCA TCGTCCCACC GCACACCGCC ACACTCACCC AGGCCGGCCT GGGCTTTGTC GAGATCTGCC CGCCTGGCGT GGACAAGGCC ACCGGGCTCG GTGTGGTCGC CGAGCGGCTC GGCGTGGATC CGGCGGAGGT GTTGGTTTTC GGCGACCAGC CGAACGACCT GCCGATGTTC TCTTGGGCGG GCTGGGCGCG GGTGGCCGTG GCGAACGCGC ATCCGGCGGT CCACGCGGCT GCCGACGAAG CCACCCTACG CAACGACGAC GACGGGGTCG CCGTCTACCT TGACCGGCTG CTGACCCGTT GA
|
Protein sequence | MVRPGLPKLI ATDLDGTLVR SDDTVSAYTH EVLDRVRAAG IPVVGATGRG PRLTQLTRND IRAADFLVMA GGGRVVDQSD PDGPLVLRDE WLSGAVLAEL LTELEAAVGP LTVMVESSDE HDAPLWGDYH ASWPYQDRFE ARSRAECLSG NVIKAFARTA DHHVDELLDV AQRIVPPHTA TLTQAGLGFV EICPPGVDKA TGLGVVAERL GVDPAEVLVF GDQPNDLPMF SWAGWARVAV ANAHPAVHAA ADEATLRNDD DGVAVYLDRL LTR
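The sequences above are displayed in space-separated blocks of ten characters, so they need to be rejoined before any analysis. The sketch below shows one way to do that and to compute a GC percentage. The helpers `clean_sequence` and `gc_percent` are hypothetical names introduced here, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that separates display blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two display blocks of the gene sequence in this record.
prefix = clean_sequence("ATGGTCCGCC CGGGTCTGCC")
prefix_gc = gc_percent(prefix)
print(len(prefix), prefix_gc)
```

Applying the same two functions to the complete gene sequence should give a value comparable to the GC content field of the record, although the record does not state how that field was rounded.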
|
| |