Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1673 |
Symbol | |
ID | 5713238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1742093 |
End bp | 1743274 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641267589 |
Product | phosphatase |
Protein accession | YP_001533016 |
Protein GI | 159044222 |
COG category | [F] Nucleotide transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0248] Exopolyphosphatase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.131499 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.356135 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAAGA GATCCCGCAG GCGGGCCACT CCCCACCTGC GCGCGGTACC CGACCTGATC GGCCCGGCAC CATTGTTGTC TCAGAGCCCG GCCCTCGCTG GTCATGGGCT GGCGCGGTCG CGTGTGGAGG GCACGCCGGA CCCGGCGGAT CTTTATGCCG CGCTCGATCT GGGCACGAAT TCCTGCCGGA TGCTGATCGC GCAGCCCAAG GGCAACCAGT TTCACGTGGT CGACAGCTTT TCCAAGTCGG TGCAACTGGG CCACGGGCTG GAGGCATCCG GGCGGCTGAG CCGGGCCTCC ATGGCGCGCA CGGTGGGGGC GTTGAAGATC TGCCAGCGCA AGCTCAAGAC CCACAACGTG CAGCGGATGC GCCTCGTGGC GACCGAGGCG TGTCGCCGGG CCAGCAATGC GCCGGAGTTC ATCTCGCTGG TGCAGCGGGA CACCGGGCTT GCGCTGGAAA TCATCGAACC AGAGCAGGAG GCGCGGCTGG CGGTGGTGTC CTGCGCGCCG CTGGTCTCCA CCCGGACCGA GCAGTTGTTG GTGGTCGATA TCGGTGGCGG CTCCACCGAG TTGGTCTGGA TCGATCTGAG CGCGGTGCCA AGGCTGGAGC GTCCCGGGGC GATCATGCGG CTCCATGCTG GGTTCGACGA GGTCACGCCG GGCATGGTGC CGGCGCGCGT GGTCGACTGG ATCTCGGTGC CGCTGGGGGT TGCGACCCTG AAAGACCAGT TCAGCGATGT GGATGATGAC GCGGCGCGGT TTGCTTTGAT GAGCTGGTTC TTCGAAGAGA ACCTGGCAGA ATTCTCGCCC TACAATGAAA CCGACCAGGT GCGCGAAGGC TTCCAGATCA TCGGCACCTC GGGCACGGTG ACGACAGTGG CGGCGAGCCA TCTGAACCTG CGCCGGTATG ACCGCAACAA GGTCGACGGG TTGCGGATGA CCTCGGACCA GATCGACACG GTGATACAGA CCTACCTGGC CATGGGGCCG GAGGGGCGGC GCAAGGATCC GCGGATCGGG CGCGATCGGC ATTCGCTGAT CATGTCGGGG GCCGCGATTC TGCAGGCGCT GCTGCGGGTC TGGCCCACGG ATCGGCTGTC GGTGGCGGAT CGAGGCTTGC GCGAGGGGCT GCTCTACGCG CAGATGAGCG CCGATGGCGT GCTCGAAGAC GGGCCCTACT GA
|
Protein sequence | MSKRSRRRAT PHLRAVPDLI GPAPLLSQSP ALAGHGLARS RVEGTPDPAD LYAALDLGTN SCRMLIAQPK GNQFHVVDSF SKSVQLGHGL EASGRLSRAS MARTVGALKI CQRKLKTHNV QRMRLVATEA CRRASNAPEF ISLVQRDTGL ALEIIEPEQE ARLAVVSCAP LVSTRTEQLL VVDIGGGSTE LVWIDLSAVP RLERPGAIMR LHAGFDEVTP GMVPARVVDW ISVPLGVATL KDQFSDVDDD AARFALMSWF FEENLAEFSP YNETDQVREG FQIIGTSGTV TTVAASHLNL RRYDRNKVDG LRMTSDQIDT VIQTYLAMGP EGRRKDPRIG RDRHSLIMSG AAILQALLRV WPTDRLSVAD RGLREGLLYA QMSADGVLED GPY
|
| |