Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_0629 |
Symbol | |
ID | 5712088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 626365 |
End bp | 627237 |
Gene Length | 873 bp |
Protein Length | 290 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641266537 |
Product | periplasmic substrate-binding protein |
Protein accession | YP_001531976 |
Protein GI | 159043182 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1910] Periplasmic molybdate-binding protein/domain |
TIGRFAM ID | [TIGR01764] DNA binding domain, excisionase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCTCA ACGCCCCTGA CGCCCCGAGA TTTCTGACCA CCAAGGAGGT GGCGGATTTG CTGCGGGTGC GGGAGCGGAA GGTCTATGAC CTGGCCGGCG CGGACGAGAT CCCCCATCGC CGGATCACAG GGAAACTGCT GTTCCCGCGC GAGGAACTTC TCGACTGGAT CGAAGGAGAC CGGCAGGCGA CCGCCCTGCC TCCGGTGCTG ACCGGGTCGC ATGATCCGTG GCTCGCCTGG GCGGTCGGGG CCTCCGACTG CGGGCTGGCG GTGTTGCAGA ATGGCAGTGC CGAAGGGCTC GCGCGCTTCG CGGCGCGAGA GGCCGCGTTC TGCGGGCTGC ACATCCCCGA AGACGACGGC TGGAACGTGG GCAGCGTGCG GGCGCAGGGG GTGCGCGACT GCGTGCTGAT CGGATGGGCG ATACGCCAAC GGGGGCTGCT GCTGTCCCCT GCCCTTGCCG CCGATGTCAA AGATGTGGCC GCCCTGCGAG GCCAGCGTGT CGTGCTGCGC CAGCTGGGCG CCGGGGCGGC GGTATTGCTG GAGCGGTTGC TGGACGATGC GGGGATGGCA CTGTCCGACC TGGTCACGGG GACGGATGTG GCCCGAACCG AGCACGAGGC GGCGGAGGCG GTCGCCTCCG GCGAGGCGGA CGCGGCGCTG GGACTTGCGG CCATGGCACG GCAGTTCAAG CTGGGCTTCG TGCCGATCGT GGAGGAACGC TTCGACTTGC TGGTCGACCG GGCGTTCTAT TTCTCGGATC CGTTCCAGCG CCTGCTGCAA TTCGCGTCAA CGCCCGCGGC CTTGGAAAAG GCGCGGGCGC TGGGGGGCTA CGACGTGTCC GAGCTTGGCA CGGTCCGCTG GAACGGACCG TGA
|
Protein sequence | MDLNAPDAPR FLTTKEVADL LRVRERKVYD LAGADEIPHR RITGKLLFPR EELLDWIEGD RQATALPPVL TGSHDPWLAW AVGASDCGLA VLQNGSAEGL ARFAAREAAF CGLHIPEDDG WNVGSVRAQG VRDCVLIGWA IRQRGLLLSP ALAADVKDVA ALRGQRVVLR QLGAGAAVLL ERLLDDAGMA LSDLVTGTDV ARTEHEAAEA VASGEADAAL GLAAMARQFK LGFVPIVEER FDLLVDRAFY FSDPFQRLLQ FASTPAALEK ARALGGYDVS ELGTVRWNGP
|
| |