Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_2802 |
Symbol | |
ID | 5713702 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2960490 |
End bp | 2961767 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641268728 |
Product | oxidoreductase |
Protein accession | YP_001534136 |
Protein GI | 159045342 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.207777 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACAT CAAACCCCTC TGATCCAGGA CGCCGCGGGT TCCTCAAAGG CGCCGCCGCC GTCACCGCCG GGGCCGCTAC CGCCGGGGCC GCCCGCGCCG CGGACGACCC GCTGATCACC GAGGTGCAGC CCTGGGCGCA GAGCTTCGGC GACGGCGTGG ACGCCACCCC CTACGGGATG CCCATCGAAT ACGAGAGCGA CGTGGTCCGC CGCAATGTCG AATGGCTGAC CGCCGACACG ATCAGCTCGA TCAACTTCAC CCCGATCCAT GCCCTCGACG GCACGATCAC CCCCCAGGGT TGCGCGTTCG AGCGGCACCA TTCCGGCGCC ATCGACCTGC CCAAGGAAGA CTACCGGCTG ATGATCAACG GGCTGGTGGA CACCCCCCTC GTGTTCACCT ACGCCGATCT CGAACGCTTC CCGCGCGAAA ACCACGTCTA TTTCTGCGAA TGCGCCGCGA ACACGGGCAT GGAATGGGCC GGCGCGCAGC TCAACGGCGC GCAGTTCACC CATGGCATGA TCCACAACAT GGAATATTCC GGCATCCCGC TCCGCACCCT GCTGAACGAG GCCGGACTCG ATGCGGCCGG GGATCTCGCC GACAAATGGG TCTTCGTCGA AGGCGCCGAT GCCTCGTCCA ACGGCCGCTC CATCCCCATG GTCAAGGCGC TCGACGACGT GCTCGTCGCG TTCAAGGCCA ATGGCGAGGC GCTGCGCAAG GAACACGGCT ACCCGGTGCG CCTGGTGGTG CCGGGCTGGG AGGGCAACAT GTGGGTCAAA TGGCTTCGCC GGGTCGAGGT GATGGACGGC CCCGTGGAAA GCCGCGAGGA AACCAGCAAA TACACCGACG TGCTCGAAGA CGGCACCGCC CGCAAATGGA CATGGGAGAT GGACGCGAAA TCCGTCGTCA CCTCCCCCAG CCCGCAAGCC CCGATCACCC ACGGCAAGGG GCCGCTGGTG ATCACCGGGC TGGCCTGGTC CGGCCGCGGG TCCATCACCC GCGTCGATGT CAGCCTCGAT GGCGGCAAGA ACTGGCAAGA GGCGCGGCTC GCCGCCCCGG GCACCGACAA GGCGCTGACC CGGTTCTATC TCGACCACGA CTGGCAGGGC GAAGAAATGC TGCTGCAATC GCGCGCCCAT GACAGCACCG GCTACGTCCA GCCCACCAAG AACCAGCTGC GCGAGATGCG CGGGCTGAAC TCGATCTACC ACAACAACGG CATCCAGACC TGGTGGGTGC GCGAAACCGG GGAGGCAGAG AATGTCGAGG TTTCCTAA
|
Protein sequence | MSTSNPSDPG RRGFLKGAAA VTAGAATAGA ARAADDPLIT EVQPWAQSFG DGVDATPYGM PIEYESDVVR RNVEWLTADT ISSINFTPIH ALDGTITPQG CAFERHHSGA IDLPKEDYRL MINGLVDTPL VFTYADLERF PRENHVYFCE CAANTGMEWA GAQLNGAQFT HGMIHNMEYS GIPLRTLLNE AGLDAAGDLA DKWVFVEGAD ASSNGRSIPM VKALDDVLVA FKANGEALRK EHGYPVRLVV PGWEGNMWVK WLRRVEVMDG PVESREETSK YTDVLEDGTA RKWTWEMDAK SVVTSPSPQA PITHGKGPLV ITGLAWSGRG SITRVDVSLD GGKNWQEARL AAPGTDKALT RFYLDHDWQG EEMLLQSRAH DSTGYVQPTK NQLREMRGLN SIYHNNGIQT WWVRETGEAE NVEVS
|
| |