Gene Information |
Locus tag | Dshi_3212 |
Symbol | |
ID | 5712268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 3377980 |
End bp | 3379251 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641269139 |
Product | oxidoreductase molybdopterin binding |
Protein accession | YP_001534546 |
Protein GI | 159045752 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
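The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates (an assumption, although it agrees with the listed gene length) and a CDS that ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3377980, 3379251)
print(length)                     # 1272
print(protein_length_aa(length))  # 423
```

Both results match the Gene Length (1272 bp) and Protein Length (423 aa) fields of this record.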
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGACCAAC CTGAGAAGTC TTCATCGCGC CTGCTGGCAT CTTCTGGTCG CCGCGCTTTT CTGGGGGCCA GCGCCGGGCT TGGGGCTGCT GCCGTCGCCG GGCTTGCGGC CAGGGCCGCG CCGCTTGCCG TACCCGAAAG CAACCGGACC ATGGGCGATC CGATCCCGGA GACGGATTAC GGTATGCCGA TCGAGTACGA GGACCACGTC CGGCGCAGGC GCACGGATGT TTTCGTCAAT CGGCAGAACT ATTCTGACTG GAGCATGACC CCCCTGCATC AACAGTTGGG CATCGCGACG CCCAACGGGC TGTTCTTCGA GCGCCACCAT AACGGCGTTG CCCGGATTGA CCCGGATGTG CACCGGGTCG CGATCCACGG AATGGTGCGG CAGCCGCTCC TGTTCTCGAT GGACGACCTG ATGCGGTATC CTTCGGTCTC GCGGTTTCAT TTTCTGGAGT GCTCCGGCAA CGGGCTGACG GATTGGCGCG AGGCACGTTC GACCACCGTG CAACAAAGCC ATGGCCTTCT GTCCTGCGCG CAATGGACCG GCATCCCCCT GTCATGGCTT CTGGATGAAG CGGGCCTGCA GGACGGCGCG AGCTGGGTCG TTCTGGAGGG CGCCGATGGG TCCGGGCACC TGCGATCGAT CCCGATCGAC AAGATCATGG ATGATGCGCT GCTGGCCTAT GGTCAGAATG GTGAGATGCT GCGGGCGGAG CAGGGCTATC CGGTGCGTGC GATCCTTCCC GGCTGGGAAG GAAACACCAA CGTCAAATGG CTGCGCCGGA TCTATGTGAC AAACGAACCG TTGCACGTTC GGGGCGAGAC CGCGCGCTAT ACCGACCCCA TGCCGGATGG CAAATGGCGG CAGTTCTCCA TGGAGATGGA GGCGAAATCG GTCATCACCA ACCCGTCCGG GGGCATGCGC CTGCCCGGAC CGGGCCCGGT GGAGCTGTCC GGCTTTGCGT GGTCCGGAAA CGGCACGATC AGCCATGTGG ACGTGACCGT CGATGGTGGG CGCAGCTGGG TCGAAGCGCA ACTCGAAGGC CCGGTGATGG AGAAGTGCCT GACGCGCTTC CGGTTCCGGT GGAACTGGGA CGGAACGCCG GCGAAGATTG CCAGTCGCGC CGTCGACAGC ACGGGATACG TGCAGCCCAC CGCCGAGCAA CTGGCGCGAG TCCGGGAGAT CTCGGGCTTC GTGCAGCACA ACAATGCGAT CTTCCCATGG ACGATCGCTT CCAACGGGGA GGTCGGCAAT GCAATCGCGT AA
Protein sequence | MDQPEKSSSR LLASSGRRAF LGASAGLGAA AVAGLAARAA PLAVPESNRT MGDPIPETDY GMPIEYEDHV RRRRTDVFVN RQNYSDWSMT PLHQQLGIAT PNGLFFERHH NGVARIDPDV HRVAIHGMVR QPLLFSMDDL MRYPSVSRFH FLECSGNGLT DWREARSTTV QQSHGLLSCA QWTGIPLSWL LDEAGLQDGA SWVVLEGADG SGHLRSIPID KIMDDALLAY GQNGEMLRAE QGYPVRAILP GWEGNTNVKW LRRIYVTNEP LHVRGETARY TDPMPDGKWR QFSMEMEAKS VITNPSGGMR LPGPGPVELS GFAWSGNGTI SHVDVTVDGG RSWVEAQLEG PVMEKCLTRF RFRWNWDGTP AKIASRAVDS TGYVQPTAEQ LARVREISGF VQHNNAIFPW TIASNGEVGN AIA
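The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds a codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code for internal codons (table 11 differs in the start codons it permits, which this sketch does not model). The gene sequence is treated as already being in coding orientation, since it begins with ATG and ends with TAA as listed. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGACCAAC CTGAGAAGTC TTCATCGCGC"))  # MDQPEKSSSR
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` in the same way should allow the complete 423-residue protein to be compared.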