Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1238 |
Symbol | dmsA2 |
ID | 5711796 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1282492 |
End bp | 1285278 |
Gene Length | 2787 bp |
Protein Length | 928 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641267150 |
Product | molybdopterin oxidoreductase Fe4S4 region |
Protein accession | YP_001532581 |
Protein GI | 159043787 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.389489 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.0809066 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAAATTG ATCCCATGAA AGACAGTGTC GCACCAGACA TCCCGACCCC GCGCTTCGAC GAGATGCGAC AAACGACCTG CTACATGTGC GCCTGCCGCT GCGGGATCAA CGTTCACCTC AAGGACGGCA AGGTCGCGTA TATCGAGGGC AATCGCGCCC ACCCTGTGAA CAAGGGCGTC CTTTGCGCCA AGGGCTCCGC AGGCATCAAG CAGCACACCG CGCCCTCGCG CCTGCGCGCG CCGATGAAGC GCGTCGGCCC GCGCGGCTCG GGCCAGTTCG AAGAGATCAC CTGGGACGAG GCCCTGCAAT TGGCGGTCGA CTGGCTCGCG CCCTTGCGCG AGACGGCACC CGAGAAGCTC GCCTTCTTCA CCGGGCGCGA CCAGTCCCAA AGCTTCACGA GCTTCTTCGC GCAGAATTTC GGGACCCCGA ACTACGCCGC CCATGGCGGG TTCTGCTCCG TCAACATGGC AGCGGCCGGC ATCTACACCA TGGGCGGGGC CTTCTGGGAA TTCGGCGCGC CGGATTGGGA GCATACCAGG ATGTTCCTGC TCTTCGGCGT GGCCGAGGAC CATGACAGCA ACCCGATCAA GATCGGCCTC GGCAAGCTGA AGGCCCGCGG TGCCAAGGTC GTCGGCATCA ATCCGATCCG CACCGGCTAC AACGCGATTG CCGATGACTG GTACGGGATC ACACCGGGCA CGGACGGCCT CTTGATCCTG TCGCTGATCC ACTGCCTGCT GCGCGCGGGC CGGATCGATC TCGACTACCT CGCGGCCTTC ACCAACGCGC CTGTCCTGGT GAACGAGGAT CCCGCCAGCC CCGAAAAGGG TCTCCTGCTG CGCGACGAGG CGGACAAGCT GCTGATCCTC GATCGCGCCA CGGGCAAGCC GGTGCCTTTC GACACCCAGG GTGTGCAGCC CGATCTCGCG GGCTCCTATC GCCGCGCGGG CGTATCGCAC CGCCCCGTGA TCCAGCACAT CGCCGAGAAG TACCTGAGCG ATGATTACGC GCCCGAGGCC GTGGCCGACC GCTGCGGCAT CCCGGCCGCG CGCATCCGCA CGCTTGCCGC CGAGCTTGCC CGGACCGCCT TCGACGAGGC CTTCGAACTC GATATCCCCT GGACCGATTT CCGCGGCAAC AAGCACCAGA CCATGCAAGG CCGCCCGGTC TCGATGCACG CCATGCGGGG CATCTCCGCC CATGCCAACG GGTTTCAGAC CGCCCGCGCG CTGCACCTGT TGCAGATCCT GCTGGGCACG GTGGAGGTCC CCGGCGGCTT CCGCTTCAAG CCCCCCTACC CCAAACCGCC CGAGGCGCAT CCCAAGCCCC ATTGCAAGGT CACCCCCGGC GCACCGCTCG ATGGCCCGCA TCTGGGCTTC GTGCACGGCC CGGACGATCT GTGCCTGACC CCCGAGGGCG CGCCTGCCCG CATCGACAAG GCGTTCTCCT GGGACAACCC CATGTCGGCC CACGGGCTGA TGCACATGGT GATCTCCAAC GCCCATGCAG GCGATCCCTA CAAGATCGAC ACCCTGTTCA TGTACATGGC CAACATGGCC TGGAACTCCT CCATGAACAC CGCCGAGACC ATGGCGATGC TCACCGACAC CGACGAGACC GGCGCCTACA AGATCCCGCG CATCATCTAT GCCGATGCCT ACAAATCCGA GATGGTCGCC TATGCCGACC TGATCCTGCC GGACACCACC TATCTCGAAC GCTACGACTG CATCTCGCTG CTCGACCGTC CGATCTGCGA GCCGGACGCC GTGGCCGACG CGATCCGCTG GCCGGTGGTC GAGCCTGATC GCGACGTGCG CGGGTTCCAG TCGGTGCTGG TGGATCTCGG CGCACGGCTG GGTCTGCCCG GCTTCGTCGA TGCCGAGGGC GCGCCGCTTT ACGCCGATTA CGCCGATTAC ATGGTCAAGC ACGAACGCCG CCCCGGCGTC GGCCCGCTCG CGGGCTTTCG CGACACCGGC GAGGCGGCGG GCCGCGGCGC GCCGAACCCT GACCAGATCC AGCGCTACAT CGACAATGGC GGCTTCTGGG AGGCCGAGGT GCCCAAAGAG GCCCGCTACT ACAAACCGTG GAACACCGCC TACCAGGACT GGGCGGTCGA AATGGCCTTC TACGACGCGC CGCAACCCTA CATCTTCAAC CTCTGGTCCG AACCCCTGCG CCGGTTCCAG CTCGCCGCCG AGGGCCATGG CGAACGCCAA CCGCCCGACC ACTTGCGCGC GCAAATCCGC GAGAAGCTCT CGCCGCTCCC GATCTGGTAC GAGAGCACCG ATGCCCGCGC GAACGAATTC CCCCTTCACG CCCTGACCCA GCGGCCCATG GCGATGTACC ACTCCTGGGG CTCGCAAAAC GCCTGGCTGC GCCAGATCCA CGGGGTCAAC CCGCTCTACG TGCCCTCCGC GGTCTGGGAG GAACACGGGT TTTCCGACGG CGACTGGGCC ACGCTCATCT CACGCCACGG GGAGATCACC ATCCCCGTGG CCCTGATGGC CGCACTCAAC CCGAAAACCG TCTGGACCTG GAATGCCATC GGCAAGCGCA AGGGCGCCTG GGGCCTCGAC GCGGACGCGC CGGAGGCGAC CAGGGGCTTC CTGCTCAACC ACCTGATCTC CGAACTTCTG CCCGCCCGCG CCGACGGGAT GCGCTGGTCG AACTCCGACC CCGTGACAGG ACAGGCCGCC TGGTTCGACC TGCGCGTCAA CATCGCCAAG GCTCCGGCGC GCGCCGAGGC CGCACCCAGC TTCCCCCCGG TCCACGGCCC CGCATGA
|
Protein sequence | MEIDPMKDSV APDIPTPRFD EMRQTTCYMC ACRCGINVHL KDGKVAYIEG NRAHPVNKGV LCAKGSAGIK QHTAPSRLRA PMKRVGPRGS GQFEEITWDE ALQLAVDWLA PLRETAPEKL AFFTGRDQSQ SFTSFFAQNF GTPNYAAHGG FCSVNMAAAG IYTMGGAFWE FGAPDWEHTR MFLLFGVAED HDSNPIKIGL GKLKARGAKV VGINPIRTGY NAIADDWYGI TPGTDGLLIL SLIHCLLRAG RIDLDYLAAF TNAPVLVNED PASPEKGLLL RDEADKLLIL DRATGKPVPF DTQGVQPDLA GSYRRAGVSH RPVIQHIAEK YLSDDYAPEA VADRCGIPAA RIRTLAAELA RTAFDEAFEL DIPWTDFRGN KHQTMQGRPV SMHAMRGISA HANGFQTARA LHLLQILLGT VEVPGGFRFK PPYPKPPEAH PKPHCKVTPG APLDGPHLGF VHGPDDLCLT PEGAPARIDK AFSWDNPMSA HGLMHMVISN AHAGDPYKID TLFMYMANMA WNSSMNTAET MAMLTDTDET GAYKIPRIIY ADAYKSEMVA YADLILPDTT YLERYDCISL LDRPICEPDA VADAIRWPVV EPDRDVRGFQ SVLVDLGARL GLPGFVDAEG APLYADYADY MVKHERRPGV GPLAGFRDTG EAAGRGAPNP DQIQRYIDNG GFWEAEVPKE ARYYKPWNTA YQDWAVEMAF YDAPQPYIFN LWSEPLRRFQ LAAEGHGERQ PPDHLRAQIR EKLSPLPIWY ESTDARANEF PLHALTQRPM AMYHSWGSQN AWLRQIHGVN PLYVPSAVWE EHGFSDGDWA TLISRHGEIT IPVALMAALN PKTVWTWNAI GKRKGAWGLD ADAPEATRGF LLNHLISELL PARADGMRWS NSDPVTGQAA WFDLRVNIAK APARAEAAPS FPPVHGPA
|
| |