Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SYO3AOP1_1175 |
Symbol | |
ID | 6331983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfurihydrogenibium sp. YO3AOP1 |
Kingdom | Bacteria |
Replicon accession | NC_010730 |
Strand | + |
Start bp | 1221901 |
End bp | 1224789 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642657457 |
Product | molydopterin dinucleotide-binding region |
Protein accession | YP_001931341 |
Protein GI | 188997090 |
COG category | [C] Energy production and conversion |
COG ID | [COG5013] Nitrate reductase alpha subunit |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR03479] DMSO reductase family type II enzyme, molybdopterin subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000000509072 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGATCT CAAGAAGGGA CTTTTTAAAA GTAATGTCAG CAACAGGTGG AGCTTTAGGT CTTGGGTCAA GCGAGGCTTT TGCTAAAGCT AAAGCTGTTG TAGTAGATGA CCCAAGAGCA TCTTATCCAA ACGGCTCTTT TGTTGAAAAT ATGTACAGAA GAGAGTTTGC ATATACTTAT GGAAAGAAAG AAGAGCATGG TACTGCTTAT CACTGCGTAA ACTGTCAAGG AAACTGTGCT TGGGATGTAT GGGTTCAAAA CGGTATAGTT ACAAGAGAAA ACCAAGCTGC TAACTATCCA CAAATCAACC CAAAAATTCC TGATGCTAAC CCAAGGGGAT GTAACAAAGG CGTTCAACAC TCTCAAGTAA TGTATGAAAA AGACAGAATT CTCTATCCAA TGAAAAGAGT CGGAGCGAGA GGAGAAGGTA AGTGGAAAAG AATATCTTGG GATGAAGCAA TAACAGAAGT TGCAACAAGA ATTTATGAAA CTATGCTTAC AAAAGGACCA GCTGGAAACT ACATCCACGT TGGTGCTGGT ATGCTTACAG AGGCAAGAGC AGCTTCTGGT AAAAGACTTG GAACACTTCT TGGTGCTGTT AGACCTTACA TAGCTTCTTA CGTTGGTGAT ATGTTCCCTG GCGTTTCCTT AGTATACGGC GAAGGAAACA TTGGATTTTC TTACGATTTT GTATACACTG CAAACGTTCA AATCTGGTGG GGTATGGACC CTAACAAAAC AAGAATTCCG GATGCTCACT GGGTATGGGA AGGAAAATAC AACGGTGGAA AAGTAATCGT TATCACTCCA GACTTTAACG CTACTGCAAA AGGAGCTGAT TTATGGGTGC CGGTTAGAGC CGGATACGAT GGTTTCTTAG CAATGTCTAT CATCAACGAA ATTATTCAAC AAAAGCTTTA CAAACCAAAC TTTATAAAAG TGTTCACAGA TCTACCATTC TTAGTAAGAT TAGATAACAA AAAACTATTA AGACTTTCTG ATATTGACAC AAACGACCCA ATGTTTGATA AAGAAGTCTT CCATGTTATG GAAGAAAAAG CACATGGAAA AGAATTTGAA GCAGATGCTG TTTTCTTAGC TTATAACCTT AAAAACGGTA AATTTACAAT CATGCCGGGT TCTGAAGGAA ACCCAGTTAA AACATTGAGA CTAAAAGATT TAGGTTGGGA TATTGACCCA GCGTTAGAAG GTGTATATGA AGTTAAGTTA AAAGACGGTT CAAAGGTAAA AGTTACACCT GTATTTGAAT TGGTTAAAGC AGAAGCAGCT AAATTCCCGG CTGAGAAAAC ATTTAAATTA ACAAACGTTC ATCCAAAAAT CGTTCAACAG CTTGCAAGAG ATATAGCTCT TCCAAAAGTA GCGTTTATAT CTATGGGATT TACAATCGGT AAATACTTTA ATGGTATGTT AACCCAAAGG GCTATTGCTT CTATTACTCC TTTATGTGGA AGGCTTGGAC CTTATGGTGG ATTTAACTCG GAAAATGAGT GGTCTATATC TGGTTTAGCT AAGATCTCTG GTTTTGCAGG AAAGTATAAA GAAAGATTCG CATCCGGTTT TGTAAGCGAA TTTGTTCTTG GAAATATGAT GCAGGACTTT GATAAGCTCT ATGAAGAGGA TACATTCAAA GAATCTATGG GAATGTCTAA GGAAGAGTAT AAAAAGCAAG TGAATGAAAT GCTTTCTAAG TCTGAAGGCG ACAAAGGAAT CGGACATGGT AAATCTTACT GGAATGATGT AGAAACGTTC TTACTTTTTG CAGATGCCAG ATTTAGAAGA AATAAAGGTT CTTCTTATAA AAAGGCATTC TTTGAAAAGG CTAAATTTAT TGCTTATGTA GATTTTAGAA TGTCTGATTT TGCAAACTAT GCAGACATTT TACTTCCTGC AAAGTCTCAC TATGAAGTTT GGGACCTAAG AACAAACCCA GGTTATCATA GATTTGCAAA CCTTGCCCAT CCTCCAGCAA ACTTAAAACC AGTTGGCGAA GCTAAGTCTG AATGGGAAAT TTGTACTATG ATTGTTGAGA AAATTCAAGA AATAGCGACT AAAAAATACA AAGAAACAGG CGACCAAAAA TATATTAAAA TTCCAGACCC ACAACTTTCT AAAACAGGAT ACAGAGACCT TGACACATTA GTTGAAGAAT ACACAATCGG TGGTCAGTTA AGAAACGACA GAGATGCTGT AGAGTTAGCA TTAGAAAATA CTGACCAGTT TAAGCCTAAT ACTATTGAAT CTATGTTCAA AAGAGGCGGT TATTTAGTAT TAAATGAAAA AGCTGGAAAA TCTTCACCAC TCTATCCAGA TAAGCCATAC AACGTATTTG AAAACAACCT TTTCTTATAT GAAAGATTTG AAACATTGTC CGGAAGAATT ACATACTACG TAGATGATGA TTTATGGATA CAGCAAGGAG CAAACGTTCC AACCGCTAAA GAACCTATCA GACCAAGAAG ATTCCCATTT GTTCTCATGA CACCACACGC AAGATGGTCA ATCCACTCAA CTTATAAGAC ATCTACTTTA CTGCTTAGAT TACAAAGAGG AAAGCCTTAT GTAATGATAA ATCCTGAGAT TGCTAAGAAG AAAGGCATCA AAGATGGTGA TGAAGTTAGA GTGTTTAACT CCCTTGGTGA GTTTTATGCT ATGGCAAAAG TATATCCATC ATGTCCAAAA GATGCAATTA TATTGGAGCA TGGTTGGGAA CCATTCTTCT ACAAAGGAAG AAAAGGGCAC AACGAAACAG TAGCATCTCC GCTTAACTTA CTTGAACTTT CTGATGGTTG GGGACACTTG AAGTTTGGTG GAAACTGGGA TGGAAACCAA CATGCGTATG AGACTTCAGT AGATGTTGAA AAAGCTTAA
|
Protein sequence | MAISRRDFLK VMSATGGALG LGSSEAFAKA KAVVVDDPRA SYPNGSFVEN MYRREFAYTY GKKEEHGTAY HCVNCQGNCA WDVWVQNGIV TRENQAANYP QINPKIPDAN PRGCNKGVQH SQVMYEKDRI LYPMKRVGAR GEGKWKRISW DEAITEVATR IYETMLTKGP AGNYIHVGAG MLTEARAASG KRLGTLLGAV RPYIASYVGD MFPGVSLVYG EGNIGFSYDF VYTANVQIWW GMDPNKTRIP DAHWVWEGKY NGGKVIVITP DFNATAKGAD LWVPVRAGYD GFLAMSIINE IIQQKLYKPN FIKVFTDLPF LVRLDNKKLL RLSDIDTNDP MFDKEVFHVM EEKAHGKEFE ADAVFLAYNL KNGKFTIMPG SEGNPVKTLR LKDLGWDIDP ALEGVYEVKL KDGSKVKVTP VFELVKAEAA KFPAEKTFKL TNVHPKIVQQ LARDIALPKV AFISMGFTIG KYFNGMLTQR AIASITPLCG RLGPYGGFNS ENEWSISGLA KISGFAGKYK ERFASGFVSE FVLGNMMQDF DKLYEEDTFK ESMGMSKEEY KKQVNEMLSK SEGDKGIGHG KSYWNDVETF LLFADARFRR NKGSSYKKAF FEKAKFIAYV DFRMSDFANY ADILLPAKSH YEVWDLRTNP GYHRFANLAH PPANLKPVGE AKSEWEICTM IVEKIQEIAT KKYKETGDQK YIKIPDPQLS KTGYRDLDTL VEEYTIGGQL RNDRDAVELA LENTDQFKPN TIESMFKRGG YLVLNEKAGK SSPLYPDKPY NVFENNLFLY ERFETLSGRI TYYVDDDLWI QQGANVPTAK EPIRPRRFPF VLMTPHARWS IHSTYKTSTL LLRLQRGKPY VMINPEIAKK KGIKDGDEVR VFNSLGEFYA MAKVYPSCPK DAIILEHGWE PFFYKGRKGH NETVASPLNL LELSDGWGHL KFGGNWDGNQ HAYETSVDVE KA
|
| |