Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1949 |
Symbol | |
ID | 3832299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2023572 |
End bp | 2026622 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637829880 |
Product | putative selenate reductase subunit YgfK |
Protein accession | YP_430790 |
Protein GI | 83590781 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases |
TIGRFAM ID | [TIGR03315] putative selenate reductase, YgfK subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00511971 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGGCG CTATGCGGCC GATACCCTTT AAGAAGCTCT TGGACTGGAT CCTTGAGGAG AACCGGAAGT TTCACAGTAT TTTTGGCCTG CCGCAGGAAA AATTTTACCG GGCACAGCCC GGGGTTTACT GGCAATTATT TGGGGAATAT TTGGAAAACG TTATCGGACC TGCCGCCGGT CCCCATACCC AGCTGGCTCA GAATATTGTC GCGGCCTACC TGGCCGGCGG CAGGTTTTTC GAGTTAAAGA CGGTTCAGGT ACTCGACCGG CTGGACATCC CGAAGCCCTG CATCAACGCG GCGGATGAAG GCTACAACGT GGAGTGGTCA ACCGAACTGG CCATTGAGGA AGCCTTTGAG GAGTATGTCA AGGCCTGGTT CCTGCTCCAC GTTTTACAAA AGGAACTATG GGGCACGGAC AGGCGCGGCT TCATGTTCAA TATGAGTGTC GGCTACGATT TAAAGGGGAT CAAATCCCCC AAAGTCGACC GGTTCATCGA GAGCCTGAAG GACGCCTCGA AGACGGCCAT CTTTCAGGAG TGCCGGGCGG TTTTGCAGGC CGAGGTGGAC CGGTTTACAG CGGTAGATGC TGAATTTATC GATGGCATCT CCCCCCATAT CTGCAGTTCC GTCACCCTTT CAACCATGCA CGGCTGCCCG CCGGCGGAGA TCGAAACCAT CTGCCGTTAC CTCCTGGCGG AGAAAAGGCT GCATACCTTT GTCAAGCTGA ATCCCACCCT TCTGGGCTAT AAATTTGTTA AAGATACCCT GGCCGGCATG GGATATAGCT ATGTCCAGTT AAAAGAAGAA TCCTTCAGCC ACGACCTGCA GTACACCGAC GGTGTAGCCC TGATCGGGCG GCTACAGGAG TTCGCCCGGG AACAGGGTCG GGGCTTCGGC GTCAAACTAT CCAACACCCT GCCGGTGCAG GTAACCAGGG GCGAACTCCC CGGGGAGGAG ATGTACCTGT CAGGCCGGGC CCTGTACCCC CTGACCTTAA ACCTTGCCGC CAGGCTGGCA CAGGAGTTTA ACGGCCACCT GAGAATCTCC TATGCCGGCG GTGGCGACGC CTTTAACCTC CCCCGCCTCT TCGCGACGGG GATCTGGCCC CTAACGGTGG CTACGACCCT TTTAAAGCCG GGGGGTTACC TGCGGTTGCA GCAAATAGCC GCAGAACTGG CAACCCGGAT GCCTGACACT GCCGGCGAGG TTATCGATGT AGCACAACTG GCCGGCCTGG CGGCGGGCGT CACCCGGGAC CCCGACTTCC GCAAGGAGAA ACGAGGGGTC GCCAGCCGCA AGCTCACCAG GAAGTTACCC CTGACTGATT GCTTTTTGGC GCCGTGTACC GCCGGCTGTC CCATCGGCCA GGACATACCG GAGTATATCC GGCTGGTGGG CGAAAAGAGG TACCGCGAGG CCTACGAACT TATTATCGAG AAAAACCCGC TGCCTTTTAT CACCGGCAGC ATCTGTACCC AGCACTGTGC CGCTAAATGC ACGCGCCTGG ATTACGACGA ACCGGTGCGC ATCCGGGAAA TGAAGAAGGA GGCGGCCGTA AAAGGTTACC GGGCTTCCCG GCCCCGGTGC GGGCCGGCGC AAGGAAAGGC TTCCGCCCGG GTGGCCGTAA TCGGGGCGGG GCCTGCGGGT CTGGCTGCCG GCTACTTCCT GGCCAGGGCC GGCCTGGGGG TCACCATTTT TGATAAAAAG GGAAAACCTG GTGGTACAGT GACCCATGTG ATCCCCGATT TCCGTCTTTC TGAGGACGCC ATTGCCAGAG ACCTGGAGCT GGTTAAGGGA ACTGGTGTCG AGTTTAAACT GGGCGTCAGC CCCGACTTTA ACGTCGCGGA GTTAAAAAGG GCCGGTTACA AATACGTCTT CCTGGCCCCC GGGGCGGGGG CGTCCAGGCC CCTGGAACTT AGAACCGGCG GCGAAAGGGT CATGGGCGCC GTGGAATTCC TGGCCAAGTT TAAAGAAGAC AGGCAGAAGG TCCGCCTGGG TAAAAGGGTG GCCGTCATCG GCGGCGGCAA CACGGCCATG GATGCCGCTA GGGCGGCCCT GCGGGTCCCA GGTGTCGAGA AAGTTACTAT TATCTACCGC CGTACCAGGG AGTATATGCC GGCGAGCAGG GAAGAACTCC GGGAGGCCCT GGCCGAGGGC GTAGTCCTCA AAGAGCTCCT CGCCCCTTAC TCCTGGTCTG AAGGTGTCCT CCGTTGCCAG CAGATGGAAC TTGAGGCGCC GGATGCCTCG GGACGGCCGG GAGTTGCCGT TAAAGCAGGG GAGCTTGTGG ATATTCCTGC CGACGCCGTC TTAGCGGCCA TCGGCCAGGA TGTGGACTAC GGTCTCCTGG AGAAAAACGG TATCGCCATC GACGAAGGGG GGAGGATTGT CGTTGACCCC GCCACCAACG AGACCAGTGT GGCCAATGTC TTTATCGGCG GCGACGCCCT GCGCGGACCG GCGACAATAG TTGAGGCCAT TGCCGATGGC CGTAAAGCGG CCAGGGCGAT TCTCACCCGG GAAGGCCTGA CGCCACCTGT TCCCGGCGCG GTGCCCTTTG ACCGGGAGTG GAGGCTCCGG GAGGTTAACC AGAAAAAAGG GAACCTGGCC GGGGCGGCAG GAGACCCTGG ACTGGAGCCG CAGCGCTGCC TGGAGTGCGG TTTTGTCTGT AATATCTGCA CGGAGGTATG TCCCAACAGG GCCAACATTG CCATCCAAAC ACGTAATGGT GGCTTCCGGG ATCAAAACCA GATAGTGCAT GTAGATGGTA TGTGCAACGA ATGCGGCAAC TGCGCCACCT TCTGCCCTTA TGACGGCGCA CCCTATAGGG ATAAATTAAC CCTCTTCTGG AAGGAAGAAG ACTTTGCCGG CAGCCAGAAC AACGGTTTCC TGCTGTTAGC GGGCGGCGCG GAACTTGTCT TTAAAGTCCG CCTCAACGGC CGGGTGCAGG AGGTAAAATT CGACCCTGCC GGCAAGGCTA ATGTGGACCT GGAGCAAGGG GTATTAGACC TTATCCTGGC AGTATATAAG GGCTACAGGT ATCTGTTTTA A
|
Protein sequence | MSGAMRPIPF KKLLDWILEE NRKFHSIFGL PQEKFYRAQP GVYWQLFGEY LENVIGPAAG PHTQLAQNIV AAYLAGGRFF ELKTVQVLDR LDIPKPCINA ADEGYNVEWS TELAIEEAFE EYVKAWFLLH VLQKELWGTD RRGFMFNMSV GYDLKGIKSP KVDRFIESLK DASKTAIFQE CRAVLQAEVD RFTAVDAEFI DGISPHICSS VTLSTMHGCP PAEIETICRY LLAEKRLHTF VKLNPTLLGY KFVKDTLAGM GYSYVQLKEE SFSHDLQYTD GVALIGRLQE FAREQGRGFG VKLSNTLPVQ VTRGELPGEE MYLSGRALYP LTLNLAARLA QEFNGHLRIS YAGGGDAFNL PRLFATGIWP LTVATTLLKP GGYLRLQQIA AELATRMPDT AGEVIDVAQL AGLAAGVTRD PDFRKEKRGV ASRKLTRKLP LTDCFLAPCT AGCPIGQDIP EYIRLVGEKR YREAYELIIE KNPLPFITGS ICTQHCAAKC TRLDYDEPVR IREMKKEAAV KGYRASRPRC GPAQGKASAR VAVIGAGPAG LAAGYFLARA GLGVTIFDKK GKPGGTVTHV IPDFRLSEDA IARDLELVKG TGVEFKLGVS PDFNVAELKR AGYKYVFLAP GAGASRPLEL RTGGERVMGA VEFLAKFKED RQKVRLGKRV AVIGGGNTAM DAARAALRVP GVEKVTIIYR RTREYMPASR EELREALAEG VVLKELLAPY SWSEGVLRCQ QMELEAPDAS GRPGVAVKAG ELVDIPADAV LAAIGQDVDY GLLEKNGIAI DEGGRIVVDP ATNETSVANV FIGGDALRGP ATIVEAIADG RKAARAILTR EGLTPPVPGA VPFDREWRLR EVNQKKGNLA GAAGDPGLEP QRCLECGFVC NICTEVCPNR ANIAIQTRNG GFRDQNQIVH VDGMCNECGN CATFCPYDGA PYRDKLTLFW KEEDFAGSQN NGFLLLAGGA ELVFKVRLNG RVQEVKFDPA GKANVDLEQG VLDLILAVYK GYRYLF
|
| |