Gene Moth_1949 details


Gene Information

Locus tag: Moth_1949
Symbol:
ID: 3832299
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2023572
End bp: 2026622
Gene Length: 3051 bp
Protein Length: 1016 aa
Translation table: 11
GC content: 59%
IMG OID: 637829880
Product: putative selenate reductase subunit YgfK
Protein accession: YP_430790
Protein GI: 83590781
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases
TIGRFAM ID: [TIGR03315] putative selenate reductase, YgfK subunit
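
The start, end, gene length and protein length listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only; it assumes the coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Neither assumption is stated in the record itself, and the variable names are made up for this example.

```python
# Values copied from the Gene Information table above.
start_bp = 2023572
end_bp = 2026622

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not part of the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 3051 0 1016
```

Under these assumptions the computed values agree with the listed Gene Length (3051 bp) and Protein Length (1016 aa).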


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00511971
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCGGCG CTATGCGGCC GATACCCTTT AAGAAGCTCT TGGACTGGAT CCTTGAGGAG 
AACCGGAAGT TTCACAGTAT TTTTGGCCTG CCGCAGGAAA AATTTTACCG GGCACAGCCC
GGGGTTTACT GGCAATTATT TGGGGAATAT TTGGAAAACG TTATCGGACC TGCCGCCGGT
CCCCATACCC AGCTGGCTCA GAATATTGTC GCGGCCTACC TGGCCGGCGG CAGGTTTTTC
GAGTTAAAGA CGGTTCAGGT ACTCGACCGG CTGGACATCC CGAAGCCCTG CATCAACGCG
GCGGATGAAG GCTACAACGT GGAGTGGTCA ACCGAACTGG CCATTGAGGA AGCCTTTGAG
GAGTATGTCA AGGCCTGGTT CCTGCTCCAC GTTTTACAAA AGGAACTATG GGGCACGGAC
AGGCGCGGCT TCATGTTCAA TATGAGTGTC GGCTACGATT TAAAGGGGAT CAAATCCCCC
AAAGTCGACC GGTTCATCGA GAGCCTGAAG GACGCCTCGA AGACGGCCAT CTTTCAGGAG
TGCCGGGCGG TTTTGCAGGC CGAGGTGGAC CGGTTTACAG CGGTAGATGC TGAATTTATC
GATGGCATCT CCCCCCATAT CTGCAGTTCC GTCACCCTTT CAACCATGCA CGGCTGCCCG
CCGGCGGAGA TCGAAACCAT CTGCCGTTAC CTCCTGGCGG AGAAAAGGCT GCATACCTTT
GTCAAGCTGA ATCCCACCCT TCTGGGCTAT AAATTTGTTA AAGATACCCT GGCCGGCATG
GGATATAGCT ATGTCCAGTT AAAAGAAGAA TCCTTCAGCC ACGACCTGCA GTACACCGAC
GGTGTAGCCC TGATCGGGCG GCTACAGGAG TTCGCCCGGG AACAGGGTCG GGGCTTCGGC
GTCAAACTAT CCAACACCCT GCCGGTGCAG GTAACCAGGG GCGAACTCCC CGGGGAGGAG
ATGTACCTGT CAGGCCGGGC CCTGTACCCC CTGACCTTAA ACCTTGCCGC CAGGCTGGCA
CAGGAGTTTA ACGGCCACCT GAGAATCTCC TATGCCGGCG GTGGCGACGC CTTTAACCTC
CCCCGCCTCT TCGCGACGGG GATCTGGCCC CTAACGGTGG CTACGACCCT TTTAAAGCCG
GGGGGTTACC TGCGGTTGCA GCAAATAGCC GCAGAACTGG CAACCCGGAT GCCTGACACT
GCCGGCGAGG TTATCGATGT AGCACAACTG GCCGGCCTGG CGGCGGGCGT CACCCGGGAC
CCCGACTTCC GCAAGGAGAA ACGAGGGGTC GCCAGCCGCA AGCTCACCAG GAAGTTACCC
CTGACTGATT GCTTTTTGGC GCCGTGTACC GCCGGCTGTC CCATCGGCCA GGACATACCG
GAGTATATCC GGCTGGTGGG CGAAAAGAGG TACCGCGAGG CCTACGAACT TATTATCGAG
AAAAACCCGC TGCCTTTTAT CACCGGCAGC ATCTGTACCC AGCACTGTGC CGCTAAATGC
ACGCGCCTGG ATTACGACGA ACCGGTGCGC ATCCGGGAAA TGAAGAAGGA GGCGGCCGTA
AAAGGTTACC GGGCTTCCCG GCCCCGGTGC GGGCCGGCGC AAGGAAAGGC TTCCGCCCGG
GTGGCCGTAA TCGGGGCGGG GCCTGCGGGT CTGGCTGCCG GCTACTTCCT GGCCAGGGCC
GGCCTGGGGG TCACCATTTT TGATAAAAAG GGAAAACCTG GTGGTACAGT GACCCATGTG
ATCCCCGATT TCCGTCTTTC TGAGGACGCC ATTGCCAGAG ACCTGGAGCT GGTTAAGGGA
ACTGGTGTCG AGTTTAAACT GGGCGTCAGC CCCGACTTTA ACGTCGCGGA GTTAAAAAGG
GCCGGTTACA AATACGTCTT CCTGGCCCCC GGGGCGGGGG CGTCCAGGCC CCTGGAACTT
AGAACCGGCG GCGAAAGGGT CATGGGCGCC GTGGAATTCC TGGCCAAGTT TAAAGAAGAC
AGGCAGAAGG TCCGCCTGGG TAAAAGGGTG GCCGTCATCG GCGGCGGCAA CACGGCCATG
GATGCCGCTA GGGCGGCCCT GCGGGTCCCA GGTGTCGAGA AAGTTACTAT TATCTACCGC
CGTACCAGGG AGTATATGCC GGCGAGCAGG GAAGAACTCC GGGAGGCCCT GGCCGAGGGC
GTAGTCCTCA AAGAGCTCCT CGCCCCTTAC TCCTGGTCTG AAGGTGTCCT CCGTTGCCAG
CAGATGGAAC TTGAGGCGCC GGATGCCTCG GGACGGCCGG GAGTTGCCGT TAAAGCAGGG
GAGCTTGTGG ATATTCCTGC CGACGCCGTC TTAGCGGCCA TCGGCCAGGA TGTGGACTAC
GGTCTCCTGG AGAAAAACGG TATCGCCATC GACGAAGGGG GGAGGATTGT CGTTGACCCC
GCCACCAACG AGACCAGTGT GGCCAATGTC TTTATCGGCG GCGACGCCCT GCGCGGACCG
GCGACAATAG TTGAGGCCAT TGCCGATGGC CGTAAAGCGG CCAGGGCGAT TCTCACCCGG
GAAGGCCTGA CGCCACCTGT TCCCGGCGCG GTGCCCTTTG ACCGGGAGTG GAGGCTCCGG
GAGGTTAACC AGAAAAAAGG GAACCTGGCC GGGGCGGCAG GAGACCCTGG ACTGGAGCCG
CAGCGCTGCC TGGAGTGCGG TTTTGTCTGT AATATCTGCA CGGAGGTATG TCCCAACAGG
GCCAACATTG CCATCCAAAC ACGTAATGGT GGCTTCCGGG ATCAAAACCA GATAGTGCAT
GTAGATGGTA TGTGCAACGA ATGCGGCAAC TGCGCCACCT TCTGCCCTTA TGACGGCGCA
CCCTATAGGG ATAAATTAAC CCTCTTCTGG AAGGAAGAAG ACTTTGCCGG CAGCCAGAAC
AACGGTTTCC TGCTGTTAGC GGGCGGCGCG GAACTTGTCT TTAAAGTCCG CCTCAACGGC
CGGGTGCAGG AGGTAAAATT CGACCCTGCC GGCAAGGCTA ATGTGGACCT GGAGCAAGGG
GTATTAGACC TTATCCTGGC AGTATATAAG GGCTACAGGT ATCTGTTTTA A
 
Protein sequence
MSGAMRPIPF KKLLDWILEE NRKFHSIFGL PQEKFYRAQP GVYWQLFGEY LENVIGPAAG 
PHTQLAQNIV AAYLAGGRFF ELKTVQVLDR LDIPKPCINA ADEGYNVEWS TELAIEEAFE
EYVKAWFLLH VLQKELWGTD RRGFMFNMSV GYDLKGIKSP KVDRFIESLK DASKTAIFQE
CRAVLQAEVD RFTAVDAEFI DGISPHICSS VTLSTMHGCP PAEIETICRY LLAEKRLHTF
VKLNPTLLGY KFVKDTLAGM GYSYVQLKEE SFSHDLQYTD GVALIGRLQE FAREQGRGFG
VKLSNTLPVQ VTRGELPGEE MYLSGRALYP LTLNLAARLA QEFNGHLRIS YAGGGDAFNL
PRLFATGIWP LTVATTLLKP GGYLRLQQIA AELATRMPDT AGEVIDVAQL AGLAAGVTRD
PDFRKEKRGV ASRKLTRKLP LTDCFLAPCT AGCPIGQDIP EYIRLVGEKR YREAYELIIE
KNPLPFITGS ICTQHCAAKC TRLDYDEPVR IREMKKEAAV KGYRASRPRC GPAQGKASAR
VAVIGAGPAG LAAGYFLARA GLGVTIFDKK GKPGGTVTHV IPDFRLSEDA IARDLELVKG
TGVEFKLGVS PDFNVAELKR AGYKYVFLAP GAGASRPLEL RTGGERVMGA VEFLAKFKED
RQKVRLGKRV AVIGGGNTAM DAARAALRVP GVEKVTIIYR RTREYMPASR EELREALAEG
VVLKELLAPY SWSEGVLRCQ QMELEAPDAS GRPGVAVKAG ELVDIPADAV LAAIGQDVDY
GLLEKNGIAI DEGGRIVVDP ATNETSVANV FIGGDALRGP ATIVEAIADG RKAARAILTR
EGLTPPVPGA VPFDREWRLR EVNQKKGNLA GAAGDPGLEP QRCLECGFVC NICTEVCPNR
ANIAIQTRNG GFRDQNQIVH VDGMCNECGN CATFCPYDGA PYRDKLTLFW KEEDFAGSQN
NGFLLLAGGA ELVFKVRLNG RVQEVKFDPA GKANVDLEQG VLDLILAVYK GYRYLF
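
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way to strip that layout, compute a GC fraction, and translate the nucleotide sequence for comparison with the protein. It is a minimal illustration, not the tool that generated this record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is written out in the NCBI TCAG ordering; translation table 11 uses the same amino acid assignments as the standard code and differs in which start codons are permitted, which this sketch does not model.

```python
from itertools import product

# Codons in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from the first base; stop codons appear as '*'."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence above, pasted with its display spacing.
first_line = "ATGAGCGGCG CTATGCGGCC GATACCCTTT AAGAAGCTCT TGGACTGGAT CCTTGAGGAG"
print(translate(first_line))  # MSGAMRPIPFKKLLDWILEE
```

The printed residues match the first two blocks of the protein sequence (`MSGAMRPIPF KKLLDWILEE`). Passing the full gene sequence to `translate` would additionally yield a trailing `*` for the final `TAA` stop codon, and `gc_fraction` on the full sequence can be compared against the listed GC content.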