Gene Moth_0507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0507 
Symbol 
ID3831809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp525513 
End bp526673 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content71% 
IMG OID637828441 
Product4Fe-4S ferredoxin, iron-sulfur binding 
Protein accessionYP_429380 
Protein GI83589371 
COG category[C] Energy production and conversion 
COG ID[COG1600] Uncharacterized Fe-S protein 
TIGRFAM ID[TIGR00276] iron-sulfur cluster binding protein, putative 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCTCCA CAAAGCTAAC AAAGGAGCTG AAGGAATGGG CCGCGGCGCG GGGCCTGGTA 
CTGGGAGTGG CCCCGGTCCG GCCCTTTGAA CGTGGCCGGC AGGCCCTGGC TTGGCTGGCC
CGCCGGGGGC TGGCCACACC CTTTGCTACC GGCCGGCCGG AAGAGCGCTA CCGTCCGGAC
CTCCTTTATC CGGCAGCCCG TTCCCTGATC GTGGTGGCCA GGCCCCACCC ACAGCCGGTC
CGGCCACCGG GCCCCGGGGA GGGGCGCATC GCCCGCTACG CCCTGGGACC GGATTACCAC
CTTGAGCTCC GGGCCAGCCT GGAAGCTCTG GCCGGAATGC TGCAGCGGGC CGGTGCCAGG
CTCACAGCCG TCCAGGTGGA CAACGGCCCC CTCCTGGAGC GAGAGGCCGC TTACCTGGCC
GGCCTGGGCT ATTACGGTGC CAGCTGCAAC CTGATTATCC CGGGCCTGGG CACGGGCTGC
GCCCTGGGAC TCCTCATCAC TGACCTGGAG CTGGAGGCGG GCCAACCCCT GACTCAGGCC
ACCTGCCGCA ACTGTGGCCG CTGCCTGGCG GCCTGCCCCA CAGGCGCCCT GGTGGCCCCC
GGCCGCTTAC ACCCGGAGCT TTGCCTCTCT TACCTGACCC AGAAACGGGG GGTGATTCCG
GTAGAATTAC GCCCGGCCCT GGGCCGGCAC ATCTGGGGGT GTGACGCCTG CCAGGAGGTC
TGCCCGGAGA ATCGGGCGGA ATTCTCCCCG GCGGACCAGG GGGCGTCCCG GGCGGGGCCG
GATGGTCAGT GGCAGGGGCC CGGCAGGGAC CTGCCAGATG GCGGTACAGG AACCGGCCTC
ATCTACCCCG AACTGGCGAC GATCATCACC ATGGATAAGG CCGGTTTTAA CCGCGTCTTC
GGTGGGACGG CCCTGGCCTG GCGGGGGAAA ACCCTCCTCC AGCGTAACGC GGCCATTGCC
CTGGGAAACC TGGGCGACGC CCGGGCCCTG GGACCTCTGC AGCAAGCCCT CCGCGCTCCG
GCGGCCGTCC TCCGGGGTCA CGCCGCCTGG GCCCTGGGCC GCCTGGGGCC GCCCGCCCGG
CCGGCCCTGG CTGTAGCCCT GGCTACCGAG ACAGATCCCT GGGTCCGCCG GGAGATCAGG
ACGGCCCTGG ACAGCTGCTG A
 
Protein sequence
MGSTKLTKEL KEWAAARGLV LGVAPVRPFE RGRQALAWLA RRGLATPFAT GRPEERYRPD 
LLYPAARSLI VVARPHPQPV RPPGPGEGRI ARYALGPDYH LELRASLEAL AGMLQRAGAR
LTAVQVDNGP LLEREAAYLA GLGYYGASCN LIIPGLGTGC ALGLLITDLE LEAGQPLTQA
TCRNCGRCLA ACPTGALVAP GRLHPELCLS YLTQKRGVIP VELRPALGRH IWGCDACQEV
CPENRAEFSP ADQGASRAGP DGQWQGPGRD LPDGGTGTGL IYPELATIIT MDKAGFNRVF
GGTALAWRGK TLLQRNAAIA LGNLGDARAL GPLQQALRAP AAVLRGHAAW ALGRLGPPAR
PALAVALATE TDPWVRREIR TALDSC