Gene Moth_1606 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1606 
Symbol 
ID3832219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1641469 
End bp1643094 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content57% 
IMG OID637829535 
Product4Fe-4S ferredoxin, iron-sulfur binding 
Protein accessionYP_430455 
Protein GI83590446 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCTA TTCCTAAACC TGAGGAATTA GTAAAGATAA ATTACCGGCC TCCCCGGACG 
GGGTGGATGG ATACTCCGGT GCAATTCCGG CGGGGCAACT ACCTATACGC CGCTAAACCC
AAGAGCCTGG AAGTCGTCGG CCTGCCCAAT CCCAGGGAAT GGTCGCCGGA AGACGAGGAT
TGGAAACTAC CGGAAAACTG GCAGGAGATT ATCCTGGAAG GTCTGCGGGA ACGCCTGGGA
CGCTTCCGCT CGCTGCAGGT TTTCATGGAT ATCTGCGTCC GCTGTGGCGC CTGTGCCGAT
AAATGCCATT TCTTCATCGG CACCGGCGAT CCCAAGAATA TGCCTGTCCT GCGGGCCGAG
CTCCTGCGCT CGGTATACCG TCGCGACTTT ACTACCGCCG GTAAGCTCCT GGGAAGACTC
GTCGGCGCCA GGGATTTAAC GGTCGATGTC CTGAAGGAAT GGTTCTACTA CTTTTTTCAG
TGCACCGAGT GCCGCCGCTG CTCCCTCTTC TGCCCCTACG GCATTGATAC GGCGGAAATC
ACCATGATCG GCCGGGAACT CCTCAACCTG GTCGGGTGCA ATATCGACTG GATTGCTTCT
CCGGTGGCCA ACTGCTACCG CACCGGGAAC CACGTCGGCA TCGAACCCCA CGCCTTCAAG
GATATGGTGG AGTTCTGTGT CGACGAAATC GAAAACATAA CCGGCATCAG GGTGGAACCT
ACCTTCAACC GCAAGGGGGC GGAGGTGCTC TTTATCGCCC CTTCCGGCGA CGTTTTCGCT
GACCCCGGGA CCTACACCCT CATGGGCTAT CTTATGCTCT TCCACGAGAT CGGCCTGGAT
TACACCTGGA GTACCTACGC CTCCGAGGGC GGCAACTTTG GTATGTTCAC CTCCCACGAA
ATGATGAAGA GGCTCAACGC CAAGATGTAC GCCGAGGCCA AACGCCTGGG GGTGAAGTGG
ATCCTTGGGG GCGAGTGCGG CCACATGTGG CGGGTCATTA ACCAGTATAT GGATACCATG
AACGGCCCGG CCGATTTCCT GGAAGTGCCC GTTTCCCCCA TCACCGGCAC GAGGTTTGAG
AACGCCAAAT CAACCAAGAT GGTCCATATC ACCGAATTTA CGGCGGACTT GATCAAGCAC
AATAAGCTAA AACTGGACCC CAGCCGCAAC GATAACCTGC GGGTTACCTT CCATGACTCC
TGCAACCCGG CGCGATCCAT GGGGCTTTTT GAGGAACCGC GTTACATCAT CAAGCATGTC
TGCAATAATT TCTTCGAGAT GCCCGAGAAC ACCATCAGGG AAAAGACTTT TTGCTGTGGC
AGCGGTGCCG GCCTTAACGC TGATGAATAT ATGGAGATGC GGATGCGGGG CGGCCTGCCC
CGGGCCAATG CAGTAAAGTA TGTTCACGAA AAATACGGCG TTAATATGCT GGCCTGCATC
TGTGCCGTGG ACCGGGCCGT CTTCCCGGCC TTGATGGAGT ACTGGGTACC CGGGGTTGGA
GTCACCGGCG TCCATGAGCT GGTGGGCAAT GCCCTGGTAA TGAAGGGTGA AAAAGAGAGA
ACGACTAACC TGCGGGGTGA ACCCTTGCCC GGCAAAGAAG GGGCGGTAGA TGGCGATGTA
TCGTGA
 
Protein sequence
MAAIPKPEEL VKINYRPPRT GWMDTPVQFR RGNYLYAAKP KSLEVVGLPN PREWSPEDED 
WKLPENWQEI ILEGLRERLG RFRSLQVFMD ICVRCGACAD KCHFFIGTGD PKNMPVLRAE
LLRSVYRRDF TTAGKLLGRL VGARDLTVDV LKEWFYYFFQ CTECRRCSLF CPYGIDTAEI
TMIGRELLNL VGCNIDWIAS PVANCYRTGN HVGIEPHAFK DMVEFCVDEI ENITGIRVEP
TFNRKGAEVL FIAPSGDVFA DPGTYTLMGY LMLFHEIGLD YTWSTYASEG GNFGMFTSHE
MMKRLNAKMY AEAKRLGVKW ILGGECGHMW RVINQYMDTM NGPADFLEVP VSPITGTRFE
NAKSTKMVHI TEFTADLIKH NKLKLDPSRN DNLRVTFHDS CNPARSMGLF EEPRYIIKHV
CNNFFEMPEN TIREKTFCCG SGAGLNADEY MEMRMRGGLP RANAVKYVHE KYGVNMLACI
CAVDRAVFPA LMEYWVPGVG VTGVHELVGN ALVMKGEKER TTNLRGEPLP GKEGAVDGDV
S