Gene Moth_2307 details

Gene Information

Locus tag: Moth_2307
Symbol:
ID: 3831421
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2425495
End bp: 2426766
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 59%
IMG OID: 637830231
Product: hypothetical protein
Protein accession: YP_431137
Protein GI: 83591128
COG category: [C] Energy production and conversion
COG ID: [COG0247] Fe-S oxidoreductase
TIGRFAM ID:
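
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the reported 1272 bp). The variable names are illustrative and not part of the record.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 2425495
end_bp = 2426766
gene_length_bp = 1272
protein_length_aa = 423

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons; the final codon is the stop
# codon and does not contribute an amino acid.
codons, remainder = divmod(span, 3)

print(span == gene_length_bp)           # coordinate span matches Gene Length
print(remainder == 0)                   # length is a multiple of 3
print(codons - 1 == protein_length_aa)  # codons minus stop matches Protein Length
```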


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCTATT TAGATGACCT GGCCCTGGTT GGCGAAGAAA TGGCCAAGTG CATGAAGTGC 
GGCAACTGCC AGGCAGTCTG CCCTCTGTAT AAAGAGATCC TGCGGGAAGC GGCTGTGGCC
CGTGGTAAGG TCCAGCTGGC CTACGCCTAC CTGCGGGGAG AAATTCCGGC CACCCATTCC
CTAGCTGAAA AGTTTCTCTT TTGCCTCACT TGTATGGCCT GCGTGGCCAA CTGCCCCAGC
GGCGTCCGGG TGGACAAGGT AGTACTGGCG GCGCGGGCGG CCATGGTGCG GGAAAAAGGG
CTACCCTGGT TAAAAAAGGT TATCTTCCAG GGTTTAAAGC GGTCCTGGCT CTTCGATACG
GCTTTGAAGA CCGGTCGTCA TTTCCAGTCC TTCGCCCTGC GTTACCGGCC GGAAATCCAG
CGCTGTAACC CCCGCTTTCC TATAGGACTG GAACTGAGAC GGGTCCTGCC GCCCCTGGCC
AGGCGCACAT TGAGAGAGGA ATTCCCCGAG GTTATCCGCC CGGTGCAGCC CCGGGCCAGG
GCAACCTTTT TCACAGGTTG CCTGGAGAAC TATATCTACA CTGATATCGG CCGAGCAGTG
GTCAACGTCC TCCTGGCCAA CGATATCGAA GTTATAATCC CCCGGGACCA ACACTGCTGC
GGTATACCCA CCCTGGTCCA CGGCGACGTC CTTACGGCCA GGGAGATGGC TGCTTCCAAC
CTGCATATTT TTCAGGGATA TAACTTTGAC TACCTGGTCA CTGCCTGCGC TACCTGTGGC
GAGGCCTGGA AGCAATACTA TCCCAGCCTC ATGGATAACG GCCCCCTGGG GGAGGTTGCC
GGGCAGATGG CCAGAAAGGC CCGGGACATC AATGAATTCC TGGTGGAGAT TGATTACCGC
GTGCCGACCG GCAGTCTGCC TTTAAAGGTG ACTTACCACG ATCCCTGCCA CCTGGTGCGG
GGCCAGGGGA TTAGCCGCCA GCCGCGGCAA ATCCTGGGTT CCATCTCGGG GGTGGAAGTG
GTGGAAATGA AGAATGCCGA TCGCTGCTGC GGCAGTGCCG GTTCCTTCAG CCTGACTAAT
TACGATGTTT CCATGCAGGT CCAGGCCCAC AAAGTGGCAG CTATCAAGGC CACCGGCGCC
GGTACGGTGG TCACCGCCTG CCCGGCCTGC CGCATGCAAC TGGAAGACGG CCTGGCCCAG
GCCGGCCTGT CCCCCAGTGT CTTCCACGTA GTCCAACTTC TAGAACAGGC TTACGGTATT
ACCGCCAATT AG
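
The GC content field in the gene information can be recomputed from the gene sequence above. The following is a minimal sketch; the function name `gc_percent` is hypothetical, and it assumes the sequence contains only A, C, G, and T plus the block spacing used in this record.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence, with its 10-base block spacing preserved.
first_line = "GTGAGCTATT TAGATGACCT GGCCCTGGTT GGCGAAGAAA TGGCCAAGTG CATGAAGTGC"
print(round(gc_percent(first_line), 1))
```

Passing the full 1272-base sequence instead of the first line gives the whole-gene figure to compare against the reported value.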
 
Protein sequence
MSYLDDLALV GEEMAKCMKC GNCQAVCPLY KEILREAAVA RGKVQLAYAY LRGEIPATHS 
LAEKFLFCLT CMACVANCPS GVRVDKVVLA ARAAMVREKG LPWLKKVIFQ GLKRSWLFDT
ALKTGRHFQS FALRYRPEIQ RCNPRFPIGL ELRRVLPPLA RRTLREEFPE VIRPVQPRAR
ATFFTGCLEN YIYTDIGRAV VNVLLANDIE VIIPRDQHCC GIPTLVHGDV LTAREMAASN
LHIFQGYNFD YLVTACATCG EAWKQYYPSL MDNGPLGEVA GQMARKARDI NEFLVEIDYR
VPTGSLPLKV TYHDPCHLVR GQGISRQPRQ ILGSISGVEV VEMKNADRCC GSAGSFSLTN
YDVSMQVQAH KVAAIKATGA GTVVTACPAC RMQLEDGLAQ AGLSPSVFHV VQLLEQAYGI
TAN
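
The protein sequence corresponds to a translation of the gene sequence under translation table 11, where the initial GTG codon is read as methionine. The sketch below illustrates this on short fragments of the gene. The helper `translate_cds` is hypothetical, and `START_CODONS` lists only three common bacterial initiators as a simplifying assumption (table 11 permits others).

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (first base T, C, A, G, then
# second base, then third). Tables 1 and 11 share these assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a subset of the table 11 start codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a leading start codon as M and stopping at '*'."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for index in range(0, len(dna), 3):
        codon = dna[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"  # alternative start codons still initiate with Met
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence and its final 12 bases.
print(translate_cds("GTGAGCTATT TAGATGACCT GGCCCTGGTT"))  # MSYLDDLALV
print(translate_cds("ACCGCCAATT AG"))                     # TAN
```

The two fragments reproduce the first ten and last three residues of the protein sequence listed above.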