Gene Moth_2096 details


Gene Information

Locus tag: Moth_2096
Symbol:
ID: 3832462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2186430
End bp: 2187434
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 59%
IMG OID: 637830021
Product: cytochrome d ubiquinol oxidase, subunit II
Protein accession: YP_430931
Protein GI: 83590922
COG category: [C] Energy production and conversion
COG ID: [COG1294] Cytochrome bd-type quinol oxidase, subunit 2
TIGRFAM ID: [TIGR00203] cytochrome d oxidase, subunit II (cydB)
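
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the gene record.
start_bp = 2186430
end_bp = 2187434
gene_length_bp = 1005
protein_length_aa = 334

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated,
# so the protein has one residue fewer than the number of codons.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions both checks agree with the listed gene length and protein length.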


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCTGA ACATCCTATG GTTCATCCTG GTTACCGTTC TCTTTACCGG TTTCTTTTTC 
CTGGAAGGTT TCGACTATGG CGTCGGGATC CTGCTGCCTT TCGTGGGGAG AAACGACCTT
GAGCGCCGAA TGGTTATTAA TAGCATTGGC CCCTTCTGGG ACGGCAACGA GGTATGGATG
CTCACCGCCG GCGGGGCCAT GTTTGCCGCC TTCCCCCACT GGTACGCCAC CCTTTTCAGC
GGTTTCTACC TGGCCCTGTT CTTGATCCTG GTGGCCCTGA TCCTGCGCGG TGTGGCCTTC
GAGTTCCGCA GCAAGGACGA AAAACCTGCC TGGCGCAACC TCTGGGACTG GTTGCTCTTC
GTGGGGAGTC TACTGCCGGC CCTCCTCTGG GGCGTAGCGA TTACCAACCT CATCCGGGGT
GTGCCTATTG ACGCCAGGAT GCAATTTGCA GGAACCTTTT TCGATCTCCT CTCGCCCTAC
ACCCTGCTGG GGGGGCTGGC CTTCCTCCTG GTCTTTACCC TGCAAGGAGG TCTTTTCCTG
GCCCTGAAAA GTGAAGGTGA ACTAAAGGAG CGTTCCCGGC AGGCGGCCCT GAGGGCTGGA
GCCGGTGCGG CCCTGGCCCT CCTCCTGCTG GTAATTATGA GTTACGGGGT CACCGATATT
TTCAGCCGGT TCCTCCCCGG GATCCTTCTC GGGAGTGCTT TCATTACCTT GCTCCTCTCC
CTGGCCGGTC TTTATACCCG GCGCTACGGC CCGGGCTTCC TGATGAACGG CCTGACCGTA
ATCCTGGTCA CGGCGGGATT TTTCAGCGGG CTCTTCCCGC GGGTGATGGT CTCCAGCCTG
AATCCGGAAT GGAGCATCAC CATTTACCGG GCCGCCTCCA GCCCCTATAC CCTTAAAGTT
ATGACCGTCG TCGCCCTGAC TCTGGTACCC GTAGTCCTGG CCTACCAGGG GTGGACTTAC
TGGGTCTTCC GCCAGCGCGT TAAAGCCAGG GACTTGGAGT ATTAG
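
The gene sequence is printed in blocks of ten bases, so the whitespace has to be removed before its length or GC content can be computed. The following is a minimal sketch of that step, not the method the database used to derive its GC figure; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied as displayed (ten-base blocks).
first_row = "ATGGATCTGA ACATCCTATG GTTCATCCTG GTTACCGTTC TCTTTACCGG TTTCTTTTTC"

seq = clean_sequence(first_row)
print("bases in first row:", len(seq))
print("GC percent of first row:", round(gc_percent(seq), 1))
```

Passing the full sequence text to the same two functions gives the length and GC percentage of the whole gene, which can then be compared with the values listed under Gene Information.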
 
Protein sequence
MDLNILWFIL VTVLFTGFFF LEGFDYGVGI LLPFVGRNDL ERRMVINSIG PFWDGNEVWM 
LTAGGAMFAA FPHWYATLFS GFYLALFLIL VALILRGVAF EFRSKDEKPA WRNLWDWLLF
VGSLLPALLW GVAITNLIRG VPIDARMQFA GTFFDLLSPY TLLGGLAFLL VFTLQGGLFL
ALKSEGELKE RSRQAALRAG AGAALALLLL VIMSYGVTDI FSRFLPGILL GSAFITLLLS
LAGLYTRRYG PGFLMNGLTV ILVTAGFFSG LFPRVMVSSL NPEWSITIYR AASSPYTLKV
MTVVALTLVP VVLAYQGWTY WVFRQRVKAR DLEY
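
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below illustrates this using only the Python standard library. It builds the codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons). The function name `translate` is hypothetical, and only the first row of the gene sequence is used as input.

```python
from itertools import product

# Amino-acid assignments for codons enumerated in T, C, A, G order.
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, copied as displayed.
first_row = "ATGGATCTGA ACATCCTATG GTTCATCCTG GTTACCGTTC TCTTTACCGG TTTCTTTTTC"
print(translate(first_row))
```

The output for the first row can be compared with the first twenty residues of the protein sequence listed above.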