Gene Moth_1633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1633 
Symbol 
ID3831262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1668041 
End bp1669420 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content55% 
IMG OID637829558 
Producthypothetical protein 
Protein accessionYP_430478 
Protein GI83590469 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAAAG TCTTTGACGA TTACCGGGTG CTCTCAGATG AGACCCTGAA ACCAGACGCG 
CCACGGGTGG AAAAATTCCT CCAGGCCATG GGGAAATCCC TCCGGCAGCG GGATAACTGG
ACCTTTTGGC TCCCCTATAC CCTATCCCTG GATACCTGCA TGAAGTGCGG TACCTGCGCC
GAGGTCTGCC CGGTCTACCT GGCCAGCGGC CGCAAAGACA TCTACCATCC GGTCTACCGT
TCCGACATGC TGCGTAAGGT TTATAAACGG TATTTTACCC TGGCAGGAAG GTTTTTTCCG
GGCCTGGTGG GAGCGGAAGA CCTGACGGAA GATAAACTCA ATGCCATGGC CGAGAATATT
TACCGGTGTA CCATTTGCCG CCGCTGTGCC TATGTTTGCC CGGTAGCCAT TGATAACGGC
TTGATTGCCC GGGAAGCACG GAAAATCTTC GACGCCATCG ATATCGCCCC CGACGAGCTG
AAGAAAAACG GCACCCGGAA ACAGGTCCGG CTGGGTAACG CCACCGGTAT GCCGGCCAAC
GCCTTTTTTG ACATGATTGA GTTCCTGGAG GAAGAGATTG AGGATACACG GGGATATAAA
ATTAAAATAC CGGTTGATAA GCAGGGCGCC GAGTACCTCC TCATGCATAA CGCCGGCGAC
TACCTGGCCT TTGCCGAGAC GGTAATGGGC GCCGCCGAAG TCATGAACGC CGCCGGTGTC
GACTGGACCC TCAATTCCCC GGAAACGGGC CTCAACGATG CCGTCAATTA CGGCGTCTTT
TACAGCGATA CCGAGTTCGC CAGTGTTGCC AGGGCCCATA TCGAAACTGC CAAAAAGTTA
GGGATTAAGA CTTTCGTCGT AGGCGAGTGC GGTCATGCCT TCGAGGCGCT GAAGTACCTG
ATCTTGCGCC TCGTCCCCCC GGAAGAAAGG CCTTTTGAGG TCAAGAGCAT CCTGGAACTG
GAGGATCAAT GGATCCGGGA AGGGCGGATT AAGGTCGACC CCCAGAAGAA CCCTGAACCT
GTGACCTACC ATGATTCCTG CAAGCTGGGC CGCCTGGGAG GGCTCTATGA GGAGCCGCGG
CGCATCCTCA AAGCCTGCTG CACTGATTTT CGCGAAATGA CGCCCAACCG GGAAATGAGT
ATTTGCTGCG GCGGTGGCAG CGGTTTTGCC ATTATGGATA AGGGCGACTT CCTTAAATTC
CGCATGGAAA CCTACGGTAA GCTCAAAGCC GAGCAGCTAA AAGCCACCGG CGCCAGCATT
GTAGCCCTGG CCTGCTCCAA TTGTAAGGGC CAGTTCCGGG AGATTATCAA CTACTATAAG
CTGCCGGTAC GTTTCATGGG TGTCAGTGAG CTGGTGGCTA ATGCCCTGGT GTACAATTAA
 
Protein sequence
MRKVFDDYRV LSDETLKPDA PRVEKFLQAM GKSLRQRDNW TFWLPYTLSL DTCMKCGTCA 
EVCPVYLASG RKDIYHPVYR SDMLRKVYKR YFTLAGRFFP GLVGAEDLTE DKLNAMAENI
YRCTICRRCA YVCPVAIDNG LIAREARKIF DAIDIAPDEL KKNGTRKQVR LGNATGMPAN
AFFDMIEFLE EEIEDTRGYK IKIPVDKQGA EYLLMHNAGD YLAFAETVMG AAEVMNAAGV
DWTLNSPETG LNDAVNYGVF YSDTEFASVA RAHIETAKKL GIKTFVVGEC GHAFEALKYL
ILRLVPPEER PFEVKSILEL EDQWIREGRI KVDPQKNPEP VTYHDSCKLG RLGGLYEEPR
RILKACCTDF REMTPNREMS ICCGGGSGFA IMDKGDFLKF RMETYGKLKA EQLKATGASI
VALACSNCKG QFREIINYYK LPVRFMGVSE LVANALVYN