Gene Moth_2088 details


Gene Information

Locus tag: Moth_2088
Symbol:
ID: 3831838
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2180515
End bp: 2181921
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 61%
IMG OID: 637830014
Product: hypothetical protein
Protein accession: YP_430924
Protein GI: 83590915
COG category:
COG ID:
TIGRFAM ID:
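
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes one untranslated stop codon; the page does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2180515
end_bp = 2181921

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1407 468
```

Under these assumptions the computed values agree with the listed gene length (1407 bp) and protein length (468 aa).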


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGT GGTTTCAGGC GTTGATACTG GCTGTGGGCC TGATCCTGGC CCTGGCGCCG 
GCCGCCCTGG CCGGGACGGT CACCGGCGAC CAGCTCCGCC AGGTATTTCC CCAGGCTCCG
GCGCCGGGGA AGACCATAAC CCGGGGTGAG TTTGCCGCCC TGCTGGCCCG GGCGGCGGGG
ATGCAGGTGA AGGGTAACCA GGCTGATATC CAGGGGGATG CCTGGTACAG CCCGGCAGTA
ATGGCTTTGA AGGAGAAGGG CGTCATCAGG GGCTACCCCG ACGGCGGCCT GCACACCGAC
CAGCCGGTCA GCCTGCTGGA AGCGGCGGTG ATGGTTTCCC GGGTCCTGGG GTTGCCGGAC
GGTGTCGCCG CGCCGGAGGT AAAGGGGTCC CTGGGGCGGG AGAGCTGGGG TTATACCCCC
TATGCCTGGC TGGTACGGGC CGGCCTGCTG CAGCCCGGCC AGGACGCCGG GGGATTCCTT
ACCGTGGACG AAGGCATTGC TTTCCTGGCC GGGGTTTTTG GCAGCGATCC GGAGGCGGAA
AAGATTGCTC AGGCCGCCCA GCAGGCCCAG GCTAAGGTCA AAGACCTGAA ATTCGCCGGC
AGCATGGCTA TAAGCGTGCG CCTGCGGCCG GGGGTGGCCG GGGAAGTGCC GGCAGTTTTT
TCCATGCAGG GCAATATCAT GCAGGGCAAT ATCGAGAGCG AGTTCAGCTA TCCCCTGAGC
CTGCACCAGA AGGTGGACAT GACCCTTCGC TTGCCGGTAG AGAAACTGCC CGGTAAAGAC
CTGTCAACGG GCGGTAAGAT GCAGATGACC ATGGAACAGT ACCTGGTGGA CGGGACGATG
TACCAGAAGG TAGAGGCTCC CGGTATGGAA AAACCCCAGT GGATGAAGCT GCCCAAAGGA
GCCCTGCCGG ACCTGGAAGC CTTGGTGGAA CAGAGCAGGA ACTCGGCAGG GTTACCGCCG
GGGCTAAAGG ACAGCTTCCA TTTCCAGTAC TTGGGTGAGG GTATAGAGAA CGGGCATAAG
GTTCACCGTA TCGCCTACTA CGGCCGGATT GACGACTGGC AGGCCCTGAT AAAGGCCCTG
CCCGGAGGGT TGACCACGGA GATGGAGCAG GCCCTGAACC AGGCCGGCGG CGTCTTGAAG
TCCATTTCCT TCTGGGGTGT GGAAGCCATC GGCGTGGAGG ACAATCTTAC TTATGCCTCG
GAAATGACCA GCCTGGTCGC TTTTGCGGAT AAATACCAGG AAGAAATTGT GCCCCTGGAA
ACAATGACCA TCAACGTGAA GGTTACGGAT TTTCAGTATA ACAGTGGCGT AAAGATCCAG
GTGCCTGCCG AGGCCCTGAC GGCACCGGAA GTACCCCTGA CACCCTCACA ACCGGATGCA
AAATCATCCG GGAGCCAGCA GATGTAA
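
The gene sequence above is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any analysis. The following is a minimal sketch of how a GC percentage could be computed from such a listing. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the listing is used as a sample, so the result is not expected to match the 61% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Only the first line of the gene sequence, used as a short sample.
first_line = "ATGAAAAAGT GGTTTCAGGC GTTGATACTG GCTGTGGGCC TGATCCTGGC CCTGGCGCCG"
sample = clean_sequence(first_line)

print(len(sample), round(gc_percent(sample), 1))  # prints: 60 58.3
```

Passing the complete listing to `clean_sequence` instead of the sample line would give the figure for the full gene.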
 
Protein sequence
MKKWFQALIL AVGLILALAP AALAGTVTGD QLRQVFPQAP APGKTITRGE FAALLARAAG 
MQVKGNQADI QGDAWYSPAV MALKEKGVIR GYPDGGLHTD QPVSLLEAAV MVSRVLGLPD
GVAAPEVKGS LGRESWGYTP YAWLVRAGLL QPGQDAGGFL TVDEGIAFLA GVFGSDPEAE
KIAQAAQQAQ AKVKDLKFAG SMAISVRLRP GVAGEVPAVF SMQGNIMQGN IESEFSYPLS
LHQKVDMTLR LPVEKLPGKD LSTGGKMQMT MEQYLVDGTM YQKVEAPGME KPQWMKLPKG
ALPDLEALVE QSRNSAGLPP GLKDSFHFQY LGEGIENGHK VHRIAYYGRI DDWQALIKAL
PGGLTTEMEQ ALNQAGGVLK SISFWGVEAI GVEDNLTYAS EMTSLVAFAD KYQEEIVPLE
TMTINVKVTD FQYNSGVKIQ VPAEALTAPE VPLTPSQPDA KSSGSQQM
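
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last lines of the gene listing. It builds the codon table by hand rather than using a bioinformatics library; translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted), so the standard assignments are used here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position, third base fastest),
# paired with the standard amino acid assignments, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listing (both start in frame).
first_line = "ATGAAAAAGT GGTTTCAGGC GTTGATACTG GCTGTGGGCC TGATCCTGGC CCTGGCGCCG"
last_line = "AAATCATCCG GGAGCCAGCA GATGTAA"

print(translate(first_line))  # prints: MKKWFQALILAVGLILALAP
print(translate(last_line))   # prints: KSSGSQQM
```

Both outputs match the start and end of the listed protein sequence.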