Gene Moth_0292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0292 
Symbol 
ID3832954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp298125 
End bp299192 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content36% 
IMG OID637828227 
Producthypothetical protein 
Protein accessionYP_429169 
Protein GI83589160 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00116575 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTAACG ATTTATATCC TCTATTAGGA CAAGGAGAAA GACTGATAGA ACCTATAACA 
CCTGGGCAAA GGTCTGCGAA AAAGAGTTTT CCCAGAACCT ATGAAGAGGC GCGAGAACTT
ATTAAAAACC AGCTAAAAGA TTTGAGATAT GAAATCGATA ATATACCGCA ACAGAAAAGA
GTGGAACAAA TAATATTTAC AGTTAGGCTT AACCATAATT TCCTTGCTAA ATCATATATA
CCTAACACTT TTTTTTATCA AACTGGAATG GAGAATATAG GGTCAAGGCG ATGGATTTAT
AAGGAATCCA ATAAAGAGAA ACCTCAACTT AGTAAACTTC ACTTTGTGAG AGCCGAAATC
AGTAATCTAG CCATTTTAGA GGAAAAACTA AACACTCAGG AGAGTAGGCT TAGTGAGGCT
TTTAAACAAG ATATACAAAA AATTGAAAAG CTGTCTCTCC TTTCTCCCGA AGAAGCAATT
CAGGGGTTTA ACGACGATTG GCAAACAGGT AAAGTGGAGA TTGTCTTACA TCCGTTAAAA
GATAGTTCAG AAGAAGCAGT AAGAAAGTTG AAGGATATTT TATTGGCTAA TGGGGTAAAA
GAAAAATCTA TATTAATTAG GCACTATCCT GGGGGGCCAA CATTTATAAG TGCTAACATA
ACCAGAAAAG CATTACAAGA AATTGGGGAT TTTAATCCTT TGAGAACGGT TCATCCTTTA
AAGATAAACT TTTTCCCTGA ACTCAGGAAA ATAGGCTCAT TTCCTATAAT ACCTACCCCT
CCCGTAGGAA AAACAATATC AACAATAAAG GTTGGTATAT TTGATGGGGG GATAGATGCT
ACAAATCCTT ATCTTGCAAA CTACGTAAAA GAAAATTCCC TAATAAAAAC CAAGCCTCAT
CCTACCTATA TAACTCATGG TACTGCAGTT GCTGGGGTAG TCTTATATGG CCCATTAAAT
AACTATGATA ATAATACAGT TTTGCCTAAT CCGTTCGGAT TAACCCCCGC AGATTGGACA
TATGCACCCG ACATTAAGCT CTCTTGCAAT TTTTCTTCAA GCTTTTAA
 
Protein sequence
MPNDLYPLLG QGERLIEPIT PGQRSAKKSF PRTYEEAREL IKNQLKDLRY EIDNIPQQKR 
VEQIIFTVRL NHNFLAKSYI PNTFFYQTGM ENIGSRRWIY KESNKEKPQL SKLHFVRAEI
SNLAILEEKL NTQESRLSEA FKQDIQKIEK LSLLSPEEAI QGFNDDWQTG KVEIVLHPLK
DSSEEAVRKL KDILLANGVK EKSILIRHYP GGPTFISANI TRKALQEIGD FNPLRTVHPL
KINFFPELRK IGSFPIIPTP PVGKTISTIK VGIFDGGIDA TNPYLANYVK ENSLIKTKPH
PTYITHGTAV AGVVLYGPLN NYDNNTVLPN PFGLTPADWT YAPDIKLSCN FSSSF