Gene Moth_0136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0136 
Symbol 
ID3830793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp130880 
End bp131968 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content63% 
IMG OID637828070 
Producthypothetical protein 
Protein accessionYP_429018 
Protein GI83589009 
COG category[S] Function unknown 
COG ID[COG1415] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.000501513 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.643444 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTACCG GTACCGCCAG CCTGCCCCTC CACGGCGGCC ATTGCCCGCC CTGGCTCTTC 
GAGCGCATGC AGCGCCTGGG GCCGGCCATC CTGGAGGTAA TTGTGCAGGA ATACGGCCCC
CAGGAGGTTT TAAGACGCTT AAGCGATCCC CACTGGTTCC AGGCCTTCGG CTGTGTCCTG
GGTTTTGACT GGCACTCCTC GGGCCTGACC ACCACCCTTT GCGGCGCCCT CAAAGAAGGT
TTGCGCGGCC GGGAAAAGGA TCTGGGCTTG GTCATAGCCG GCGGCAAGGG CCGTACCTCC
CGCCAGACGC CCCATGAGAT CGAAACGGCG GTCGACAGAT TGGCCCTGAC TTCCCTCGAG
CCTGAAGATC TGGTTTATGC CAGCCGCATG GCGGCCAAGG TCGATAACAC CGCCCTCCAG
GACGGCTACC AGCTCTACCA CCACGTCTTT ATCTTCACCT TTGACGGCCA GTGGGCCGTC
GTCCAGCAGG GGATGAATGA AACCAGCCGC CTGGCCCGGC GCTACCACTG GCTGGGGGAA
GGGATGCAGG ACTTCGCCTG CGAGCCCCAC GCCGCCGTCT GCTGTGACGC CAGGGAAACG
GCCCTGAACA TGGTAGCCAG GGAAAGCGAG GCTTCCCGCC AGGTGGTAAC CGAACTGGTA
CGCCAGCAAC CGGCGAAGGT GGTAGCCGAG TTTAGCCGCA TCCTGGAAAA GGACCTCCCC
AACCTGGCCC TGCCCTGGCG CCACGACGTG CCCCGGGCGG GTTACCTGAA TAAAGCCCTG
TTAAAGGTTT ACGACGTCCA GCCCCGGGAC TTCGCCGGTG TCCTGGGGAT CGAAGGAGTG
GGTCCCAAGA CCATCCGCGC CCTGGCCATG GTGGCCGAAG TGGCCTATGG CGCGCCGGCC
AGCTTCCGGG ACCCCGTTCG CTACAGTTTT AGCCACGGCG GCAAGGACGG CCATCCCTAC
CCCGTCGACC GCCAGGTATA CGACCGCACC ATTAACGTCC TGGAACAGGC CCTGGCGGCC
GCTAAAATCG GTCGGACCGA TAAAATACAG GCTTTAAAAA GGCTGAGCAG ATTGGCTAAT
GGAAGTTAA
 
Protein sequence
MRTGTASLPL HGGHCPPWLF ERMQRLGPAI LEVIVQEYGP QEVLRRLSDP HWFQAFGCVL 
GFDWHSSGLT TTLCGALKEG LRGREKDLGL VIAGGKGRTS RQTPHEIETA VDRLALTSLE
PEDLVYASRM AAKVDNTALQ DGYQLYHHVF IFTFDGQWAV VQQGMNETSR LARRYHWLGE
GMQDFACEPH AAVCCDARET ALNMVARESE ASRQVVTELV RQQPAKVVAE FSRILEKDLP
NLALPWRHDV PRAGYLNKAL LKVYDVQPRD FAGVLGIEGV GPKTIRALAM VAEVAYGAPA
SFRDPVRYSF SHGGKDGHPY PVDRQVYDRT INVLEQALAA AKIGRTDKIQ ALKRLSRLAN
GS