Gene Moth_1160 details

Gene Information

Locus tag: Moth_1160
Symbol:
ID: 3833128
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1192827
End bp: 1194509
Gene Length: 1683 bp
Protein Length: 560 aa
Translation table: 11
GC content: 52%
IMG OID: 637829091
Product: metallophosphoesterase
Protein accession: YP_430017
Protein GI: 83590008
COG category:
COG ID:
TIGRFAM ID:
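
The gene length and protein length listed above can be cross-checked from the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this, but the numbers agree under that assumption); the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 1192827
END_BP = 1194509


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids: codon count minus the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1683
print(protein_length(length_bp))  # 560
```

Both values match the Gene Length (1683 bp) and Protein Length (560 aa) fields of the record.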


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000153415
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAGT TTGCTGCCCT CTTCCTGGCT GTAGTTATAG CCGTGACTAT GCTGGCGGCC 
CCGGCTGCTT CCTCTGCCGC TGGCACCCCC CAGTATGATC TAGGTGCTTC GGCCCAACCG
GATCATATTA CCCTCACCTG GACCCAAGAT CCCCTGACAA CCCAGACCAT TACCTGGAGA
ACTAACATTA CTATAGCCAG GGGGCTTGTC CAGTATGCCA AAGCCGCGGA TAAGGCCTCT
TTCCCCGGCA AAGCCGCTAC CGTAGAAGCT ACAGTGCAGA AGTTTACCTC CGATCTGGGG
GATATGAACA TCCATACCGC CACCCTTACC GGCCTCGAAC CCGGCACCGA GTATATCTAT
AGGGTTGGCG ACGGCACCAA CTGGAGCGAC ATCCACACCT TCACCACAGA AGCCAGCAAC
ACTCACTCTT TCAAATTCCT TATCTTTGGC GACAGCCAGA GCGGCGACCC CCTAAATCCG
GAATATAAAC CCTGGCACGA TACCATCCAG AACGCCTTCA AAACTAACAC CGACGCTAAA
TTCTTTGTCA ATGTCGGCGA CCTGGTCGAA CAGGGACAGA ATTATGTCCA CTGGAATAAA
TGGTTCGAGG CCGCCAAAGG TGTTATTGAT ACCATCCCGG CCATGGCCAC CCAGGGCAAC
CACGAGACTT ACAACCCGCC TGATGGCCAT TCAACTAAAC CGATTTTTTG GACTACCCAG
TTCAAACTGC CCCAGAACGG CCCGGAGGGC CTGAAAGGCC AGGCTTATTC CTTTGATTAT
GGGAACGCCC ATATTGTAAT GCTCGACAGC CAGGAAGAAG AAGAAAAGGG TGTGGCCGGG
GATATTCTGG CGGCCCAAAA GGCCTGGCTG GAAAAAGACC TTCAGAATAC CAATAAGCCC
TGGAAACTGG TCTTCTTCCA TAAAACACCT TATTATAATA AGGCTACCCG TACCAACGAA
GATATTAAAG CCGCCTTCCA GCCCCTCTTC GATAAATACC ACGTTGACGT AGTTTTTAAC
GGCCACGACC ATGCCGTCGC GCGGACCTAC CCCATAGCCG GCGATAAGTT TGTCAGCAGC
CCGGCTAAAG GCACCATCTA CTATCTCACC GGTAGAAGCG GTAATAAGTA TTACCCCGAC
CTGTCGGCCA AGGTATGGGA CGCCTTCTTC TACGACCCTC AAGATCAACC CAACTATATT
GTAGCTGAAT TGAATGGGGA TAAATTGACC CTCAGGGCTA TGAAGCAAGA TGGCACCCCC
ATCGATACCT ACACCATCGA TAAAGCCAGC GGGCTGGATA CGCCCCAGAC TATTGTCCCG
CCTAAATATA ACTCCACCAG GTTGGTGATC TTCGGTAACA TGCTCCAGCA GCCCCTGCTG
CCGGTAACCC CCAAGCAGGT CAATGGCCAG TGGTATATCC CCGTAAGGGC CTTTATGCAG
TTCCTGGGCG GCAATGTGGC CTGGTATGAT GACGGCAGCG TAACCATCGT TTATGGTAAA
GACAAGGTGC AAATGGCCAG CAAGAGCGCC CGGGCCACCA TCAACGGCCA GGAAGTAAAC
CTGCCCGGCA GTAGCCTGAT GGACAAAAAT ACTCTTTTTA TACCGGCTGC CGACCTGGAG
GAATTCTTTG GTTTCAGCTA CAAGTATGAT GCCGCCACCA ATATGCTGAT GTTTACCAAA
TAA
 
Protein sequence
MKKFAALFLA VVIAVTMLAA PAASSAAGTP QYDLGASAQP DHITLTWTQD PLTTQTITWR 
TNITIARGLV QYAKAADKAS FPGKAATVEA TVQKFTSDLG DMNIHTATLT GLEPGTEYIY
RVGDGTNWSD IHTFTTEASN THSFKFLIFG DSQSGDPLNP EYKPWHDTIQ NAFKTNTDAK
FFVNVGDLVE QGQNYVHWNK WFEAAKGVID TIPAMATQGN HETYNPPDGH STKPIFWTTQ
FKLPQNGPEG LKGQAYSFDY GNAHIVMLDS QEEEEKGVAG DILAAQKAWL EKDLQNTNKP
WKLVFFHKTP YYNKATRTNE DIKAAFQPLF DKYHVDVVFN GHDHAVARTY PIAGDKFVSS
PAKGTIYYLT GRSGNKYYPD LSAKVWDAFF YDPQDQPNYI VAELNGDKLT LRAMKQDGTP
IDTYTIDKAS GLDTPQTIVP PKYNSTRLVI FGNMLQQPLL PVTPKQVNGQ WYIPVRAFMQ
FLGGNVAWYD DGSVTIVYGK DKVQMASKSA RATINGQEVN LPGSSLMDKN TLFIPAADLE
EFFGFSYKYD AATNMLMFTK
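
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is one way to reproduce that translation in plain Python, not the pipeline the database itself uses. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# The amino acid assignments are those shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, spacing as shown in the record.
print(translate("ATGAAGAAGT TTGCTGCCCT CTTCCTGGCT"))  # MKKFAALFLA
```

Passing the full gene sequence (with or without the 10-base spacing) to `translate` should yield the protein sequence shown above with the spaces removed.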