Gene Moth_0479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0479 
Symbol 
ID3832417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp483662 
End bp484786 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content66% 
IMG OID637828413 
Productmetallophosphoesterase 
Protein accessionYP_429352 
Protein GI83589343 
COG category[L] Replication, recombination and repair 
COG ID[COG0420] DNA repair exonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTTCCGCG TCCTGCACCT GGCGGACCTG CACCTGGGGT ACCGGCCTGA CCTGCCGGCA 
CCGGTCCGGG AGGAGGTCTA CCGGGCGCGC AACAGGGTTT TGCAGGCGGC CGTGGACCTG
GCCCTGGACC CCCGGCAGGG AATCAGCCTG GTCCTCATCG CCGGGGACCT CTTTGACAAC
CACCGGCCGG AGGCTTCCCT GGTGGAGGAA ACCATCCGCC AGCTAACCCG GCTGGAAGCC
GCCGGCATCC AGGTGATAAC GGTGCCGGGG AACCATGATG AGATTACATA TAACGATGCG
GTATACCGCC GGGAAGCCGG CCGCTGGCCG GGCCTCCTGG TGACCGATCC CATGCCGGCT
AAAGTGGCTA CCCTAAAAGT CAAGGGCGAC ACCTGTCACC TAATGGCCAT GGCCTACACC
GGCGGTGTGA CCAGGGTTGA CGGCCCGTTA AAGGCCTTCC CTCCGGCCGA GGGGGAAGGG
GTGAACCTGG CGGTTTTCCA CGGCTCCCTG GACTGGGACG GCGGGGAAAG GAGCTTGCCC
CTGGATGGGG AAGCCCTGGC AGCCGCCGGG TATGACTACA TAGCCCTGGG CCACATCCAC
CGCGGGGGAC AAAGGAGGTT GGGCCGGGGA CTGGCCGTCT ACGCCGGCAT GGTGGCCGGC
AAGGGCTTCG CCGATCCCGG GACCGGCTCT TGGACCATAG TCACCCTGGG GGACGGCCCG
GCCCGGGTGG AGCAGGTACC GGCCCGGGTA ACGCCCTGGC GGCTCGTTGA CCTGGACGTT
ACGGCTTTCC AGGAACCGGA GGAACTGGAG ACGGCCGTCA GGGACGCCCT TGACCCCGGT
GCCCTGATGC AGGTGCGCCT GACGGGCAGC CTCGCCTTTG ATCTGGACCT TGAGGGCCTC
CAGGCCCGCC TGGGTCACCC CTGCCCATAC CTGGAACTGG TGAACGAAAC GGCAGGATGG
ACGCCGGAAA TCCTGGAGAG TATCGCCGCT GAACCAACCA TCAGGGGCCT TTTTGTCCGC
CGCCTGCAAG AAAAGCTTGC CGGCTCTGCG GGACCCGGGG AAGCAGAGGT GATCCGCCGG
GCGATTTTTC GCGGCCTCCT GGCCCTGAAG GGGGGTAGCA GGTAA
 
Protein sequence
MFRVLHLADL HLGYRPDLPA PVREEVYRAR NRVLQAAVDL ALDPRQGISL VLIAGDLFDN 
HRPEASLVEE TIRQLTRLEA AGIQVITVPG NHDEITYNDA VYRREAGRWP GLLVTDPMPA
KVATLKVKGD TCHLMAMAYT GGVTRVDGPL KAFPPAEGEG VNLAVFHGSL DWDGGERSLP
LDGEALAAAG YDYIALGHIH RGGQRRLGRG LAVYAGMVAG KGFADPGTGS WTIVTLGDGP
ARVEQVPARV TPWRLVDLDV TAFQEPEELE TAVRDALDPG ALMQVRLTGS LAFDLDLEGL
QARLGHPCPY LELVNETAGW TPEILESIAA EPTIRGLFVR RLQEKLAGSA GPGEAEVIRR
AIFRGLLALK GGSR