Gene Moth_1789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1789 
Symbol 
ID3832455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1844146 
End bp1845375 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content62% 
IMG OID637829714 
Productmetallophosphoesterase 
Protein accessionYP_430633 
Protein GI83590624 
COG category[L] Replication, recombination and repair 
COG ID[COG0420] DNA repair exonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCAGG TCCTATTTTC TGGCGGTGAT AATATGATCC GTTTTCTCCA TACAGCCGAC 
TGGCAGGTGG GTATGAAGGC CCGGCACGTG GCGCCGGTAG CCGCCCGGGT GCGGGAGGCG
CGCCTGGAGA CGGCCCGGCG TTTAATGGAA ATCGCCCGGG AGCGCCGGCT GGACTTTATC
ATCATTGCCG GCGATGTCTT TGAAGACAAC CAGGTAGATA ATAAACTCGC CCACCAGGTA
GTCCAGATTC TCTCCCTGGC CGCACCGGTT CCAGTCTATA TCCTCCCCGG TAACCATGAC
CCCCTGACTC CTGATGCCGT TTATGAGCGC CGCGTCTTCA GGGAGGGTCT GGCCCCCAAT
ATTCATCTGT TGCGTACTAA CCAGCCCGTT ACTGTTCTGC CCGGTGTGGT CCTGCTGCCT
GCGCCCAACC GGGCTAAAAA TTCCCCGGAA GACCCCACAG AAAAGATGGC ACCGGTACCG
GGAGGAGTCA TCAACATCGG CGTTGCCCAT GGCTCCTTGC GCATCGAGGG GCGTTACCAG
TCCGATGACT TTCCCATTCC GCCGGAGGCG GCGGAACGCC GGGGCCTGGA TTACCTGGCC
CTGGGCCACT GGCATTCCTT TTTCCAGTAC GGAGACCGGA CTTTTTATCC CGGCACTCCG
GAACCCACCG GCTTTGAGGA ACGAGACAGC GGCACAGCAG CCCTGGTAAC TATTGAAGGG
TACGGCGCGC CGCCCCGGGT GGAAAAGATA AAGACCGGCA CCCTGGGCTG GGAAACCTGG
CGGCAGGAGG TCCACGGCGA CCCCCGGGAA ACCGTCCGGG CTCTTAAACT CCGGGTGGAA
GGCCTGGCCA GCCCCGGCCA GACCCTCCTG CGCCTCGCCC TTTACGGGCG GGATACCAGC
GGGGATCAAT CCTGGCTGAA GGAACTCCGG GACTGGCTGG AGGCCCGGTT GCTATACCTC
GATCTGGATA CCACCCGCCT GGCTACGCAA CCCCTGACGC CAAAGCTCCA GAACAGGGCT
CGCTCCCAGC CCTTCCTGCG GGCTGCCATC GCCGACCTGG CCGCGTTGGC CAGGAGCCTG
GGGCAACCCC TGGAAGACGT GGAAAATGTA GCAGACCCGG GAACCAGTCT GGATGCCGAC
CTGATCGACA GGATACTCCG GGCCGGCATC AAACCCGGGG ACGTCCGGGA GGCCCTGGGA
GTGCTGGCCG AAGTAGTAGA GGAGGTCTAA
 
Protein sequence
MHQVLFSGGD NMIRFLHTAD WQVGMKARHV APVAARVREA RLETARRLME IARERRLDFI 
IIAGDVFEDN QVDNKLAHQV VQILSLAAPV PVYILPGNHD PLTPDAVYER RVFREGLAPN
IHLLRTNQPV TVLPGVVLLP APNRAKNSPE DPTEKMAPVP GGVINIGVAH GSLRIEGRYQ
SDDFPIPPEA AERRGLDYLA LGHWHSFFQY GDRTFYPGTP EPTGFEERDS GTAALVTIEG
YGAPPRVEKI KTGTLGWETW RQEVHGDPRE TVRALKLRVE GLASPGQTLL RLALYGRDTS
GDQSWLKELR DWLEARLLYL DLDTTRLATQ PLTPKLQNRA RSQPFLRAAI ADLAALARSL
GQPLEDVENV ADPGTSLDAD LIDRILRAGI KPGDVREALG VLAEVVEEV