Gene Information |
Locus tag | Moth_1789 |
Symbol | |
ID | 3832455 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1844146 |
End bp | 1845375 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637829714 |
Product | metallophosphoesterase |
Protein accession | YP_430633 |
Protein GI | 83590624 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0420] DNA repair exonuclease |
TIGRFAM ID | |
|
|
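The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS is unspliced and ends in a single stop codon (the record says "Is gene spliced | No"). The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record for Moth_1789.
length = gene_length(1844146, 1845375)
print(length, protein_length(length))
```

With the values from this record, the computed lengths agree with the Gene Length (1230 bp) and Protein Length (409 aa) fields.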
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACCAGG TCCTATTTTC TGGCGGTGAT AATATGATCC GTTTTCTCCA TACAGCCGAC TGGCAGGTGG GTATGAAGGC CCGGCACGTG GCGCCGGTAG CCGCCCGGGT GCGGGAGGCG CGCCTGGAGA CGGCCCGGCG TTTAATGGAA ATCGCCCGGG AGCGCCGGCT GGACTTTATC ATCATTGCCG GCGATGTCTT TGAAGACAAC CAGGTAGATA ATAAACTCGC CCACCAGGTA GTCCAGATTC TCTCCCTGGC CGCACCGGTT CCAGTCTATA TCCTCCCCGG TAACCATGAC CCCCTGACTC CTGATGCCGT TTATGAGCGC CGCGTCTTCA GGGAGGGTCT GGCCCCCAAT ATTCATCTGT TGCGTACTAA CCAGCCCGTT ACTGTTCTGC CCGGTGTGGT CCTGCTGCCT GCGCCCAACC GGGCTAAAAA TTCCCCGGAA GACCCCACAG AAAAGATGGC ACCGGTACCG GGAGGAGTCA TCAACATCGG CGTTGCCCAT GGCTCCTTGC GCATCGAGGG GCGTTACCAG TCCGATGACT TTCCCATTCC GCCGGAGGCG GCGGAACGCC GGGGCCTGGA TTACCTGGCC CTGGGCCACT GGCATTCCTT TTTCCAGTAC GGAGACCGGA CTTTTTATCC CGGCACTCCG GAACCCACCG GCTTTGAGGA ACGAGACAGC GGCACAGCAG CCCTGGTAAC TATTGAAGGG TACGGCGCGC CGCCCCGGGT GGAAAAGATA AAGACCGGCA CCCTGGGCTG GGAAACCTGG CGGCAGGAGG TCCACGGCGA CCCCCGGGAA ACCGTCCGGG CTCTTAAACT CCGGGTGGAA GGCCTGGCCA GCCCCGGCCA GACCCTCCTG CGCCTCGCCC TTTACGGGCG GGATACCAGC GGGGATCAAT CCTGGCTGAA GGAACTCCGG GACTGGCTGG AGGCCCGGTT GCTATACCTC GATCTGGATA CCACCCGCCT GGCTACGCAA CCCCTGACGC CAAAGCTCCA GAACAGGGCT CGCTCCCAGC CCTTCCTGCG GGCTGCCATC GCCGACCTGG CCGCGTTGGC CAGGAGCCTG GGGCAACCCC TGGAAGACGT GGAAAATGTA GCAGACCCGG GAACCAGTCT GGATGCCGAC CTGATCGACA GGATACTCCG GGCCGGCATC AAACCCGGGG ACGTCCGGGA GGCCCTGGGA GTGCTGGCCG AAGTAGTAGA GGAGGTCTAA
|
Protein sequence | MHQVLFSGGD NMIRFLHTAD WQVGMKARHV APVAARVREA RLETARRLME IARERRLDFI IIAGDVFEDN QVDNKLAHQV VQILSLAAPV PVYILPGNHD PLTPDAVYER RVFREGLAPN IHLLRTNQPV TVLPGVVLLP APNRAKNSPE DPTEKMAPVP GGVINIGVAH GSLRIEGRYQ SDDFPIPPEA AERRGLDYLA LGHWHSFFQY GDRTFYPGTP EPTGFEERDS GTAALVTIEG YGAPPRVEKI KTGTLGWETW RQEVHGDPRE TVRALKLRVE GLASPGQTLL RLALYGRDTS GDQSWLKELR DWLEARLLYL DLDTTRLATQ PLTPKLQNRA RSQPFLRAAI ADLAALARSL GQPLEDVENV ADPGTSLDAD LIDRILRAGI KPGDVREALG VLAEVVEEV
|
| |
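The relationship between the Gene sequence and Protein sequence fields can be reproduced with a small translation routine. This is a sketch only, assuming the Gene sequence is already written 5' to 3' on the coding strand (the leading ATG and trailing TAA suggest this, even though the gene lies on the minus strand of the replicon). Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons; since this gene starts with ATG, the sketch does not handle alternative starts. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-letter groups."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence from the record.
print(translate("ATGCACCAGG TCCTATTTTC TGGCGGTGAT"))  # MHQVLFSGGD
```

Passing the full Gene sequence field to `translate` and `gc_fraction` would allow a comparison against the Protein sequence and GC content fields of the record.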