Gene Information |
Locus tag | Moth_1081 |
Symbol | |
ID | 3833194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1111782 |
End bp | 1113296 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637829009 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_429938 |
Protein GI | 83589929 |
COG category | [R] General function prediction only |
COG ID | [COG1418] Predicted HD superfamily hydrolase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG; [TIGR03319] conserved hypothetical protein YmdA/YtgF |
|
|
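The coordinates and lengths listed in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1111782
end_bp = 1113296

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1515 504
```

Under these assumptions the computed values agree with the listed Gene Length (1515 bp) and Protein Length (504 aa).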
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.075195 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTAATAG CTTTCCTGGT CGGCGGGGCG GGTGGTTATG CCATCCGCAA ATACCTGGCT GAGGCCAAGA TAGCTTCAGC CGAAAAGGCT GCCGCCACCA TTATCGAAGA GGCCAAAAAA GAAGCTGAAG CCAGGAAAAG GGAAGCGGTT CTGGAGGCCA AGGATGAAGT CCACCGCATG CGTAATGAGG TGGAACGGGA GAGCAGGGAA CGACGCAATG AACTCCAGCG TTTAGAGCGG CGCTTGCTGC AAAAAGAGGA AACCCTGGAA CGCAAATCTG AAACCCTGGA ACGCAAAGAG GCCAGCCTGC ACCGCCAGGA AGAAGCGATT CAGCGTACCA GGGAAGAGGT AGAGAAAATT CGCCAGCAGC AAGTGAGCGA ACTGGAGCGG ATTTCCGGCT TAACTACCGA GGCTGCCAGG AATATCCTTT TGAAAAACGT GGAGGAAGAA ATCAGGCATG AAACAGCAAT GCTCATCAAG CAGGTAGAGG CTGAAGCGAA GGAAGAGGCC GAGAAAAGGG CCCGGGAAAT TATCACCTAC GCTATCCAGC ACTGTGCTGC CGACTATGTA GCTGAAGCTA CAGTATCGGT AGTTAACCTG CCCAATGACG AGATGAAGGG GCGGATAATC GGCCGTGAGG GCCGGAATAT CCGGGCCCTG GAAACCCTGA CTGGCGTAGA CCTCATTATA GATGATACGC CGGAAGCGGT TATCCTGTCC TGCTTTGATC CTATTCGCCG GGAGATTGCC CGGATAGCCT TGGAAAAACT TATAGCCGAT GGGCGCATCC ATCCGGCGCG GATTGAAGAA ATGGTGGAAA AGGCCCGGCG GGAACTGGAT ACGAAGATCC GGGAGGAAGG CGAGCAGGCC ACCTTCGAGG TGGGTATCCA CGGCCTGCAC CCGGAATTGG TGCGCCTGCT GGGTAAATTG AAATATCGTA CCAGCTACGG CCAGAATGTC CTGAAACACT CCCTGGAAGT CGCCTTCCTG GCGGGGGCTA TGGCCGCCGA ACTGGGTGTA GATGTACTGG TGGCTAAACG GGCTGGCCTG CTTCATGACA TAGGCAAGGC GGTGGACTTT GAAGTTGAAG GCCCCCACGT CAACCTGGGG GTTGAACTGG CCAAAAAGTA CCGGGAGTCA CCGGAGGTCA TTCATGCCAT TGAAGCCCAT CACGGCGATG TAGAGCCTAA AAGTATTGAA GCTGGACTGG TCCAGGCTGC TGATGCCATT TCCGCCGCCC GTCCCGGAGC CAGGCGTGAG ACCCTGGAAG CCTATATTAA GCGCTTAGAA AAACTGGAAG AGATTGCCAA TTCCTTTAGC GGCGTAGAAA AATCCTATGC CATCCAGGCC GGACGTGAAG TTCGTATCCT GGTTAAACCG GATAAGATTG ACGATGCCAT GGCCGTTCGC TTGGCCCGGG ATATCGTCAA AACCATCGAG CAGACAATGG AGTATCCAGG CCAGATCAAG GTAGTGGTCA TCCGGGAAAC CCGGGCTGTA GATTACGCCA AATAG
|
Protein sequence | MLIAFLVGGA GGYAIRKYLA EAKIASAEKA AATIIEEAKK EAEARKREAV LEAKDEVHRM RNEVERESRE RRNELQRLER RLLQKEETLE RKSETLERKE ASLHRQEEAI QRTREEVEKI RQQQVSELER ISGLTTEAAR NILLKNVEEE IRHETAMLIK QVEAEAKEEA EKRAREIITY AIQHCAADYV AEATVSVVNL PNDEMKGRII GREGRNIRAL ETLTGVDLII DDTPEAVILS CFDPIRREIA RIALEKLIAD GRIHPARIEE MVEKARRELD TKIREEGEQA TFEVGIHGLH PELVRLLGKL KYRTSYGQNV LKHSLEVAFL AGAMAAELGV DVLVAKRAGL LHDIGKAVDF EVEGPHVNLG VELAKKYRES PEVIHAIEAH HGDVEPKSIE AGLVQAADAI SAARPGARRE TLEAYIKRLE KLEEIANSFS GVEKSYAIQA GREVRILVKP DKIDDAMAVR LARDIVKTIE QTMEYPGQIK VVVIRETRAV DYAK
|
| |
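The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is a minimal example, not the database's own pipeline: `translate_cds` and `START_CODONS` are hypothetical names, the codon assignments are those of the standard genetic code (which translation table 11 shares), and the set of alternative start codons read as M is a simplified assumption covering only ATG, GTG and TTG.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a simplified subset of the start codons allowed in bacteria.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring the whitespace used to group bases."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        # An alternative start codon in first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, starting with GTG.
print(translate_cds("GTGTTAATAG CTTTCCTGGT CGGCGGGGCG"))  # MLIAFLVGGA
```

The output matches the first ten residues of the listed protein sequence, which begins with M even though the gene starts with GTG rather than ATG.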