Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0569 |
Symbol | |
ID | 3832482 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 591931 |
End bp | 593439 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637828510 |
Product | hypothetical protein |
Protein accession | YP_429442 |
Protein GI | 83589433 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1055] Na+/H+ antiporter NhaD and related arsenite permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0080395 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTTTT ATTATGGAAT TGGCCCGAAA ATTGTATTCT CAGAGAAGGG ACAGGTGGAG ATGAGCATTA AAAGAGTTGC GGTATTAACC TTGCTTCTTT GCGGCTCTTT TTTGCCCCTC GGGGGGGTGG CATGGGCGGC GACAGGAAAC GAGCTGGGTA AAATGTTACC TTTATGGAGC GTATTACCCT TTGTAGGGAT GCTCCTGTCC ATTGCCATCT GGCCGCTGGT CAACGCCTCC TGGTGGGAGC ACAACATGGG GAAGGTAAGC TTGTTCTGGT CCCTGGTTTT CTTTGTACCG TTCTTAATTG CCTTCGGGAG CGGTACGGCC TTTACCCAGG CAGTGGAAGT GTATCTCCTG GATTACCTGC CCTTTATCAT CCTTCTCTTC GGTCTCTTCG TAGTGGCAGG GGGAATCATT CTTCGTGGTA CCCTGCGGGG TAATCCGGGT GTGAACGTCC TGCTCCTGCT GGTGGGCACG ATCCTCTCAA GCTGGATTGG GACCACCGGA GCCAGCATGC TCATGATCCG GCCGGTTATC CGGGCCAACG AGTGGCGGCG TTACAAGGCC CACATAATCA TCTTTTTTAT CTTCCTCATT TCCAATATCG GCGGGGCCCT CACCCCCGTA GGCGACCCGC CCCTGTTCCT GGGTTATTTA CGGGGGGTGC CCTTCTTCTG GACCATGAGG TTAATCCTGC CCATGGGATT TAATGTACTA ATCCTCCTCA CCCTTTACTA CTTCCTGGAT AGTTACTTTT ACCGTAAAGA AAAGGTGCCC CGGGCAAAGG GCGGGCAGGA CCCCTTGCGG GTAGAGGGTT TACAGAATCT TATTTACCTG GGTATCATTG TCGGCGCTGT TATTTTAAGT GGTATCCTGG CCAAGAACCC GGCCTTTGCC GACCAGCAAA CCGGGAACCT TTACGGCATT ACTATCTTCC GCCACGGCGA AGAAGCAGTG GTGCTGCCCT ACACCAACAT AATCCGCGAC CTGGCCATTC TCCTGGCGGC TTTCCTGTCA TGGAAGACTA CTTCTATGGA TATTCGTAAA GATAATCGCT TTACCTGGGG TCCCATCAAG GAAGTAGCCA TCCTTTTTGC CGGTATCTTT ATGACCATGA TTCCGGCCCT GGCCATCCTC CACGCCCGCG GCGCGGAACT GGGCTTAACT CACCCGGCCC AATTTTTCTG GGCCACGGGA GCCCTTTCCA GCTTCCTGGA TAACGCCCCG ACGTACCTGG TATTCCTGAC CACCGCTACC AGCCTGGGGG CAACCACCGG GGTGCCTACC ACCCTGGGTG TCGTGGCTCC CAAGATGCTC CTGGCCATAT CCTGCGGCGC CGTGTTTATG GGTGCCAATA CTTATATCGG TAATGCCCCC AACTTTATGG TGCGGTCAAT AGCGGAAGAG AATAATATTC GCATGCCCAG CTTTTTCGGC TACATGGGCT GGTCGATAGG GATTTTAATT CCTTTATTTA TCCTGGACAC CTTGATTTTC TTCCGTTAA
|
Protein sequence | MFFYYGIGPK IVFSEKGQVE MSIKRVAVLT LLLCGSFLPL GGVAWAATGN ELGKMLPLWS VLPFVGMLLS IAIWPLVNAS WWEHNMGKVS LFWSLVFFVP FLIAFGSGTA FTQAVEVYLL DYLPFIILLF GLFVVAGGII LRGTLRGNPG VNVLLLLVGT ILSSWIGTTG ASMLMIRPVI RANEWRRYKA HIIIFFIFLI SNIGGALTPV GDPPLFLGYL RGVPFFWTMR LILPMGFNVL ILLTLYYFLD SYFYRKEKVP RAKGGQDPLR VEGLQNLIYL GIIVGAVILS GILAKNPAFA DQQTGNLYGI TIFRHGEEAV VLPYTNIIRD LAILLAAFLS WKTTSMDIRK DNRFTWGPIK EVAILFAGIF MTMIPALAIL HARGAELGLT HPAQFFWATG ALSSFLDNAP TYLVFLTTAT SLGATTGVPT TLGVVAPKML LAISCGAVFM GANTYIGNAP NFMVRSIAEE NNIRMPSFFG YMGWSIGILI PLFILDTLIF FR
|
| |