Gene Information |
Locus tag | Moth_1849 |
Symbol | |
ID | 3831710 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1907172 |
End bp | 1908131 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829781 |
Product | hypothetical protein |
Protein accession | YP_430692 |
Protein GI | 83590683 |
COG category | [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism; [R] General function prediction only |
COG ID | [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily |
TIGRFAM ID | |
|
|
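The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that matches the reported 960 bp) and that the protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(1907172, 1908131)
print(length, protein_length_aa(length))  # prints "960 319"
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with an unspliced CDS that ends in a stop codon.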
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTATCA AAGAAGCAGC AATTCGGCCA ATTACCTCCG GATGTGCCGC CGGAGCTGAC CGCAGGTTGG GTTATATTGC CATTATCCTG GCCAGCCTCC TTTACGGGGG TAACGTCATC GCCGGCCGGG TGATCGCTCC CCTGGTACCG CCTCTGGCCC TGGCGGCGGC CAGGGGGTTG CTGGGCCTGC CGGTGCTGCT GCTATTTGCT TTGAAGGCCG GGGGTAAGCC GCGCCTGGCA GACCTCCCTT ACATGGCCCT GATGGGTTTC CTGGGCATTA GCATCGCTTA CGGTACCTTC TCCTGGTCCA TGCAGAACAG CCCGGCCGTC AATGCCGCCA TTATCTTCGC TACCTTTCCC GCTGTTACCC TGGTCCTACT GGCCATCGGC TGGCACGTTA AACCCTCCCG TTACCAGGTA GCGGGCATCA TTATGGCCTT CATCGGCCTG GCTCTGGTCT CAACCCGGGG CTCCCTGGCC CAGCTCCTTG CCATGCGCTT CCAGCCGGTG GACCTGGTCC TCCTGGCGAA TGTCACAGCA GCTTCCCTCT ACAACATCCT GGGGCAACGC ATGGTGGAAC GTTATTCGCC TATTGTTACC AGCACCTATT CCTTGTTCTT CGGTACCCTC TTCCTGCTGC CCGCCGGTTT CTGGGAGGTT AGCCGCCAGG GCTGGTACCT GCCCCCCTCC GGATGGCTGC TCCTCATCTA CATGGGTTGT ATCATCGCCG GGCTGGCGGT TTTACTTACC TTCGAGGCCG TCGAACGTAT AGGGTGTGGG CCGGTAGCCA TGTTTAATAA CTTGAACCCC CTTTTTGCCA TTGCCCTGGC AGCCTTGTTC CTGGGAGAAA AACTGAGCTG GTACCACTGG GCCGGCATTA TCCTGGTCCT GGGCGGGGTA TGCATTTCCC TGCGGCAGCA ACCGGGGCGA CAGCAGGGGC AAAAACAAGG GCAGCAGTAA
|
Protein sequence | MAIKEAAIRP ITSGCAAGAD RRLGYIAIIL ASLLYGGNVI AGRVIAPLVP PLALAAARGL LGLPVLLLFA LKAGGKPRLA DLPYMALMGF LGISIAYGTF SWSMQNSPAV NAAIIFATFP AVTLVLLAIG WHVKPSRYQV AGIIMAFIGL ALVSTRGSLA QLLAMRFQPV DLVLLANVTA ASLYNILGQR MVERYSPIVT STYSLFFGTL FLLPAGFWEV SRQGWYLPPS GWLLLIYMGC IIAGLAVLLT FEAVERIGCG PVAMFNNLNP LFAIALAALF LGEKLSWYHW AGIILVLGGV CISLRQQPGR QQGQKQGQQ
|
| |
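The sequences above are printed in space-separated blocks of 10 characters, so they need cleaning before use. The following is a minimal sketch, using only the standard library, of how the gene sequence might be cleaned, its GC fraction computed, and its translation checked against the protein sequence. The helper names are hypothetical. The codon table uses the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). Only the first three blocks of the gene are used here as sample input.

```python
from itertools import product

# Amino acid assignments in TCAG codon order (same for tables 1 and 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): AMINO[i]
    for i, codon in enumerate(product(BASES, repeat=3))
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the gene sequence, as formatted in the record.
prefix = "ATGGCTATCA AAGAAGCAGC AATTCGGCCA"
print(translate(prefix))  # prints "MAIKEAAIRP"
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.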