Gene Information |
Locus tag | Moth_0422 |
Symbol | |
ID | 3832105 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 425616 |
End bp | 426623 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 637828357 |
Product | TRAP dicarboxylate transporter, DctP subunit |
Protein accession | YP_429296 |
Protein GI | 83589287 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family |
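The coordinate and length fields in this record can be cross-checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon; the variable names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 425616
end_bp = 426623

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1008
print(protein_length)  # 335
```

Both results agree with the Gene Length (1008 bp) and Protein Length (335 aa) fields.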
| |
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0386789 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTAAGT GGAAAAAATT AATTCTACCA TTATTATTGT TAATAGCAAT GCTAATTATG AGCGCATGTG GTAAATCTCC GGCAGATAAA CAACAGGATA GCAACACATA TACCCTTAAA ATTCATATGG TAATAAATGA GCAGGATCCT ATATATCAAG GCTATGCTGA GTTCAAAAAG GGGGTAGAAG CACGGACAAA TGGCAAAGTA AAGGTTGAGC TTTATCCCAA CGGTGTGTTG GGTAACGATG AAGATTTACT TCAACAGGCA ATGCTAGGCG GGAACGTAGC GGTTAATTCG GACGCTGGAC GGCTAGGAGT ATGGGTACCG GAGATGGGTA TTTTGTTAGC GCCGTATCTT ACCGACAATG CAAACGAGAT GCAAAAATTG CTAAAATCTG ATCTTGTAAA GAAATGGACG GATGAACTTG CCCAGAAAAA AGGGTTAACT GTACTTTCGT TCAATTATTA CGCTGGTGCT CGCCATTTTA TAACTAAAAA ACCAATTTCT ACACCTGAAG ATCTAAATGG ATTAAAAATA AGAACTCCTG GCTCACCTGT TTGGCAGGAA ACTATCAAAG CTATGGGGGC CACTCCTGTT GCTTTACCAT GGACTGAAAC TTATCCTGCT ATAGAGCAGG GAGTCATTGA TGGCGCCGAA GCCCAAGATT CTGCGACTTA TGGTGCTAAA ATTTATGAGG TTGCCAAGTA TATCACGAAA ACCGGCCATA TTCAGCTCTG GAATTGCTTA GTGGTAGGAA CAAAATGGTT CGAACAACTT CCCAAAGAAT ACCAGCAAAT CCTGATAGAA GAATCAATTA AGGCCGGAGA CTTTACTACT AACAAGGTCT TGGCGAGTGA AAAAGAATTA GAAGATAAAA TGGCTGCAAC TGGCGCTATA ATTAAAGAAG TAGATATTAA ACCCTTTAAA ACAAAGTCAG AAATAGTATA TAGCAAATTA AACTACCAGG ATTTGCGCCA GGAAGTCAAT AAAGTGTTAG GGAAATAA
|
Protein sequence | MFKWKKLILP LLLLIAMLIM SACGKSPADK QQDSNTYTLK IHMVINEQDP IYQGYAEFKK GVEARTNGKV KVELYPNGVL GNDEDLLQQA MLGGNVAVNS DAGRLGVWVP EMGILLAPYL TDNANEMQKL LKSDLVKKWT DELAQKKGLT VLSFNYYAGA RHFITKKPIS TPEDLNGLKI RTPGSPVWQE TIKAMGATPV ALPWTETYPA IEQGVIDGAE AQDSATYGAK IYEVAKYITK TGHIQLWNCL VVGTKWFEQL PKEYQQILIE ESIKAGDFTT NKVLASEKEL EDKMAATGAI IKEVDIKPFK TKSEIVYSKL NYQDLRQEVN KVLGK
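The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (the tables are understood to differ only in permitted start codons), and that the space-separated 10-character blocks are purely a display convention. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and are not part of any database tooling.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
prefix = "ATGTTTAAGT GGAAAAAATT AATTCTACCA"
print(translate(prefix))  # MFKWKKLILP
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein Length and GC content fields to be checked in the same way.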