Gene Information |
Locus tag | Moth_0465 |
Symbol | |
ID | 3832403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 469181 |
End bp | 470224 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637828400 |
Product | ABC transporter, substrate-binding protein, aliphatic sulphonates |
Protein accession | YP_429339 |
Protein GI | 83589330 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
|
|
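The coordinate, length, and protein-length fields above are mutually consistent, and the arithmetic can be sketched as below. This is only an illustrative check: the variable names are hypothetical, and it assumes 1-based inclusive coordinates and a final stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 469181
end_bp = 470224

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1044 bp) and Protein Length (347 aa).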
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.176308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGAA AACTAAAGCG AATCGCCATC TTGTTCCTGG CGCTGGCCCT GGCCGGCCTG GCCCTGGCCG GTTGCGGGGG CGGTAGCAAA CAGCAGGCTG CAGACCAGCA GACCGGGAGT AGCGGCCAGG GGCAACAACT CACCCCCGTC AAGCTCACCA TGACCACCTG GTCGGGCTAC GGTCCCCTCT TCCTGGCCCG GGATAAAGGC TTCTTCAAGA AACACGGCCT GGATGTACAG TTAATCGTCA TCCAGGGCCT GGGCGAGCGC AAGCAGGCCC TGGCCGGTAA CCAGGTGGAC GGCATCGCTA CCACCCTGGA TATTGAAACC CAGATTGTAG CTGCGGGCAT ACCCCTGAAA CAGATCTGGG CCCTGGACGA TTCCTATGGC GGCGACGGCA TCCTGGCCAA ACCGGAGATC AAGACCATCA AGGACCTCAA AGGTAAAAAC GTAGCCTACG ACTTCGGCAC CGCCAGCCAC ATCCTGCTCC TCTCCATCCT GGCCAAAAAC GGCATGACTG AAAACGACAT CCACCACGTC CAGATGTCAG CCAGCGACGC CGGGTCGACC TTCGTGGCCG GCAAAGTAGA TGCCGCCGTC ACCTGGGAAC CCTGGCTGAG TAAGGCCGTT AAAGAGAACA AGGGTAACCT CCTGGCAACC TCCAAGGAGA CCCCTGGGCT GATTATGGAT ACAGTCGCCC TCCGGAGCGA CTGGGCCGAC AAACACCCCC AGGCTCTCCA GGCCATGGTC GACGCCCTGG CGGAAGCCAT GCAGTACTGG GAAAGCAATA AGGCCGAAGC CAATGCCATT ATGGCCAAGG GACTGGGCAT CAAACAGGAA GAGTTCGAGA GCAACCTGCA GACCCTGCGC CTCTTCAACC TGGCCCAGAA CAAGGAGATG TTCGGCACGG CCGACAAGCC AGGAACCCTC TACACCTCCT TGCAGCAGGC AATCGACTTC GGCTTTAACA ACAAAGTAAT TAAATCCAAA CCCGATGCTA AAGCCATGAT CGACCCGACC TTTGTCAACA GGGCGAAAAT ATAA
|
Protein sequence | MKRKLKRIAI LFLALALAGL ALAGCGGGSK QQAADQQTGS SGQGQQLTPV KLTMTTWSGY GPLFLARDKG FFKKHGLDVQ LIVIQGLGER KQALAGNQVD GIATTLDIET QIVAAGIPLK QIWALDDSYG GDGILAKPEI KTIKDLKGKN VAYDFGTASH ILLLSILAKN GMTENDIHHV QMSASDAGST FVAGKVDAAV TWEPWLSKAV KENKGNLLAT SKETPGLIMD TVALRSDWAD KHPQALQAMV DALAEAMQYW ESNKAEANAI MAKGLGIKQE EFESNLQTLR LFNLAQNKEM FGTADKPGTL YTSLQQAIDF GFNNKVIKSK PDAKAMIDPT FVNRAKI
|
| |
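The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched as follows. This is a minimal illustration, not the database's own pipeline: the function names are hypothetical, only the first 30 bases of the gene are used as input, and the codon table is written out by hand. Translation table 11 assigns the same amino acids to elongation codons as the standard code; it differs in the set of permitted start codons, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G at each position).
# Elongation codons in translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces that split the sequence into 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
gene_prefix = "ATGAAAAGAA AACTAAAGCG AATCGCCATC"
protein_prefix = translate(gene_prefix)
print(protein_prefix)
print(round(gc_fraction(gene_prefix), 3))
```

The translated prefix matches the first ten residues of the listed protein sequence (MKRKLKRIAI). Passing the full gene sequence to the same functions would allow the GC content and protein fields to be checked in the same way.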