Gene Information |
Locus tag | Moth_1484 |
Symbol | |
ID | 3832365 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1531169 |
End bp | 1532503 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637829417 |
Product | extracellular solute-binding protein |
Protein accession | YP_430337 |
Protein GI | 83590328 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
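The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the record does not state either convention, and the helper names are hypothetical.

```python
# Values copied from the record above; helper names are illustrative only.
START_BP = 1531169
END_BP = 1532503
GENE_LENGTH_BP = 1335
PROTEIN_LENGTH_AA = 444


def span_length(start_bp, end_bp):
    """Length of a feature, ASSUMING 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp):
    """Residue count for an unspliced CDS, ASSUMING the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# 1532503 - 1531169 + 1 = 1335 bp
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
# 1335 / 3 = 445 codons, minus one stop codon = 444 residues
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the Start bp, End bp, Gene Length, and Protein Length fields agree with one another.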
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000000980441 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAAACT GTAAGTGGCG TAAACCCCTG GGAATTGCCC TGCTAGTGGC AACCATGGCG AGTAGCCTGG TGACAGGTTG CGGTCCGGGA AACCCGGCGA AAGGTACCGG CCAGGAGAAG ACAGGCGGTA AGGTAACAAC AATTGAATAC TGGCATGTTA ACTCGGAAAA CTTTGGCGGG CAGACCGTCC GCGATCTGGT GCAGAAGTTT AACGAACAGC ACCCGGACAT TAAAGTAGTT GAAAAGTTTC AACCCAACAT GTACCTGGGG TTAATGCAAA ATTTACAGGC CGCCCTGGCG GGGGGCCATC CTCCGGCAGT AGCCCAGATT GGCTATAATT ACCTTGACTA CGCCACCGCC AACTTGCCCC ACCTGCCGGT AGAAGACGCC GCTAAAAAGG ATCCCGAGGG GCAGGCTTTT CTAAATAACT ATCTGCCCAA TATCTTAAAC CTGGGCCGGG TTAACGGCAA ACTGGAAGGT ATGCCCTATT CCATCAGCAA CCCTGTCCTA TATTACAACG CCGACATGTT TAAAGCAGCC GGCCTGGATC CTCAAAATCC ACCTAAGACC TGGGCCGAGG TACGGGATAT GGCCAGGATA ATCAAAGAGA AAACCGGCAA CTACGGCCTT TATGTGCAGG AGCCTTCCGA CAACTGGGCC CAGCAGGCCA TGATGGAGTC CAATGGCGCT CAGGTACTGA CCAGGACGGG AGGTAAGGCC AGCGCTACCT TTGATAGCCC CGAAGCCATA GAAGCTTACC AGTTGATGGC CGACATGGTT TTAAAGGATA AGACGGCCTT GCACGCCACC TGGGAGGAAG GTACCCAGGC CTTTATTACG GGAAAAGTGG GAATGTATGT TACAACTATT GCCAGGAGAA ATTATATTGA AACTTCTTCT AAGTTTAAGG TTTTAGCGGC TCCTTTTCCA ACCTTCGGTA ACAAACCGCG GCGGGTCCCG GCCGGAGGTA ATGCCCTGTT TATCTTCGCT AAAGATCCTG ACCAGCAAAA GGCTGCCTGG GAGTTCATCA AGTACCTGGA ATCCCCCGAG GCTTTAACAA CCTGGACCAA AGGTACTGGC TATCTGCCTC CCCGGAAAGA TGTGACCGAA GATCCCAACT ACCTGAAACC TTTCATGGAC CAGAATCCGT TAATGAAGCC GGCGGCAGCT CAGCTTCCCG ACGCTGTTCC CTGGGTAAGT TTTCCCGGCA ATAACGGTCT TCAGGCCGAA CAGATCCTCC TGGACGCCAG GGATGCCATC CTCGGCGGTC GCCAGTCGGC AGCAGAAGCC CTGAAGGAAG CCGTGGCTAA GGTAAATAAA TTAATCGGCA ATTAA
|
Protein sequence | MINCKWRKPL GIALLVATMA SSLVTGCGPG NPAKGTGQEK TGGKVTTIEY WHVNSENFGG QTVRDLVQKF NEQHPDIKVV EKFQPNMYLG LMQNLQAALA GGHPPAVAQI GYNYLDYATA NLPHLPVEDA AKKDPEGQAF LNNYLPNILN LGRVNGKLEG MPYSISNPVL YYNADMFKAA GLDPQNPPKT WAEVRDMARI IKEKTGNYGL YVQEPSDNWA QQAMMESNGA QVLTRTGGKA SATFDSPEAI EAYQLMADMV LKDKTALHAT WEEGTQAFIT GKVGMYVTTI ARRNYIETSS KFKVLAAPFP TFGNKPRRVP AGGNALFIFA KDPDQQKAAW EFIKYLESPE ALTTWTKGTG YLPPRKDVTE DPNYLKPFMD QNPLMKPAAA QLPDAVPWVS FPGNNGLQAE QILLDARDAI LGGRQSAAEA LKEAVAKVNK LIGN
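The relationship between the two sequences can be illustrated with a short translation sketch. It assumes the gene sequence is displayed 5' to 3' in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The function name `translate` is hypothetical, and the codon table is built by hand instead of with a bioinformatics library.

```python
from itertools import product

# Amino acids in NCBI order (bases T, C, A, G; first base varies slowest).
# Table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence shown above.
print(translate("ATGATAAACT GTAAGTGGCG TAAACCCCTG"))  # MINCKWRKPL
print(translate("GTAAATAAAT TAATCGGCAA TTAA"))        # VNKLIGN
```

The two printed fragments match the first ten residues and the last seven residues of the protein sequence in the record.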