Gene Information |
Locus tag | Moth_1688 |
Symbol | |
ID | 3833288 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1726028 |
End bp | 1727158 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637829613 |
Product | glycine betaine/L-proline transport ATP binding subunit |
Protein accession | YP_430533 |
Protein GI | 83590524 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1125] ABC-type proline/glycine betaine transport systems, ATPase components |
TIGRFAM ID | [TIGR01186] glycine betaine/L-proline transport ATP binding subunit |
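The coordinate and length fields in the table above are related by simple arithmetic. The following is a minimal Python sketch of that check; the variable names are illustrative and not part of the database, and it assumes inclusive coordinates and a single terminal stop codon (consistent with "Is gene spliced: No").

```python
# Values copied from the Gene Information table.
start_bp = 1726028
end_bp = 1727158

# Assumption: start and end coordinates are both inclusive,
# so the gene length is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets. Assumption: the last codon is a stop
# codon and contributes no amino acid to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1131 376
```

Both results agree with the "Gene Length" and "Protein Length" rows reported for Moth_1688.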
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.315313 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.421544 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTACCT TCGAACACGT CTCGAAAGTA TATGACGGCA ATCGAATAGC CGTAGCCGAT TTCAACCTGG AAGTCGAGGC CGGAGAATTT ATTGTGTTAA TTGGCCCCAG CGGTTGTGGC AAGACCACCA CTTTAAAAAT GGTTAACCGT CTTATCGAAC CCACCTCCGG GGCCATCTAC CTTAACGGCA AGGATATCCG GGAACAAAAT CCCGTGGCGT TACGGCGACA CATTGGCTAC GTTATCCAGC AGATAGCTCT TTTTCCCAAC ATGACTATTG CTCAAAACGT GGATGTGGTA CCCCGCCTGC TGGGATGGCC GGCAGAACGC CGCCGCCAGC GCGTTTGCGA ATTATTGGAA CTGGTGGGTA TGGACCCTGA TGACTACGCT GACCGTTACC CTTCAGAGCT AAGCGGGGGG CAGCAACAGC GTATCGGGGT GTTGCGTGCC CTGGCGGCAG AACCACCGCT TATCCTTATG GATGAGCCTT TTGGTGCCCT TGACCCAATT ACGCGGGAAA ACCTGCAGGA AGAATTGAAG GCCTTGCAGG CCAAGCTGCA TAAGACCATT CTCTTTGTTA CCCACGATAT GGACGAGGCA CTGAAAATTG CTGATCGGAT TGTGGTAATG AAAGACGGCT ACATCGTCCA AGTCGCTGCG CCTGAAGAAC TGTTGCGGCA CCCCGCCAAC GAGTTCGTGG CCTCGTTCAT CGGCAAAGAA CGGTTGGCTC CTGGACTGGA ATTGCGCACC GTAGAACAGG TTATGATTGG TGAACCGGTG ACGGTACGGC CCCATACGGG TGTTGCCGAA GGAGTTGCCA CCATGCGTCG TAAAAAGGTG GATACGCTGC TGGTTACCGA TGAATCTGGC CGGCTGTTAG GCGCCGTTTC TATCGAGGAA TTGAATCGCA ACTACCAGCG GGCTCACCAG GTGCAAGATT TGATGGCTCG TGACGTTCCT GTAGTGTTCG AGGGAACCCC GGCCCGGGAG GCCTTTGACC TGATCACCCG GGAGCGGCTG GAGTACCTGC CGGTAATCGA TAAGGAGGGC CGCTTGAAGG GACTGGTCAC CAGGACCAGC ATGGTCAATG CCCTGGCATC CGTGGTGTGG GGAGATGAGG CTAGTGCTTA G
|
Protein sequence | MLTFEHVSKV YDGNRIAVAD FNLEVEAGEF IVLIGPSGCG KTTTLKMVNR LIEPTSGAIY LNGKDIREQN PVALRRHIGY VIQQIALFPN MTIAQNVDVV PRLLGWPAER RRQRVCELLE LVGMDPDDYA DRYPSELSGG QQQRIGVLRA LAAEPPLILM DEPFGALDPI TRENLQEELK ALQAKLHKTI LFVTHDMDEA LKIADRIVVM KDGYIVQVAA PEELLRHPAN EFVASFIGKE RLAPGLELRT VEQVMIGEPV TVRPHTGVAE GVATMRRKKV DTLLVTDESG RLLGAVSIEE LNRNYQRAHQ VQDLMARDVP VVFEGTPARE AFDLITRERL EYLPVIDKEG RLKGLVTRTS MVNALASVVW GDEASA
|
| |
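The gene sequence above is displayed in space-separated blocks of ten bases, and it already reads in coding orientation (it begins with ATG and ends with TAG) even though the gene lies on the minus strand. As a hedged illustration of how the "Gene sequence" and "Protein sequence" rows relate, the sketch below strips the spacing, computes GC content, and translates a sequence. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in the permitted start codons, which this sketch does not model.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Amino-acid assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence, copied with the record's spacing.
prefix = "ATGCTTACCT TCGAACACGT C"
print(translate(prefix))  # MLTFEHV
```

The translated prefix matches the first seven residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` and `translate` would be the way to compare against the reported "GC content" and "Protein sequence" rows.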