Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1952 |
Symbol | |
ID | 3832302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2028770 |
End bp | 2030287 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637829883 |
Product | ABC transporter related |
Protein accession | YP_430793 |
Protein GI | 83590784 |
COG category | [R] General function prediction only |
COG ID | [COG3845] ABC-type uncharacterized transport systems, ATPase components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00677494 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAGTAC CGCTGGTTGA AATGCGGGGG ATAACCAAGG TTTTTCCGGG CGTGGTGGCC AACGAGGGGG TCAACCTGGC CGTCCACGCT GGGGAAATTC ACGCCCTGCT GGGGGAGAAT GGGGCGGGAA AAAGTACCCT GATGAGTGTT CTTACGGGCC TGTACCGGCC CGATGGCGGG GAGATCTACC TGGACGGCCG GAGGGTCAAT TTCCGCTCGC CCCGGGACGC CATTGAGGCG GGTATCGGCA TGGTCCACCA GCACTTTCGG CTGGTGGCGC CCTTCACGGT AACGGAGAAT GTGGCCCTGG GGCTTAAAGG CGGCCTTAAA CTCAACGTAA ACCGGCTGGC CGGGGAAATA GCCGCGCTTT CTAAAGAATA CGGCCTCCAG GTGGACCCCC AGGCGCGGAT CTGGCAGCTT TCAGTGGGGG AACAACAGCG GGTCGAGATC ATCAAGCTCC TGTACCGGAA GACCCGGGTC CTGATTCTCG ATGAACCGAC GGCCGTCCTG ACCCCCCAGG AAGCCCGGGA TCTGTATCGG ACCTTGAAAA GAATGGCTGC CCAGGGCTGC GCCGTGATCT TCATTACCCA TAAACTCCAG GAAGTTATGG ACGCCGCCGA TACCATTACT ATTCTCCGGG GTGGCAAGAC GGTGGCCACA GTGAAAAAGA GTGAAACCAG CGAAAAGGAA CTGGCCCGGA TGATGGTCGG CCGGGAGATT GTCTGGCAGG GGGATAAGCC TGTAGCTAAA AAGGGCGAAA AGATCCTGGA GATCAGAGAT TTAAGGGCTC TAAACGATAA AGGCCTGCCA GCTCTACGGG GTATCAACCT GGAGGTCTTT GCCGGTGAGA TCCTGGGTAT TGCCGGGGTG GCCGGCAACG GCCAGCGGGA GTTGGCCGAG GTCATCGCCG GCTTGCGGCC CTGCCAGGGC GGCAGCATTA CCGTTTCCGG GCAGGAACTA GGCCAGTGCG ACCCCTGCCG GGTAATCCAG GCCGGAGTGG GCTATATACC CGAAGATCGC CTGGGTACGG GCCTTATACC CAACCTGGGA GCTGTCGACA ACCTGCTCCT GAAGGAATAC CGCCATCCCC GCTGGGGTAG GACCATCATG AACCGGAGGG CAGCGCGCCA GTGGGCCGGC GAACTGGTGC AACACTTCAA AGTAAAGATG GCCGGTCTGG ACGCCCCGGT GAAAATGATG TCCGGCGGCA ACCTGCAGCG GCTGCTCCTT GCCCGCGAGA TTTCCTCCCG GCCGCGCCTG TTGGTGGCCG TCTACCCGGC CCGGGGCCTG GACGTTGGCG CTACGGAAAC GGTCCACCGC CTGCTCCTGG AGCAGCGGGC CGCCGGTACG GCCATTCTTC TGATTTCCGA GGACCTGGAG GAGCTCTTCC GCCTGGCCGA CCGCATAGCC GTAATGTATG AAGGCCAGAT TATGGGCTTG ATGCCCACTG AAAAGGCCAG CGTGGAGGAG CTGGGCCTGA TGATGGCCGG GGCGAAAAGA ATGGAGGTCG GAGCCTGA
|
Protein sequence | MAVPLVEMRG ITKVFPGVVA NEGVNLAVHA GEIHALLGEN GAGKSTLMSV LTGLYRPDGG EIYLDGRRVN FRSPRDAIEA GIGMVHQHFR LVAPFTVTEN VALGLKGGLK LNVNRLAGEI AALSKEYGLQ VDPQARIWQL SVGEQQRVEI IKLLYRKTRV LILDEPTAVL TPQEARDLYR TLKRMAAQGC AVIFITHKLQ EVMDAADTIT ILRGGKTVAT VKKSETSEKE LARMMVGREI VWQGDKPVAK KGEKILEIRD LRALNDKGLP ALRGINLEVF AGEILGIAGV AGNGQRELAE VIAGLRPCQG GSITVSGQEL GQCDPCRVIQ AGVGYIPEDR LGTGLIPNLG AVDNLLLKEY RHPRWGRTIM NRRAARQWAG ELVQHFKVKM AGLDAPVKMM SGGNLQRLLL AREISSRPRL LVAVYPARGL DVGATETVHR LLLEQRAAGT AILLISEDLE ELFRLADRIA VMYEGQIMGL MPTEKASVEE LGLMMAGAKR MEVGA
|
| |