Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1901 |
Symbol | |
ID | 3831174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1965900 |
End bp | 1966871 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637829834 |
Product | ABC transporter related |
Protein accession | YP_430744 |
Protein GI | 83590735 |
COG category | [V] Defense mechanisms |
COG ID | [COG1131] ABC-type multidrug transport system, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.298526 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTAATG ACAATCCTGC CATGATCATT GCAGCGCGCG GTCTGGTGAA AAGCTTCGGA CCGATCCGGG CCGTGGATCA CATTGATTTA CAGGTGGAGA AGGGGGAAAT TTTCGGCCTG GTAGGACCGG ACGGGGCCGG GAAGACAACG ACCATGCGCA TGCTGGCAAC TATCCTCCCG GCTGATGCCG GAGCAATATC CGTTCTGGGT TATGACGGCC GGACGGAAGC TGAGCGCATT AAGGAGCACA TTGGTTATAT GCCCCAGCGG TTCAGCCTGT ACGGGGATCT AACAGTGGCG GAAAACCTGG AATTCTACGC TGAAATCTAT GAAGTTCCCC GAAAGGTGCG GGAGCAGCGC AAAAAAGATC TCCTGGCGTG GGCCAACCTT ACCCGGCATA GCTATAAACA GGCCGATCAG TTGTCGGGGG GAATGAAACA AAAACTGGCC CTGGCGTGTA ACCTGATCCA CGAACCCGCC GTTCTTTTCC TGGACGAACC CAGCACGGGA GTAGACCCGG TGGCGCGGCG CGATTTCTGG CGCATCCTCT TCAGGTTGCG CGAGGAGGGG GCGACGATCA TGGTCAGCAC GCCTTATATG GATGAGGCCG AGCGTTGCGA CCGCATCGCC TTTACTTATA ACGGCCGCAT CCTGACTTGC GGTACCCCGG CGGCAGTTAA GAATTTATTC CGGGGCCAGC TCCTGCTCTT GCGTGCGGAG ACTATTGCCA TGCTCCACGC CGCCAGGGAC TACCTCCGCC GGGAACAATT GCTGGCTGAT GTCTTGATTT ATGGCGACGC TTTGCACCTG GTGACCGACG ATGCCCTGGA AACGGCAAGG CTTCTACCGG GGCTCCTGGA ACGCCAGGGT ATCCGGGTTA CCCATCTCCA GCCTATTCCT CCTTCTCTGG AAGATACCTT TGCTTACCTG GTCAGACAGG CAGGAGGATT CGCGGGGAGG GAGTCCGCTT GA
|
Protein sequence | MVNDNPAMII AARGLVKSFG PIRAVDHIDL QVEKGEIFGL VGPDGAGKTT TMRMLATILP ADAGAISVLG YDGRTEAERI KEHIGYMPQR FSLYGDLTVA ENLEFYAEIY EVPRKVREQR KKDLLAWANL TRHSYKQADQ LSGGMKQKLA LACNLIHEPA VLFLDEPSTG VDPVARRDFW RILFRLREEG ATIMVSTPYM DEAERCDRIA FTYNGRILTC GTPAAVKNLF RGQLLLLRAE TIAMLHAARD YLRREQLLAD VLIYGDALHL VTDDALETAR LLPGLLERQG IRVTHLQPIP PSLEDTFAYL VRQAGGFAGR ESA
|
| |