Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1902 |
Symbol | |
ID | 3831175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1966875 |
End bp | 1968062 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637829835 |
Product | secretion protein HlyD |
Protein accession | YP_430745 |
Protein GI | 83590736 |
COG category | [V] Defense mechanisms |
COG ID | [COG1566] Multidrug resistance efflux pump |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0432629 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAGTCA AAAAGTGGCC CCCCTTATTG AAAAGATTGT TAATAGCCGG CGCCATCATT TTTGCCGGCG TCGCCTTTGC CCTTATTAGC CGTAGTCCTA AGCCTGCCAC CGTCGGCATA TACGAAGGGC ATCCGGTTCT CTACGGTTCC GGGGTGATCG AGACAACCGA AATTGGTGTT GGCGCTGAAA TTCCCGGCAG GATCCGTCAG GTGCTGGTGC AGGAGGGCCA GTTTGTAACG GCAGGGAGCA TTATTGCTGT TTTAGATGAC GCCGAGCTGC AAGGACAGGT GGCCCAGGCC AAAGCCGCGG TTGCCGGGGC CGAGGCCCAG CTGGCCCAGG TCAGAGCTGT GTATACAGCC GAACAGGCCG GGGTGGAAGG GAACATTCAA ATGGCCCTGG CCGCCTTGCA GAAGCTGACG GCAGGTGCCC GCCAGCAAGA GATTGATGCC GCCCAAAAAA AAGTCGACCA GGCCAGGGCT AAATTACAAA GCGCCCAGGA ACAGCTCCGC CGCATGGAAA CGCTGCACCA GCAGGGGGTT ATCTCCGACG AGCAGTACCA GCAGGCCAAA ACCAATTATG AAGTTGCCCG GGCCGATTTG GGAGCGGCCG AAGACAATTT GTCTTTGCTT GTCTCCGGTT CCCGGCCGGA AGATATAGCC GCCGCCAGGG CCAATTATGA AGTCGCTCTT GCCGGTCGCT CCCAGGTAGA GGCCCGCCGG AAAGATGTAG ATGCCGCAGC CGCAGCCCTT GATAAAGCAA AGGCGGCGTT AAAGACTGCC GAGGAGCAAC TGGCCAAGGC AACTATCCGG GCTAAAACCA GCGGTGTTGT TCTAAGGTGT AATTTTAGTG CCGGCGAGGT TGTGAATCCC GGTATTCCCA TCGTTACCCT AAGCGATCCC GCGGACCTCT GGCTGGCAAT CTACGTTCCC GAGACGGAAA TCGGTAAGGT AAAGGTGGGC CAGCAGGCAG TCGTAACGGT GGATTCCTTC CCGGGTAAAC GCTTTAATGG CAGGGTGAAG GAGATCGCCG GCCAGGCCGA ATTTACGCCC AAAAACATCC AGACCAAGGA AGAGCGGGTA GACCTGGTGT TTAAGGTGAA AATTTCCCTG GCTAACGAAG AACAACTGCT AAAACCGGGT ATGCCGGCGG ATGCCATGGT TTACCTGGAC AGCCAGGAGG CAAATTAA
|
Protein sequence | MLVKKWPPLL KRLLIAGAII FAGVAFALIS RSPKPATVGI YEGHPVLYGS GVIETTEIGV GAEIPGRIRQ VLVQEGQFVT AGSIIAVLDD AELQGQVAQA KAAVAGAEAQ LAQVRAVYTA EQAGVEGNIQ MALAALQKLT AGARQQEIDA AQKKVDQARA KLQSAQEQLR RMETLHQQGV ISDEQYQQAK TNYEVARADL GAAEDNLSLL VSGSRPEDIA AARANYEVAL AGRSQVEARR KDVDAAAAAL DKAKAALKTA EEQLAKATIR AKTSGVVLRC NFSAGEVVNP GIPIVTLSDP ADLWLAIYVP ETEIGKVKVG QQAVVTVDSF PGKRFNGRVK EIAGQAEFTP KNIQTKEERV DLVFKVKISL ANEEQLLKPG MPADAMVYLD SQEAN
|
| |