Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2331 |
Symbol | |
ID | 3831083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 2450965 |
End bp | 2452164 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637830255 |
Product | hypothetical protein |
Protein accession | YP_431161 |
Protein GI | 83591152 |
COG category | [V] Defense mechanisms |
COG ID | [COG0842] ABC-type multidrug transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCAGC TCCTCAACAT CGCCCACTAC GAAATGCTCC ATATTTTTAA AGAGAAAATC CTTTTTTTAA TGGTTTTCCT GGTCCCCCTC GGCTACGCCG CCCTTTTCGG CGCCGCCTAT GTCACCGCCG TCCTTAACAA CGTGCCTATA GCCATCGTCG ACCTGGATGA CTCAAAACTC AGCCGGGAAA TCGCATCCGC CTTTGCCAAC AGCCCCCACT TCAAGGTGGT GGACGACATA AAGACCTATC CTGAACTTCA GGAGGGGATG AAAAACGGCA GGGTGCGGGC CGGCGTGGTC ATCCCGGAAC ACTTTGAGCA GAAGTTGGCC CGGCACGAAT TGACCCGGGT ACTGACCGTT TACGACGGGT CCAATTTGAT CTGGGGTTAC AATACCCGCA AATACATCCG CGAGGTATTT AACGAGTTCA GCGCCAGCAG CACGGCCTCC TACCTGGCAG GTATGGGCTA TACTAAAAAC GAGATCCGTT CCATTATGGA CACGGTTTCC CTGAACACCG AGGTCTGGTA CAACCCCACT TTCAGCTACA CCAATTTTCT CTTCCTGGGA CTGATCATGA TGATCTTGCA CCAGATCGGC CTCCTCAGCG TCAGCCTGAC CGTAACCCGG GAGAAAGAGC GCAAAACCTG GCTGCAGTTT TTGAGCGCGC CCGTACCGGC ATGGAAAATC TTTACCGGTA AGGCTATCCC GTATTTCACC GCCAACTTCT TTAATTACGC CCTCTTGCTC TGGTTCGCCT CCCGCTTCGT CCACGTGAAG ATCGGCGGCT CCCTGGGTCT AATCCTTGTG CTCGGCCTTC TCTACGATCT AGTCATCACC GGTGCCGGTT TCTTAATTTC CCTCCACGCA TCCAACTCCC TGCAGGTTAC CAGGTACGTG ATGCTTTTGT CCGTACCCTT CTTTATGATT TCCGGTTATA CCTGGCCCGG AACCCATATA CCGGTTTTTA TCAATTACCT GGCGCGGTTG CTGCCCTCCA CCTGGATGGT TCTGGGCTTC CGGCAGGTCG CGCTAAAGGA GCTTGATATG AGCTATATGC TGCCCTACAT CCGGGCCCTG GGCCTGATGG CCGTCCTGGC GCTATTGCCG GCCGTAACCT TTGCCAAGCG GCTCAGGCCG CGCCCGCAAG GCGGCCCGGT GATAAACAAC GGGCCCTCGT ATCCGGCCCG CTGGAAATAA
|
Protein sequence | MRQLLNIAHY EMLHIFKEKI LFLMVFLVPL GYAALFGAAY VTAVLNNVPI AIVDLDDSKL SREIASAFAN SPHFKVVDDI KTYPELQEGM KNGRVRAGVV IPEHFEQKLA RHELTRVLTV YDGSNLIWGY NTRKYIREVF NEFSASSTAS YLAGMGYTKN EIRSIMDTVS LNTEVWYNPT FSYTNFLFLG LIMMILHQIG LLSVSLTVTR EKERKTWLQF LSAPVPAWKI FTGKAIPYFT ANFFNYALLL WFASRFVHVK IGGSLGLILV LGLLYDLVIT GAGFLISLHA SNSLQVTRYV MLLSVPFFMI SGYTWPGTHI PVFINYLARL LPSTWMVLGF RQVALKELDM SYMLPYIRAL GLMAVLALLP AVTFAKRLRP RPQGGPVINN GPSYPARWK
|
| |