Gene Information |
Locus tag | Moth_0665 |
Symbol | |
ID | 3832152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 696946 |
End bp | 698412 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 637828604 |
Product | hypothetical protein |
Protein accession | YP_429534 |
Protein GI | 83589525 |
COG category | |
COG ID | |
TIGRFAM ID | |
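The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 696946
END_BP = 698412


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1467 488"
```

Under these assumptions the computed values agree with the listed Gene Length (1467 bp) and Protein Length (488 aa).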
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.000378366 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000415551 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGCTGATTA ACTCTTTTAA TAATTATAAT AAATTAAGCC GATCTTTAAA AAATAGTTTA TCTTTGTGTT TACATTTTTT AACATTATTT TCAATGATCT TGGCAGCTAT ATTAGCAATT ATTCATCTTG CGCAACTTTC AAGATGTTGG TTAATTGTTT CCGTATTTAT TGCTATGATA GTAGCTTTTA TTCTTGGAAA CCCTCATGGG ATTATAACAC GTTCCTCTAC CAAATCATTT CTTTATATTA TGTTAGTAGT AACTCTATTA ATTCGCATGA TATGGATAAT TATGGTACCA TCTTTACCGG TATCAGATTT TGCGGGATAT CATTTCATGG CTCTTGACAT GTCACAGCTC GATTTTAAAC ATGACACAGT ACAGGAAGTA GGATATCCTG TCCTGCTCGG AATACTTTAT GCAGTATTCG GAGGCCATGT GCTTGTAGGA AAAATACTTA ATGTAACTTT AAGTTTATTA ACGGCTTTGG TGTTATTTCT TTTAGCCCGC GAAGCCTTTA GTGAATTCTC GGCGCGAAAC GCTTTGATCT TGTTTTCGCT ATGGCCTGCT CAAATAATGA TGAATAGCGT CTTGGCTTCA GAAGGACCAT TTTTACTTAT GTTTTTAGTG GTCTTGCTAT TGCTTGTTAA GGCAAAATCT GATGACTCCA AACGTCGTCA AGCATTATGT TTTATGGCGG GTCTTATACT CGGGCTATCT TGTACTATTA GAGCGGTAGG GTTTCTTCTA CTTTTTGTTG TTTTAGCCTA CATATGCTTT ATAGATGGTA ACAAGACCAA CAAATGGCAA ATACTAAAAT TATTGCTGGC AGGCTTTTTT TTGGTGGTTA TACCATATTA CCTATTTCGA TGGTTAACCT TCAAGATCCC ACCTACTGTA TCCTCTTTAC CTTTTAATTT ACTTTATGGA ACTAACATTG GATATATAGG TATGTGGAAT CCCGAGGACG CTGCTTTAGC TCATAAACTG ATCGAACAGT ACGGTTCAAG GGCTTCGAGA TATATACTTG GAGTTGCATT TTCTAGGATG ATCTCTAATC CCATGGGCCT TTTGAAGCTG ATGTGTAATA AATTTGAGGT AATGTGGAGT GACGATGCGT ACGGTGCATA TTGGAGTACT ATTAACATTG CTCCTAACTT AATTGAGATC CCTATTATAC GTGTAGACCT ACTTTATATA TTGTCCCAAC TTTATTATGT TTTTATGTTA TGTTTAGCAA TCATCGGCCT GATTAAGATA CAAAAAACAA TAAAGATAAA GATGCAAAGA GATAAGATAT ATAACGTTTC GTTGCTTTTC GTTTTAGTTA TTATTGGATT TGTAATCCTA CATTTATTTA TTGAGGTTCA ATCTAGATAC CACTATCCAG TAGTTGCCAT AATAATTTTA TTAGCCGGTT ACGGTATTAC AGAAAATAGT GTCAACTTTA AGAACGATAG CATCTAG
Protein sequence | MLINSFNNYN KLSRSLKNSL SLCLHFLTLF SMILAAILAI IHLAQLSRCW LIVSVFIAMI VAFILGNPHG IITRSSTKSF LYIMLVVTLL IRMIWIIMVP SLPVSDFAGY HFMALDMSQL DFKHDTVQEV GYPVLLGILY AVFGGHVLVG KILNVTLSLL TALVLFLLAR EAFSEFSARN ALILFSLWPA QIMMNSVLAS EGPFLLMFLV VLLLLVKAKS DDSKRRQALC FMAGLILGLS CTIRAVGFLL LFVVLAYICF IDGNKTNKWQ ILKLLLAGFF LVVIPYYLFR WLTFKIPPTV SSLPFNLLYG TNIGYIGMWN PEDAALAHKL IEQYGSRASR YILGVAFSRM ISNPMGLLKL MCNKFEVMWS DDAYGAYWST INIAPNLIEI PIIRVDLLYI LSQLYYVFML CLAIIGLIKI QKTIKIKMQR DKIYNVSLLF VLVIIGFVIL HLFIEVQSRY HYPVVAIIIL LAGYGITENS VNFKNDSI
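The relationship between the two sequence fields, and the GC content field, can be reproduced with a short script. This is a hedged sketch, not the database's own method: it assumes the sequences are written in space-separated blocks of ten characters, builds the codon assignments used by translation table 11 (which match the standard code for elongation codons), and stops at the first stop codon. Alternative start codons, which table 11 reads as methionine at position 1, are not handled; this gene starts with ATG, so that case does not arise here. The helper names are hypothetical.

```python
from itertools import product

# Codon assignments for NCBI translation table 11, in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 until the first stop codon (stop not emitted)."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
first_blocks = "ATGCTGATTA ACTCTTTTAA TAATTATAAT"
print(translate(first_blocks))  # prints "MLINSFNNYN"
```

Passing the full Gene sequence field to `translate` and `gc_percent` allows the Protein sequence and GC content fields to be checked against the nucleotide data.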