Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0921 |
Symbol | |
ID | 3831310 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 955161 |
End bp | 956591 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637828852 |
Product | hypothetical protein |
Protein accession | YP_429781 |
Protein GI | 83589772 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000000581224 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAAACG GTAGTACCAG TACCGCTACC ATCATTAGTT TATTAGATGG TATCGAGAAT TTCAACACTC TTGAAGAAGT TATCCTGCAA ATAGCCAGGA GGTTATTGGT AGCCGTACTG GAAGCCCTGG ATGATACCCT TATGCCGGCA AAACCTAAGG GATATAAGAT AGCTGGGTTC CGCTACCGCA CAATCACCTG CCTGTACGGG GATATAACCT TTAAGCGCCG GCTATATGTT AAAGCAACGC GCAAAAAGAA AAGAGGGGAA GGAAGGTTTC TATTAGACGA AGCCCTAAAC TTACGCCAAG GAAAGCGTCT GACAGGAAGA CTGCTCAAAT TAGCCGTATC GCTGGCAACC CGGTTACCCT TCAGGCAGGC AGCGGAAATA ATGGCCGAAG CAGGGATGGG CCAATTAAGT CATATGACCA TCCATAGCGA AGTAAAACGA AATGGACTGG AACAAAAAGA ACTGCAAGAA GCCCTGCGCA ATAATCTATT CATGAGCGGG GAAGAGCCCC AAGGCAAAAA GAAAAAAGTA CCGGTACTAT TTATCGAAGC CGATGGTATA ATGATCCCGC TGCAAAGGAG CAAGCAAGAC CGGATAGAAG TCAAAGTAGG AATAGTTTAC GAAGGGTGGA TAGAAAAAGG GAATGCCCGG CATCTCAAGA ACCCGCGGGT AGTGATGGGC ATCTATGAAG ATGGAGAACA ATTTTGGGAA GCCCTCACCA CGGAAATAGC CAGGTACTAC GAGATAGACG AAAAAACAAT ATATGTCGTC AATGGCGACG GAGCCAGCTG GGTCCAGAAG ACAGCCAAAG AACAGTTACC AGGAGCCATC GTACAATTAG ACCGCTACCA CCTCCACCGG GATATAAGGC AGGCATATGG GAACGAAACA GCGCAGGGAT TAATGAAGAC TTTAGCCAAA GGTCAAGAGC AGGTCTTTTT AGACACCCTG GAAGCACTCA TAGAAGAAGC ACCGAACCGC AAAAACAAGC AACAATGCCA AAAAGTATAT GACTACTGTC AAAGATATCG CGATAACTTG TTAGATTACC GCTTGCGGTT ACCACGACAG CTGGAAGGGC AAAAGTTATA CGGGATGGGC GTAGCCGAAA CAACAGTAGA CAAAAAAATA GCCATCCGCA TGAAAAAGAG GGGGATGAGC TGGAGCGAAG CAGGAGCAAC GGCCATGGTA GCATTACTAA TGCTCAAAGC CAATGGAGAA TTAGCCGCAT GGTTAGAAAA AAAGATGCCC CAAGTAGAAA AGAATCCCGT TAAAGTAATA AAAGAAAAGA AGATAAGTAA AGAAGACGTA GAAGAATGGT TAAGGAAGAG AGTACCAGCC CTTGTTGGCC CTGAGGCGGG AACAGATTGG GTTAAATATA CCATGAGGCA ACTAACAAGA ATTAGTGGAG CTATATTCTA A
|
Protein sequence | MVNGSTSTAT IISLLDGIEN FNTLEEVILQ IARRLLVAVL EALDDTLMPA KPKGYKIAGF RYRTITCLYG DITFKRRLYV KATRKKKRGE GRFLLDEALN LRQGKRLTGR LLKLAVSLAT RLPFRQAAEI MAEAGMGQLS HMTIHSEVKR NGLEQKELQE ALRNNLFMSG EEPQGKKKKV PVLFIEADGI MIPLQRSKQD RIEVKVGIVY EGWIEKGNAR HLKNPRVVMG IYEDGEQFWE ALTTEIARYY EIDEKTIYVV NGDGASWVQK TAKEQLPGAI VQLDRYHLHR DIRQAYGNET AQGLMKTLAK GQEQVFLDTL EALIEEAPNR KNKQQCQKVY DYCQRYRDNL LDYRLRLPRQ LEGQKLYGMG VAETTVDKKI AIRMKKRGMS WSEAGATAMV ALLMLKANGE LAAWLEKKMP QVEKNPVKVI KEKKISKEDV EEWLRKRVPA LVGPEAGTDW VKYTMRQLTR ISGAIF
|
| |