Gene Information |
Locus tag | Moth_0760 |
Symbol | |
ID | 3831473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 796900 |
End bp | 798042 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637828691 |
Product | flagellin-like |
Protein accession | YP_429621 |
Protein GI | 83589612 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
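The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that contributes no amino acid; the record does not state either convention, but both are consistent with its values.

```python
# Values copied from the record above.
start_bp, end_bp = 796900, 798042

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Both results agree with the Gene Length and Protein Length fields in the record.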
Plasmid Coverage information |
Num covering plasmid clones | 48 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.382669 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGTATCA ACCAGAACAT TTCGGCCTTA AATGCTTACC GGAACCTGAC GGTAACAAAT AACGCCCTAA CCAAGTCAAT GGAGAAGCTA TCTTCTGGTC TGAGAATCAA TCGGGCCGCC GACGATGCCG CCGGGCTGGC CATTAGCGAG AAGATGCGTG GCCAAATCCG GGGCTTAAAC CAGGCGGTGC GCAATGCCCA GGATGGCATA TCTCTGATTC AAACGGCCGA AGGGGCACTA AATGAGACCC ATAGTATTTT ACAACGAATG CGAGAACTTG CTGTTCAGTC GGCCAACGAT ACCAATACTG CTGCCGACCG CTGGGAGATC CAGAAGGAAA TAAATCAGCT GTCTGAAGAA TTGACTCGTA TCGCTAATAC TACTGAGTTT AACACCAAGA ACCTTTTAGG GGGCAATTTC GAAGGAACCT TCCAGATAGG AGCAAATAAA GATCAAAATA TAACCTTGCA AATTGGTGGC ATGAAATCAA GCGATTTAAA AACAGAAGTC AGCGGTTACG CTGCCGCTGT AAACCCTGTA GATTCTGTGT TGACTGGCGC TGCCAACCTT AAACCAGCGA GTGGTAAAGC CATTCTGGGC GGCGCATACG ATGTTTCATT AGCAACGGGC AGTGCTGCTG GTAATATAAA AATAACGGTA GTAATTAAGA ATACCGCAGG AAGTTCTATT AGCCTTACGG GTAGCGCAGG TGTGGCTAGT GGCAAACTTA CGGCAAAAAA TGGTAGTTAT TCTATCGACT TCACTTGGAG CGGAACTGCT GTAGCAGGTA CAAGTGCTAA ACTTTATATA GTAGGCAAAG ACGAAAATGG TAACAACCAG GGAATCAGCG TCATCACCCA GGCAGATGCC AATAAAGCTA TTACAACTAT AAATAACGCC ATCGAAACTG TTTCTGCCGA ACGTTCCAAA CTGGGCGCCT ACCAGAACCG CCTGGAACAT ACCATTGCCA ACCTGGGCAC GGCGGCGGAA AACCTGACGG CGGCCGAGTC CCGCATCCGC GATCTGGATA TGGCCCAGGA AATGATGGCC TTTACCAGGA ACCAGATCCT TAGCCAGGCC GGTACGGCTA TGCTAGCCCA GGCCAATGCC CAACCACAGA CCGTACTGCA GTTATTACGG TAA
Protein sequence | MRINQNISAL NAYRNLTVTN NALTKSMEKL SSGLRINRAA DDAAGLAISE KMRGQIRGLN QAVRNAQDGI SLIQTAEGAL NETHSILQRM RELAVQSAND TNTAADRWEI QKEINQLSEE LTRIANTTEF NTKNLLGGNF EGTFQIGANK DQNITLQIGG MKSSDLKTEV SGYAAAVNPV DSVLTGAANL KPASGKAILG GAYDVSLATG SAAGNIKITV VIKNTAGSSI SLTGSAGVAS GKLTAKNGSY SIDFTWSGTA VAGTSAKLYI VGKDENGNNQ GISVITQADA NKAITTINNA IETVSAERSK LGAYQNRLEH TIANLGTAAE NLTAAESRIR DLDMAQEMMA FTRNQILSQA GTAMLAQANA QPQTVLQLLR