Gene Information |
Locus tag | Moth_0747 |
Symbol | flgL |
ID | 3831139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 783381 |
End bp | 784298 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637828678 |
Product | flagellar hook-associated protein FlgL |
Protein accession | YP_429608 |
Protein GI | 83589599 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | [TIGR02550] flagellar hook-associated protein 3 |
| |
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCGCGTTA CCAACCGGAT GCTCAACAAC AATGTCATTC GCAACATCAA TCGTAACCTG GAGAATATGG CCCGTACCCA GGAGCAGATG TCTTCCGGCA AGAGAGTAAA CCGGCCATCT GACGATCCCA TTGTCGTCGC CCGGGTCCTG GCCTTTAAGA CCTCCATCGC CGCAAACGAC CAGTATAAAA AGAACATGGA AGACGCCAAG GGGTGGCTCG ACGCCTCGGA AAGCGTCCTG GGCATGGCCA CAGATGTCCT CCAGCGGGCC AGGGAACTGG CAGTTTATGG CTCCAATGGG ACAATGCCCC CGGAATCCAT GGATGCCCTT GGGAAAGAGG TAGATCAGCT CCTCGACGAA ATGATGCAGG TCGCCAATAC CTCTTACGGC GGCCGTTACA TATTTGGTGG CAGCCAGACG ACGGCTACAC CATTTGTTAG TAACTCGTCT AGTACGGACG ATCCTGTAGA GTACAATGGT GATTCAGCAG CTTTAAATTG GGAAATTGCC CCGGGTGTTA CTATCAGCGT TAACGAAAAT GGCGATCAAA TATTTATGCA GGCAATAAAT AATGGTAGTG CTACTGAATC CATATTTAAA TTATTAAACG ATTTAAGCGC TGCCTTACAT GGCGGGACAG CCACGGATGT TTCGGCAACT TTAAATAAGT TCGACCAGGC TATCGATCAT ATCCTCAATA TCCGCGCCAC CTTGGGTGCC AAAAGCAACC GCATGGAGAT GGCTATGTCG CGCCTGGAAG ATACCCAAAT TGGCCTGACG CAAACCATGT CCAAGCTAGA AGATATCGAC CTGGCGGAGA CCGTTATGAA CTATAAAACC CAGGAAAACG TCTACCTCGC CTCCCTGTCC ACCGGCGCCA AGGTCCTTCA GCCAAGCCTG ATCGACTATT TGCGTTAA
|
Protein sequence | MRVTNRMLNN NVIRNINRNL ENMARTQEQM SSGKRVNRPS DDPIVVARVL AFKTSIAAND QYKKNMEDAK GWLDASESVL GMATDVLQRA RELAVYGSNG TMPPESMDAL GKEVDQLLDE MMQVANTSYG GRYIFGGSQT TATPFVSNSS STDDPVEYNG DSAALNWEIA PGVTISVNEN GDQIFMQAIN NGSATESIFK LLNDLSAALH GGTATDVSAT LNKFDQAIDH ILNIRATLGA KSNRMEMAMS RLEDTQIGLT QTMSKLEDID LAETVMNYKT QENVYLASLS TGAKVLQPSL IDYLR
|
| |
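The length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database, and it rests on two assumptions: that Start bp and End bp are 1-based inclusive coordinates, and that the gene sequence ends in a stop codon that is not counted in the protein length. The function names (`gene_length`, `expected_protein_length`, `gc_percent`) are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Values copied from the record above.
length_bp = gene_length(783381, 784298)
length_aa = expected_protein_length(length_bp)
print(length_bp, length_aa)
```

With the coordinates listed above, the sketch gives 918 bp and 305 aa, matching the Gene Length and Protein Length fields. The gene sequence begins with TTG, which serves as an alternative initiation codon under translation table 11 and is read as methionine at the start position, so the protein sequence begins with M. `gc_percent` accepts the sequence exactly as printed, in space-separated blocks of ten.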