Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0776 |
Symbol | |
ID | 3831013 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 812563 |
End bp | 814173 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637828707 |
Product | flagellar hook-length control protein |
Protein accession | YP_429637 |
Protein GI | 83589628 |
COG category | [N] Cell motility |
COG ID | [COG3144] Flagellar hook-length control protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.460138 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGTCG CGTCTATTAG CAGGTCCAGT GACCTGGAAC TGCTGACCAG GACCAGGGAG CACTATCCAG AGCCGGCGGT AGATTTTATT TCCTTCCTGG CGGGGCTGAT CCAGCAACCG GCCCACCTTG ATCGTCCTGG GGTTTACAAT CCCTCCCAGG GGGAAACCGG GGGTCAGGAC GGGAACGCCA GGGGAGGATT GCAGGAGCCG GCCGTGCCTG AACCGGCAGG CATAGCGGGC GGACCCGTCA GGAAGGCGAT GGGAAGCCCC CGGGAGGCTG ATGCCCGGCC GACAGGGCCG GGAGACGGCT ACCAGGCCAA CGCAGGGGCG GAAGGGACGG CCGGATCCGG TTCCAGTGGG CAGGTCACTG GCCGGGACGC TCCTGCCGCC GCTGCCGGCA AGGAAGCTCC CGCCGGCGGG AAAGGCGCTC CCCGCAGCGG CTTTCAGGCC GGTGACAGCA GGCGAACCGC CGCCGCGCAG GATAAGGTCG CGGCCTGGGA TCAGGTAAAG GGCGAAACAT CCTCCTCGGC CGGGAAAGCC GTGGAAGCGC ACCAGGCGAC TTCGCTCCTC GCCGTCGCTG CCGCTAACGT GCAGGACGAG GCCGGCGCCG GGGCCGCCGG CAGAGGTTCC AGACAGCTGC TCCGGGGAAA CTGGCCTGAC CTGGCAAGCC GGATTGAGGC CGTACCCTGG ATGGGAGCGG CCAGGAAGGC CGCCGTGATA AAAGGTAAAG GCCAGGTCCA GGGGGGCCTG CCGGCAGGAG TAAACACCCG GGAGACGGCG ACGACCGGCC CGGGCACCCC GGGAACAACC CGACCGGCCG GGACCCCTGC GGAAGGGGCC TTAAAAACCG GCCTGCCTTT AGAGGGAGAA GGGCGACAAG CCAGCGCCGG GAAACCCGTT ACCGATGCAA ATAGTATGGA AATTGCTGAC AGGTCGGCAG GGATGCCCTT TGTTCCCGAT AACCGGCAGG CGCCGATGGC TTCCGGGGAA GGGAGACCCC CTGTCAGAGG GCCGGTTCCC GGTAACAGCT TGCTGGCCGG TGACTATAAT CCTGCAAATA CCGGGCCGAG TAACCGCGAG CCGGCCGGGG AGACCCTGAC AAGCAGCGGT GGTAGTTCGC CGGTGGGGCC GGCCGCTTTT AACGCCGTCA TGGGAGGAAA CAGCCAGCAG GCGGGGAACC AGGTAGCCCA GATCAACAAC CTGCCGGAGG TGTTTGCCAC CATCCTCAGC ACAGCCCGCC TGGCCGCCAC CAACGGCAGG CAGGAACTGG AACTCCAGCT GCAACCGGAA AACCTGGGTA AATTGAAACT CCGGGCTTTA CTGGACGGGG GGCGGTTGAC CCTGCAACTG CTGGTAGAAA GCAGCGAAGC CGCCCGGGCC CTCCAGGCGG CGGTGCCGGA AATGCGCCAG GCCGCGGCCG TCCAGGGCCT GCGCCTGGAC CAGGTCCAGG TGCAGGTGGG CGGCGATGGC CAGGGCGGCG GCCGCCACCA GGCAGACAGC CAGGGTGAAT ACCGCCAGGG TGCCGGCTGG CGGCGCCAGT CCCCGGGGTG GCCGGGTAGC CCGGACCTGG AAGGAACCAT TAACCGGTAC CGGCTGGATT ACCTGGCCTG A
|
Protein sequence | MTVASISRSS DLELLTRTRE HYPEPAVDFI SFLAGLIQQP AHLDRPGVYN PSQGETGGQD GNARGGLQEP AVPEPAGIAG GPVRKAMGSP READARPTGP GDGYQANAGA EGTAGSGSSG QVTGRDAPAA AAGKEAPAGG KGAPRSGFQA GDSRRTAAAQ DKVAAWDQVK GETSSSAGKA VEAHQATSLL AVAAANVQDE AGAGAAGRGS RQLLRGNWPD LASRIEAVPW MGAARKAAVI KGKGQVQGGL PAGVNTRETA TTGPGTPGTT RPAGTPAEGA LKTGLPLEGE GRQASAGKPV TDANSMEIAD RSAGMPFVPD NRQAPMASGE GRPPVRGPVP GNSLLAGDYN PANTGPSNRE PAGETLTSSG GSSPVGPAAF NAVMGGNSQQ AGNQVAQINN LPEVFATILS TARLAATNGR QELELQLQPE NLGKLKLRAL LDGGRLTLQL LVESSEAARA LQAAVPEMRQ AAAVQGLRLD QVQVQVGGDG QGGGRHQADS QGEYRQGAGW RRQSPGWPGS PDLEGTINRY RLDYLA
|
| |