Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0765 |
Symbol | |
ID | 3831478 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 802031 |
End bp | 803929 |
Gene Length | 1899 bp |
Protein Length | 632 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637828696 |
Product | flagellar hook-associated 2-like |
Protein accession | YP_429626 |
Protein GI | 83589617 |
COG category | [N] Cell motility |
COG ID | [COG1345] Flagellar capping protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0210508 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAGCG GCATTTATTT TTCCGGCCTG GCTTCGGGAC TGGATACCGA GTCGATTATC ACGCAGTTAA TGGACCTCGA AAGGATACCC CTTACCAGGC TGCAGCAGCG CAAAAACCAG TACAACGTGG AGAAAAACGC CTGGCACGAT ATTTATACCC GTTTAAGCAG CTTGCAGAGC AAGCTTGGCG ATCTTAAACT TGCTTCCACC TTTACCGGTA TGAAAGCCAC CTCAAGTAAT ACCTTGGCTC TGACGGCTAC TGCAGCCAGC AATGCTCCAG CCGGAAGTTA CCAGGTGAGC ATTATCCAGC TAGCCCAGGC TCACAAAGTG GCCAGCCAGA ATCTGGTTTA TGGTACGGAA GCGCAATTAT TAACAGATAC TTTTACTGAC GCCACCTACA CCAGCAGTGT GGCGACGTTG AACAACTTAA CCCAGGATAC GGCAAATGGC TTGCTAAAAT TAGCCTCAGG AGCAACGGGG AGTATTACTT CTAATGCTAT TAACGTTAAT GCCGCCAATG GCGGCCGCCT GGCTTTGACG GTCAATCAAC AGGAACGGCT GAATGCTAAT TATGTGTATC AATACCGTAC TTTTGATGGC ACTACCTGGA GTGAATGGCA AACGCTGGGC AATTTAACCG GTGATGGTTC CGGGATTTTT AAGAGCCATA CTTTAACAGC TGATATAAGT GGCAATGTGC AACAGGTGGA GATTAAAGCT ACTTTAAATG GAGATTTTGG TGCCAGTGTT ATTCCTATGC TTGCCGACTG GACGGCCACC TTCCAACCAG CAGAACCGGT TACGTCCGAT ACTGTGGCGC TGGGCCTGAG CGGTTCCTTT ACCATTACCG TTGGTAGTGA AACCCGGAAT ATTACCATAA ATGAAAATGA TAGCCTACAA TCCATCGCCT CCCTGATCAA TGCAGTACCT TCTGAAGGGG AAACAGGCCC GGGAGCGGGG GACATTGTGA CGGCCAGCGT CATCGATCAT CGCTTGGTAA TTACCAGTAA AACTACCGGC TCTAACGGGG CTATATCTTT TTCCGATCCT GACGGTGTCC TCAATAAGTT AGGGCTGGTA GATGCAAGCG GGGTTATTCT ACCACGCGCC GTCATCCAGG ATGCCAAGGA TGCCGTATTT ACAGTCGATG GCCTCACTAT AACCCGCTCC ACCAATACGA TTACAGATGT CATCCAAGGG GTTACCCTGA ACCTCCTGGC CGTTACTGAC ACTAACGGCA ACGGCACAAT TGAACCGGCG GAAACACTGA ACCTGGAGAT AAGCCACGAC ACTCAGAAGG CCGTTGATGC TATCCAGGCC ATGGTGGATC AATACAACTC AGTAATGGAG TTTATCAGCA CCAAAGCCGG AGACAAGGGT GATTTACAGG GCGATCCTAC CCTGGCGCGT TTTAAAAACG ATTTATGGCA ACTGATGACT GATAGAGTAG CCGGGTTAAC GGGGACTTAC CAGACCCCCT GGAGTATTGG TATTTCGACA GGGGCTGTAG TAGGTAGCGG TTCTTTAACC TTCGATCGCA ACGGCAAAAT AACCCTGGAT ACAACAAAGT TAACCTCGGC TTTGGAGACG GACCCAACAG CCGTCATGGC TATTTTTACC AACAGCAGCG AAACCGGCTT GGTCGACAAG TTGGATAGTT ACCTGACTTC CCTGGTGCGC TCCGGGGACG GCATTATTCC TTCCCGGGAG CAGTCCCTGC AGAACATCAT GGATGACATC GACGACCAGA TCGCCCGCAT GGAAGACCGG CTCACCATGA GGGAAGAGCA GCTCCGGCGG CAGTTTACGG CTATGGAGCA GGCCCTGGCG GCTTTGCAGA GCCAGGGGAA CTGGCTGGCC GGGCAGATTG CCGGATTGGG GGCCTACCAG CAGAAATAG
|
Protein sequence | MASGIYFSGL ASGLDTESII TQLMDLERIP LTRLQQRKNQ YNVEKNAWHD IYTRLSSLQS KLGDLKLAST FTGMKATSSN TLALTATAAS NAPAGSYQVS IIQLAQAHKV ASQNLVYGTE AQLLTDTFTD ATYTSSVATL NNLTQDTANG LLKLASGATG SITSNAINVN AANGGRLALT VNQQERLNAN YVYQYRTFDG TTWSEWQTLG NLTGDGSGIF KSHTLTADIS GNVQQVEIKA TLNGDFGASV IPMLADWTAT FQPAEPVTSD TVALGLSGSF TITVGSETRN ITINENDSLQ SIASLINAVP SEGETGPGAG DIVTASVIDH RLVITSKTTG SNGAISFSDP DGVLNKLGLV DASGVILPRA VIQDAKDAVF TVDGLTITRS TNTITDVIQG VTLNLLAVTD TNGNGTIEPA ETLNLEISHD TQKAVDAIQA MVDQYNSVME FISTKAGDKG DLQGDPTLAR FKNDLWQLMT DRVAGLTGTY QTPWSIGIST GAVVGSGSLT FDRNGKITLD TTKLTSALET DPTAVMAIFT NSSETGLVDK LDSYLTSLVR SGDGIIPSRE QSLQNIMDDI DDQIARMEDR LTMREEQLRR QFTAMEQALA ALQSQGNWLA GQIAGLGAYQ QK
|
| |