Gene Moth_0776 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0776 
Symbol 
ID3831013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp812563 
End bp814173 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content67% 
IMG OID637828707 
Productflagellar hook-length control protein 
Protein accessionYP_429637 
Protein GI83589628 
COG category[N] Cell motility 
COG ID[COG3144] Flagellar hook-length control protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.460138 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGTCG CGTCTATTAG CAGGTCCAGT GACCTGGAAC TGCTGACCAG GACCAGGGAG 
CACTATCCAG AGCCGGCGGT AGATTTTATT TCCTTCCTGG CGGGGCTGAT CCAGCAACCG
GCCCACCTTG ATCGTCCTGG GGTTTACAAT CCCTCCCAGG GGGAAACCGG GGGTCAGGAC
GGGAACGCCA GGGGAGGATT GCAGGAGCCG GCCGTGCCTG AACCGGCAGG CATAGCGGGC
GGACCCGTCA GGAAGGCGAT GGGAAGCCCC CGGGAGGCTG ATGCCCGGCC GACAGGGCCG
GGAGACGGCT ACCAGGCCAA CGCAGGGGCG GAAGGGACGG CCGGATCCGG TTCCAGTGGG
CAGGTCACTG GCCGGGACGC TCCTGCCGCC GCTGCCGGCA AGGAAGCTCC CGCCGGCGGG
AAAGGCGCTC CCCGCAGCGG CTTTCAGGCC GGTGACAGCA GGCGAACCGC CGCCGCGCAG
GATAAGGTCG CGGCCTGGGA TCAGGTAAAG GGCGAAACAT CCTCCTCGGC CGGGAAAGCC
GTGGAAGCGC ACCAGGCGAC TTCGCTCCTC GCCGTCGCTG CCGCTAACGT GCAGGACGAG
GCCGGCGCCG GGGCCGCCGG CAGAGGTTCC AGACAGCTGC TCCGGGGAAA CTGGCCTGAC
CTGGCAAGCC GGATTGAGGC CGTACCCTGG ATGGGAGCGG CCAGGAAGGC CGCCGTGATA
AAAGGTAAAG GCCAGGTCCA GGGGGGCCTG CCGGCAGGAG TAAACACCCG GGAGACGGCG
ACGACCGGCC CGGGCACCCC GGGAACAACC CGACCGGCCG GGACCCCTGC GGAAGGGGCC
TTAAAAACCG GCCTGCCTTT AGAGGGAGAA GGGCGACAAG CCAGCGCCGG GAAACCCGTT
ACCGATGCAA ATAGTATGGA AATTGCTGAC AGGTCGGCAG GGATGCCCTT TGTTCCCGAT
AACCGGCAGG CGCCGATGGC TTCCGGGGAA GGGAGACCCC CTGTCAGAGG GCCGGTTCCC
GGTAACAGCT TGCTGGCCGG TGACTATAAT CCTGCAAATA CCGGGCCGAG TAACCGCGAG
CCGGCCGGGG AGACCCTGAC AAGCAGCGGT GGTAGTTCGC CGGTGGGGCC GGCCGCTTTT
AACGCCGTCA TGGGAGGAAA CAGCCAGCAG GCGGGGAACC AGGTAGCCCA GATCAACAAC
CTGCCGGAGG TGTTTGCCAC CATCCTCAGC ACAGCCCGCC TGGCCGCCAC CAACGGCAGG
CAGGAACTGG AACTCCAGCT GCAACCGGAA AACCTGGGTA AATTGAAACT CCGGGCTTTA
CTGGACGGGG GGCGGTTGAC CCTGCAACTG CTGGTAGAAA GCAGCGAAGC CGCCCGGGCC
CTCCAGGCGG CGGTGCCGGA AATGCGCCAG GCCGCGGCCG TCCAGGGCCT GCGCCTGGAC
CAGGTCCAGG TGCAGGTGGG CGGCGATGGC CAGGGCGGCG GCCGCCACCA GGCAGACAGC
CAGGGTGAAT ACCGCCAGGG TGCCGGCTGG CGGCGCCAGT CCCCGGGGTG GCCGGGTAGC
CCGGACCTGG AAGGAACCAT TAACCGGTAC CGGCTGGATT ACCTGGCCTG A
 
Protein sequence
MTVASISRSS DLELLTRTRE HYPEPAVDFI SFLAGLIQQP AHLDRPGVYN PSQGETGGQD 
GNARGGLQEP AVPEPAGIAG GPVRKAMGSP READARPTGP GDGYQANAGA EGTAGSGSSG
QVTGRDAPAA AAGKEAPAGG KGAPRSGFQA GDSRRTAAAQ DKVAAWDQVK GETSSSAGKA
VEAHQATSLL AVAAANVQDE AGAGAAGRGS RQLLRGNWPD LASRIEAVPW MGAARKAAVI
KGKGQVQGGL PAGVNTRETA TTGPGTPGTT RPAGTPAEGA LKTGLPLEGE GRQASAGKPV
TDANSMEIAD RSAGMPFVPD NRQAPMASGE GRPPVRGPVP GNSLLAGDYN PANTGPSNRE
PAGETLTSSG GSSPVGPAAF NAVMGGNSQQ AGNQVAQINN LPEVFATILS TARLAATNGR
QELELQLQPE NLGKLKLRAL LDGGRLTLQL LVESSEAARA LQAAVPEMRQ AAAVQGLRLD
QVQVQVGGDG QGGGRHQADS QGEYRQGAGW RRQSPGWPGS PDLEGTINRY RLDYLA