Gene Moth_0765 details


Gene Information

Locus tag: Moth_0765
Symbol:
ID: 3831478
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 802031
End bp: 803929
Gene Length: 1899 bp
Protein Length: 632 aa
Translation table: 11
GC content: 51%
IMG OID: 637828696
Product: flagellar hook-associated 2-like
Protein accession: YP_429626
Protein GI: 83589617
COG category: [N] Cell motility
COG ID: [COG1345] Flagellar capping protein
TIGRFAM ID:
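
The coordinate and length fields of this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but that reading reproduces the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 802031
end_bp = 803929

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# An unspliced CDS is read in triplets; the last codon is assumed to be a
# stop codon, which contributes no amino acid to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1899 0 632
```

Under these assumptions the result agrees with the listed Gene Length (1899 bp) and Protein Length (632 aa).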


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0210508
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAGCG GCATTTATTT TTCCGGCCTG GCTTCGGGAC TGGATACCGA GTCGATTATC 
ACGCAGTTAA TGGACCTCGA AAGGATACCC CTTACCAGGC TGCAGCAGCG CAAAAACCAG
TACAACGTGG AGAAAAACGC CTGGCACGAT ATTTATACCC GTTTAAGCAG CTTGCAGAGC
AAGCTTGGCG ATCTTAAACT TGCTTCCACC TTTACCGGTA TGAAAGCCAC CTCAAGTAAT
ACCTTGGCTC TGACGGCTAC TGCAGCCAGC AATGCTCCAG CCGGAAGTTA CCAGGTGAGC
ATTATCCAGC TAGCCCAGGC TCACAAAGTG GCCAGCCAGA ATCTGGTTTA TGGTACGGAA
GCGCAATTAT TAACAGATAC TTTTACTGAC GCCACCTACA CCAGCAGTGT GGCGACGTTG
AACAACTTAA CCCAGGATAC GGCAAATGGC TTGCTAAAAT TAGCCTCAGG AGCAACGGGG
AGTATTACTT CTAATGCTAT TAACGTTAAT GCCGCCAATG GCGGCCGCCT GGCTTTGACG
GTCAATCAAC AGGAACGGCT GAATGCTAAT TATGTGTATC AATACCGTAC TTTTGATGGC
ACTACCTGGA GTGAATGGCA AACGCTGGGC AATTTAACCG GTGATGGTTC CGGGATTTTT
AAGAGCCATA CTTTAACAGC TGATATAAGT GGCAATGTGC AACAGGTGGA GATTAAAGCT
ACTTTAAATG GAGATTTTGG TGCCAGTGTT ATTCCTATGC TTGCCGACTG GACGGCCACC
TTCCAACCAG CAGAACCGGT TACGTCCGAT ACTGTGGCGC TGGGCCTGAG CGGTTCCTTT
ACCATTACCG TTGGTAGTGA AACCCGGAAT ATTACCATAA ATGAAAATGA TAGCCTACAA
TCCATCGCCT CCCTGATCAA TGCAGTACCT TCTGAAGGGG AAACAGGCCC GGGAGCGGGG
GACATTGTGA CGGCCAGCGT CATCGATCAT CGCTTGGTAA TTACCAGTAA AACTACCGGC
TCTAACGGGG CTATATCTTT TTCCGATCCT GACGGTGTCC TCAATAAGTT AGGGCTGGTA
GATGCAAGCG GGGTTATTCT ACCACGCGCC GTCATCCAGG ATGCCAAGGA TGCCGTATTT
ACAGTCGATG GCCTCACTAT AACCCGCTCC ACCAATACGA TTACAGATGT CATCCAAGGG
GTTACCCTGA ACCTCCTGGC CGTTACTGAC ACTAACGGCA ACGGCACAAT TGAACCGGCG
GAAACACTGA ACCTGGAGAT AAGCCACGAC ACTCAGAAGG CCGTTGATGC TATCCAGGCC
ATGGTGGATC AATACAACTC AGTAATGGAG TTTATCAGCA CCAAAGCCGG AGACAAGGGT
GATTTACAGG GCGATCCTAC CCTGGCGCGT TTTAAAAACG ATTTATGGCA ACTGATGACT
GATAGAGTAG CCGGGTTAAC GGGGACTTAC CAGACCCCCT GGAGTATTGG TATTTCGACA
GGGGCTGTAG TAGGTAGCGG TTCTTTAACC TTCGATCGCA ACGGCAAAAT AACCCTGGAT
ACAACAAAGT TAACCTCGGC TTTGGAGACG GACCCAACAG CCGTCATGGC TATTTTTACC
AACAGCAGCG AAACCGGCTT GGTCGACAAG TTGGATAGTT ACCTGACTTC CCTGGTGCGC
TCCGGGGACG GCATTATTCC TTCCCGGGAG CAGTCCCTGC AGAACATCAT GGATGACATC
GACGACCAGA TCGCCCGCAT GGAAGACCGG CTCACCATGA GGGAAGAGCA GCTCCGGCGG
CAGTTTACGG CTATGGAGCA GGCCCTGGCG GCTTTGCAGA GCCAGGGGAA CTGGCTGGCC
GGGCAGATTG CCGGATTGGG GGCCTACCAG CAGAAATAG
 
Protein sequence
MASGIYFSGL ASGLDTESII TQLMDLERIP LTRLQQRKNQ YNVEKNAWHD IYTRLSSLQS 
KLGDLKLAST FTGMKATSSN TLALTATAAS NAPAGSYQVS IIQLAQAHKV ASQNLVYGTE
AQLLTDTFTD ATYTSSVATL NNLTQDTANG LLKLASGATG SITSNAINVN AANGGRLALT
VNQQERLNAN YVYQYRTFDG TTWSEWQTLG NLTGDGSGIF KSHTLTADIS GNVQQVEIKA
TLNGDFGASV IPMLADWTAT FQPAEPVTSD TVALGLSGSF TITVGSETRN ITINENDSLQ
SIASLINAVP SEGETGPGAG DIVTASVIDH RLVITSKTTG SNGAISFSDP DGVLNKLGLV
DASGVILPRA VIQDAKDAVF TVDGLTITRS TNTITDVIQG VTLNLLAVTD TNGNGTIEPA
ETLNLEISHD TQKAVDAIQA MVDQYNSVME FISTKAGDKG DLQGDPTLAR FKNDLWQLMT
DRVAGLTGTY QTPWSIGIST GAVVGSGSLT FDRNGKITLD TTKLTSALET DPTAVMAIFT
NSSETGLVDK LDSYLTSLVR SGDGIIPSRE QSLQNIMDDI DDQIARMEDR LTMREEQLRR
QFTAMEQALA ALQSQGNWLA GQIAGLGAYQ QK
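
As a quick consistency check between the two sequences, the first and last codons of the gene can be translated by hand. The sketch below is a minimal illustration, not a full implementation of translation table 11: the `CODONS` dictionary is a hypothetical partial table holding only the codons needed for these two excerpts, all of which have the same meaning in table 11 as in the standard genetic code.

```python
# Partial codon table: only the codons that occur in the excerpts below.
# These assignments are the same in translation table 11 and the standard code.
CODONS = {
    "ATG": "M", "GCT": "A", "AGC": "S", "GGC": "G", "ATT": "I",
    "TAT": "Y", "TTT": "F", "TCC": "S", "CTG": "L",
    "CAG": "Q", "AAA": "K", "TAG": "*",
}

def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    # Ignore any trailing 1 or 2 bases that do not form a full codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODONS[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and last 9 bases of the gene sequence listed above.
print(translate("ATGGCTAGCG GCATTTATTT TTCCGGCCTG"))  # MASGIYFSGL
print(translate("CAGAAATAG"))                         # QK
```

The two results match the first ten residues and the last two residues of the listed protein sequence. Translating the whole gene would need the complete codon table, for example from an established bioinformatics library.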