Gene Moth_0760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0760 
Symbol 
ID3831473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp796900 
End bp798042 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content49% 
IMG OID637828691 
Productflagellin-like 
Protein accessionYP_429621 
Protein GI83589612 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.382669 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATCA ACCAGAACAT TTCGGCCTTA AATGCTTACC GGAACCTGAC GGTAACAAAT 
AACGCCCTAA CCAAGTCAAT GGAGAAGCTA TCTTCTGGTC TGAGAATCAA TCGGGCCGCC
GACGATGCCG CCGGGCTGGC CATTAGCGAG AAGATGCGTG GCCAAATCCG GGGCTTAAAC
CAGGCGGTGC GCAATGCCCA GGATGGCATA TCTCTGATTC AAACGGCCGA AGGGGCACTA
AATGAGACCC ATAGTATTTT ACAACGAATG CGAGAACTTG CTGTTCAGTC GGCCAACGAT
ACCAATACTG CTGCCGACCG CTGGGAGATC CAGAAGGAAA TAAATCAGCT GTCTGAAGAA
TTGACTCGTA TCGCTAATAC TACTGAGTTT AACACCAAGA ACCTTTTAGG GGGCAATTTC
GAAGGAACCT TCCAGATAGG AGCAAATAAA GATCAAAATA TAACCTTGCA AATTGGTGGC
ATGAAATCAA GCGATTTAAA AACAGAAGTC AGCGGTTACG CTGCCGCTGT AAACCCTGTA
GATTCTGTGT TGACTGGCGC TGCCAACCTT AAACCAGCGA GTGGTAAAGC CATTCTGGGC
GGCGCATACG ATGTTTCATT AGCAACGGGC AGTGCTGCTG GTAATATAAA AATAACGGTA
GTAATTAAGA ATACCGCAGG AAGTTCTATT AGCCTTACGG GTAGCGCAGG TGTGGCTAGT
GGCAAACTTA CGGCAAAAAA TGGTAGTTAT TCTATCGACT TCACTTGGAG CGGAACTGCT
GTAGCAGGTA CAAGTGCTAA ACTTTATATA GTAGGCAAAG ACGAAAATGG TAACAACCAG
GGAATCAGCG TCATCACCCA GGCAGATGCC AATAAAGCTA TTACAACTAT AAATAACGCC
ATCGAAACTG TTTCTGCCGA ACGTTCCAAA CTGGGCGCCT ACCAGAACCG CCTGGAACAT
ACCATTGCCA ACCTGGGCAC GGCGGCGGAA AACCTGACGG CGGCCGAGTC CCGCATCCGC
GATCTGGATA TGGCCCAGGA AATGATGGCC TTTACCAGGA ACCAGATCCT TAGCCAGGCC
GGTACGGCTA TGCTAGCCCA GGCCAATGCC CAACCACAGA CCGTACTGCA GTTATTACGG
TAA
 
Protein sequence
MRINQNISAL NAYRNLTVTN NALTKSMEKL SSGLRINRAA DDAAGLAISE KMRGQIRGLN 
QAVRNAQDGI SLIQTAEGAL NETHSILQRM RELAVQSAND TNTAADRWEI QKEINQLSEE
LTRIANTTEF NTKNLLGGNF EGTFQIGANK DQNITLQIGG MKSSDLKTEV SGYAAAVNPV
DSVLTGAANL KPASGKAILG GAYDVSLATG SAAGNIKITV VIKNTAGSSI SLTGSAGVAS
GKLTAKNGSY SIDFTWSGTA VAGTSAKLYI VGKDENGNNQ GISVITQADA NKAITTINNA
IETVSAERSK LGAYQNRLEH TIANLGTAAE NLTAAESRIR DLDMAQEMMA FTRNQILSQA
GTAMLAQANA QPQTVLQLLR