Gene Moth_1552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1552 
Symbol 
ID3832185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1594323 
End bp1595531 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content63% 
IMG OID637829484 
Producttype II secretion system protein 
Protein accessionYP_430404 
Protein GI83590395 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID[TIGR02120] general secretion pathway protein F 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0298839 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTTTG CCTACCGGGG AAGGGATGCC AGGGGGAATG CAGTTAGCGG GAATCTAGAA 
GCGGAAAACC AGGAAGTGGC CATCCAGGAA CTGCGGCGGC AGGGGATCTT TATTACTTCC
TTACGGCCGG TAGATGTCAA GCCTGCCCTG GACTGGCGGG GCCTCTTCCG CCGGCCGGTA
TCCCGCCGCG ACCTGGCCCT CTTCTGTCGT CAACTGGCGA CTATGCTGGG GGCCGGCCTG
CCGATTATGT CCGCCCTGCG TGTTTTACAG CGGCAGGTAG AAAATCCTTC CCTGCGTGAA
AGCATCGGTG TGCTCCTCCA GGACCTGGAG TCCGGAGAGG CCTTTTACCG GGCCCTGGGA
CGGCAGCCCC GGGTCTATCC CCCTATCGTT ATCCATACCG TCGAGGCCGG TGAACTGGGA
GGCGCCCTGG ACGAGACCCT GGAGCGCCTG GCCGGGCACC TGGAAAGGGA GCATGAGGTG
GAGGAAAAGG TCAAATCGGC CCTGACCTAC CCGGCGGTGG TGAGCCTGGT AGCGGTGGGG
GCGGTTATCT TCCTCCTCAC CTACGTCCTG CCCTCCTTTC AGGTGATGTT AAACAGCCTG
CAGGTTCCCC TGCCCTGGCC GACGCGGATG ATCCTGGGGT TGAGTGAAGG ACTGCGGCGC
TGGTGGTTTG TCCTGGTACT TCTCCTGGCT GGCGCCGGTT ACGGGTTTTA CCGCTGGCGG
CAGGGTCCGG ACGGGCGGTA TCAGGTCGAC CACCTGCTTT TACGTTTGCC GATCTTCGGA
CCCGTACATC AGAAGACACT TTTAGCCCGC TTCAGCCGTA CCCTGGGTAC CCTCCTGCAC
AGCGGCGTGC CCGTTTTGCT GGCCCTGGAA GTAGTGCGGC GCACCGTGGG CAACGCCGTG
GTGGCCAGAG CGGTGGAGCG GGCGGCCGAG AGCGTACGCG ACGGGCAGAG CCTGGCGGCT
CCCCTGGAAG CGAGCGGGAT CTTCCCTCCC ATGATGGTGG AAATGATTAC GGTGGGCGAG
GAAACGGGAG CCCTGGACGC TATGCTCGAG CGGGTGGCAG TCCTTTATGA GCTGGAGGTG
GAGGCGGTCG TAAGCCGCCT GGCCTCACTG ATAGAACCCG TCCTCATCGT CGGCCTCGGC
GGCATAGTAG GGCTGATCGT GATTTCGGTC TTTCTGCCCT ACTTTCAGAT GCTGGGCGGC
ATAAAGTAG
 
Protein sequence
MNFAYRGRDA RGNAVSGNLE AENQEVAIQE LRRQGIFITS LRPVDVKPAL DWRGLFRRPV 
SRRDLALFCR QLATMLGAGL PIMSALRVLQ RQVENPSLRE SIGVLLQDLE SGEAFYRALG
RQPRVYPPIV IHTVEAGELG GALDETLERL AGHLEREHEV EEKVKSALTY PAVVSLVAVG
AVIFLLTYVL PSFQVMLNSL QVPLPWPTRM ILGLSEGLRR WWFVLVLLLA GAGYGFYRWR
QGPDGRYQVD HLLLRLPIFG PVHQKTLLAR FSRTLGTLLH SGVPVLLALE VVRRTVGNAV
VARAVERAAE SVRDGQSLAA PLEASGIFPP MMVEMITVGE ETGALDAMLE RVAVLYELEV
EAVVSRLASL IEPVLIVGLG GIVGLIVISV FLPYFQMLGG IK