Gene Moth_0324 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0324 
Symbol 
ID3831568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp328684 
End bp330021 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content53% 
IMG OID637828259 
Producttype II secretion system protein E 
Protein accessionYP_429201 
Protein GI83589192 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0186517 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCAAAA TCGCTTTAGA ATTTCAATCC CATGACTGGA AGGCCGAGAT TGAGCGCCAG 
CGCATGAAGA TCATCGACGA GCTGGCAGCC CAAATATCGC GGGAGCATCC CGAGATATTT
TCTGACCAGC TAGTAGACGA GGGGTATGCT GCGGCGGAAA AGATTGTAAA AAATTATGTC
CTGACCCGCA AGGATTTGGC TCCCAGCGAA GTTGAAGGGG TAGTAAAAGC GATTATGGGT
CAGGCTACAA GCTACGGGCC GTTGCAGGAA TTTTTTGTGG GCAAAGAAGC CAAAGAGATA
ACGGAGGTGA TGGTTAACCC TTCAAAGGAC GGTCCCAGAG TGTTCTATGG TAAGCATGGG
AAATTGCACG ACGCGGGAAG GGGGTATTTT AAAGACAACG ACGAGGTGAC CCGCTACTGC
CAGAAGATAT GCGAAGACGT GGGCAGGCCC TTCACCGAAG ACGCCCCTAT AGTTGACGCC
TGGCTGAAGG ACGGCAGCCG GGTGGCGGTG ATGGGTTACA AGGTAAGCCC ATTGGGTACG
GCCCTGACTA TAAGGAAGTC TCCCTTGCTG CGGCCGCCGA TGCCCCTGGA AAAGCTGGTA
GAGTATAAGA TGCTGCCCTC GCTGGCGGCT TCCATGATGG TTGATCTTTT GGTAAAGGGG
CACGCCAATA TAGGCATCTT CGGGAGGACA GACAGCGGTA AAACCACCTT TGCCCGGTCC
TTAGCCCAGC ATATTGACCC GCAGGAGAGG GTGTTTATCG CTGAAACAAG TTTTGAGATG
TACCTGCCCA ACCTGCCCAA TTGCATCAAT CTGGTAGAGC TGGTTTACGG CGATAAGACG
ATAGTGGATA TGACCCAGCT ATGTAAGACC ATGAACAGGA ATAACCCCGA CCGGAGCATA
GTGGGCGAGA TCAGGAGCAG GGAGATAATC GCCGCCTCCC AGATAGCTTC TTCGACATCC
GGGGGGTTCT GGACAACGGG CCACGCCGGC GATGTCAACG ACCTGCGGAC GCGGTTATTC
GGTATGTTCC TGGATGGCGG TGTCCAGCTG CCGGTAGAAT TTCTTGATGA AATCATCAGG
TCTATGTTTC ACTTCCTGGT TTTCTTAGAT AAAAGTTTTG ACGGCATGAG GACTTTAATG
TCGCTTGTGG AGGTGACGCC GGAAGGCTAT CGGACGATCA TCAGGTACGA TACGAAAGCT
TTTGCTGCTT CACGCGGCAA AGAGCGGCGC TGGATTTACG AGAATACTGT TACGCCTGAA
AAGATGGGCA AGCTGGCGTT CAGCGGGGCG GAATTGAAGC CAGAATACGA AAAGGTGCCG
GAAAAGTACC TCTGCTGA
 
Protein sequence
MVKIALEFQS HDWKAEIERQ RMKIIDELAA QISREHPEIF SDQLVDEGYA AAEKIVKNYV 
LTRKDLAPSE VEGVVKAIMG QATSYGPLQE FFVGKEAKEI TEVMVNPSKD GPRVFYGKHG
KLHDAGRGYF KDNDEVTRYC QKICEDVGRP FTEDAPIVDA WLKDGSRVAV MGYKVSPLGT
ALTIRKSPLL RPPMPLEKLV EYKMLPSLAA SMMVDLLVKG HANIGIFGRT DSGKTTFARS
LAQHIDPQER VFIAETSFEM YLPNLPNCIN LVELVYGDKT IVDMTQLCKT MNRNNPDRSI
VGEIRSREII AASQIASSTS GGFWTTGHAG DVNDLRTRLF GMFLDGGVQL PVEFLDEIIR
SMFHFLVFLD KSFDGMRTLM SLVEVTPEGY RTIIRYDTKA FAASRGKERR WIYENTVTPE
KMGKLAFSGA ELKPEYEKVP EKYLC