Gene Moth_0335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0335 
Symbol 
ID3831579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp339322 
End bp340680 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content55% 
IMG OID637828270 
Producttype II secretion system protein E 
Protein accessionYP_429212 
Protein GI83589203 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGGTA TTGTCAAAAA CGACAGGGCG GATAAGGCGG ATAAAAAAGA GGAATACCGG 
CTAGGCAAGC GCACCCTGCA AGAAGCAACC AAATTTGTCC AGGACATCAT TACCAACAGC
GAGGTGTGGG GAGAAGAAGC CTTCAGACAT AAAGAAATAC TTGAAGACGC CCAGGCCGGG
CTGCCCGGGG CGATAGAAAA GGCCAGGGAG CTTATAAAAG AAATCCTGGA TAAATACCAG
GTGGAAGTAG AGGGGATGAC AAGAGAGCAA CTGGCCAGGG AAATATTCAG CTACGCCTGG
GGGCTGGACG TGCTGGAAGA AGCGTATTAC GACCCTGAAG TGGACGAGAT CCGGGTCAAC
GGCCCCTCTG CCGTTTTTAT CCAAAAGCGG GGCAAAAACG TAAAAACCGG TATCAAGTTC
AAGGATGCCG AGCACGTCAA GAAGATTATC GCCAGGCTCC TGTTTCACGA CCGGGGTGTG
GCCCTGACGG CCTCGACGCC CATAGTCGAG TCCATCCGCA AGGACGGCAC CCGGTTGACG
GCCACCTGCC CGCCGGTGAC CCGAGAGTGG ACCATGGTCC TGCGGAAGCA TGACACCTTC
CAGATGACTC CGGAAAACCT CATCAGGGCC GGGACCTTGA ATCAGGAGCT GCTGGACCTC
CTGATTACCA TGGTCAGGGG CCGGGCCAAC ATTCTTATCT CCGGCGGCGT CGGCAGCGGC
AAGACCTCAT TGATGCGCTT TTTAATCAGC TATATCCACG AGATACTGCG GATAGTAACC
CTGGAAACCG ACGTTGAGCT GCGTTTATGC GAACACTACT GCGGCCGGGA CATAATCGAG
CTTGAGGAAC ACGCCGATTT AAACTGCGAC ATGAAAAAGC TTTTTCGTAC AACCCTGCGT
TACTCCCCGG ACATCATCAT GGTGGGCGAG ATCCGGGGCA TGGGTGAGGC GGTGGAAGCC
ATCAAAGCCT GCACAAGGGG CCTGCATGGC TCTATGGCGA CGATCCACTT CGGATCTCCC
TATGAGGCCG TGACCGGCTG CGCCAAGATG ATGCTGGAGG AAGGGCTGAA CCTGCCCCTG
GAGATAGCCG AAACCTGGGT GGCCGACGCC TTCGACGTGA TTATCCAGAT GTTTGCCGAC
AGCACCCGGG GGATAAAGAA GATCGTCCAG GTGACTGAAG TATGGCCGGA AAAGAGAGGC
GTCAATTTTC ACGATCTCGT TGTCTGGCGG CCCAGTAAAT ATGATTACTT TGAAGGTGAA
TGGGAGTTCG TCAATCCTCC CAGCGAGCGC TTGCAGGAAA AGTTGTTTAA ATACGGCGTT
AACATGTCCC GGTTTTCCAG TAAGGCGGGT GCTGCCTAA
 
Protein sequence
MFGIVKNDRA DKADKKEEYR LGKRTLQEAT KFVQDIITNS EVWGEEAFRH KEILEDAQAG 
LPGAIEKARE LIKEILDKYQ VEVEGMTREQ LAREIFSYAW GLDVLEEAYY DPEVDEIRVN
GPSAVFIQKR GKNVKTGIKF KDAEHVKKII ARLLFHDRGV ALTASTPIVE SIRKDGTRLT
ATCPPVTREW TMVLRKHDTF QMTPENLIRA GTLNQELLDL LITMVRGRAN ILISGGVGSG
KTSLMRFLIS YIHEILRIVT LETDVELRLC EHYCGRDIIE LEEHADLNCD MKKLFRTTLR
YSPDIIMVGE IRGMGEAVEA IKACTRGLHG SMATIHFGSP YEAVTGCAKM MLEEGLNLPL
EIAETWVADA FDVIIQMFAD STRGIKKIVQ VTEVWPEKRG VNFHDLVVWR PSKYDYFEGE
WEFVNPPSER LQEKLFKYGV NMSRFSSKAG AA