Gene Information |
Locus tag | Moth_0335 |
Symbol | |
ID | 3831579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 339322 |
End bp | 340680 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637828270 |
Product | type II secretion system protein E |
Protein accession | YP_429212 |
Protein GI | 83589203 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4962] Flp pilus assembly protein, ATPase CpaF |
TIGRFAM ID | |
|
|
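The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates (the record does not state the convention, but it agrees with the listed gene length) and assumes the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 339322
end_bp = 340680
gene_length_bp = 1359
protein_length_aa = 452


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1359
print(expected_protein_length(gene_length_bp))  # 452
```

Both results agree with the Gene Length and Protein Length fields, which is what one would expect for an unspliced, non-pseudo CDS.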
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGGTA TTGTCAAAAA CGACAGGGCG GATAAGGCGG ATAAAAAAGA GGAATACCGG CTAGGCAAGC GCACCCTGCA AGAAGCAACC AAATTTGTCC AGGACATCAT TACCAACAGC GAGGTGTGGG GAGAAGAAGC CTTCAGACAT AAAGAAATAC TTGAAGACGC CCAGGCCGGG CTGCCCGGGG CGATAGAAAA GGCCAGGGAG CTTATAAAAG AAATCCTGGA TAAATACCAG GTGGAAGTAG AGGGGATGAC AAGAGAGCAA CTGGCCAGGG AAATATTCAG CTACGCCTGG GGGCTGGACG TGCTGGAAGA AGCGTATTAC GACCCTGAAG TGGACGAGAT CCGGGTCAAC GGCCCCTCTG CCGTTTTTAT CCAAAAGCGG GGCAAAAACG TAAAAACCGG TATCAAGTTC AAGGATGCCG AGCACGTCAA GAAGATTATC GCCAGGCTCC TGTTTCACGA CCGGGGTGTG GCCCTGACGG CCTCGACGCC CATAGTCGAG TCCATCCGCA AGGACGGCAC CCGGTTGACG GCCACCTGCC CGCCGGTGAC CCGAGAGTGG ACCATGGTCC TGCGGAAGCA TGACACCTTC CAGATGACTC CGGAAAACCT CATCAGGGCC GGGACCTTGA ATCAGGAGCT GCTGGACCTC CTGATTACCA TGGTCAGGGG CCGGGCCAAC ATTCTTATCT CCGGCGGCGT CGGCAGCGGC AAGACCTCAT TGATGCGCTT TTTAATCAGC TATATCCACG AGATACTGCG GATAGTAACC CTGGAAACCG ACGTTGAGCT GCGTTTATGC GAACACTACT GCGGCCGGGA CATAATCGAG CTTGAGGAAC ACGCCGATTT AAACTGCGAC ATGAAAAAGC TTTTTCGTAC AACCCTGCGT TACTCCCCGG ACATCATCAT GGTGGGCGAG ATCCGGGGCA TGGGTGAGGC GGTGGAAGCC ATCAAAGCCT GCACAAGGGG CCTGCATGGC TCTATGGCGA CGATCCACTT CGGATCTCCC TATGAGGCCG TGACCGGCTG CGCCAAGATG ATGCTGGAGG AAGGGCTGAA CCTGCCCCTG GAGATAGCCG AAACCTGGGT GGCCGACGCC TTCGACGTGA TTATCCAGAT GTTTGCCGAC AGCACCCGGG GGATAAAGAA GATCGTCCAG GTGACTGAAG TATGGCCGGA AAAGAGAGGC GTCAATTTTC ACGATCTCGT TGTCTGGCGG CCCAGTAAAT ATGATTACTT TGAAGGTGAA TGGGAGTTCG TCAATCCTCC CAGCGAGCGC TTGCAGGAAA AGTTGTTTAA ATACGGCGTT AACATGTCCC GGTTTTCCAG TAAGGCGGGT GCTGCCTAA
|
Protein sequence | MFGIVKNDRA DKADKKEEYR LGKRTLQEAT KFVQDIITNS EVWGEEAFRH KEILEDAQAG LPGAIEKARE LIKEILDKYQ VEVEGMTREQ LAREIFSYAW GLDVLEEAYY DPEVDEIRVN GPSAVFIQKR GKNVKTGIKF KDAEHVKKII ARLLFHDRGV ALTASTPIVE SIRKDGTRLT ATCPPVTREW TMVLRKHDTF QMTPENLIRA GTLNQELLDL LITMVRGRAN ILISGGVGSG KTSLMRFLIS YIHEILRIVT LETDVELRLC EHYCGRDIIE LEEHADLNCD MKKLFRTTLR YSPDIIMVGE IRGMGEAVEA IKACTRGLHG SMATIHFGSP YEAVTGCAKM MLEEGLNLPL EIAETWVADA FDVIIQMFAD STRGIKKIVQ VTEVWPEKRG VNFHDLVVWR PSKYDYFEGE WEFVNPPSER LQEKLFKYGV NMSRFSSKAG AA
|
| |
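The sequence fields are printed in space-separated blocks of ten characters, so any computation on them first has to strip the whitespace. The following is a minimal sketch of how GC content and the translation might be recomputed from the gene sequence. It assumes the amino acid assignments of translation table 11 match the standard code (table 11 differs only in its permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Build the codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq):
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGTTTGGTA TTGTCAAAAA CGACAGGGCG"
print(translate(first_blocks))  # MFGIVKNDRA
print(gc_content(first_blocks))
```

Passing the full Gene sequence field to `gc_content` and `translate` should allow a comparison against the GC content and Protein sequence fields of the record. The sketch does not handle ambiguous bases such as N.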