Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1554 |
Symbol | |
ID | 3832187 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1596760 |
End bp | 1598421 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637829486 |
Product | type II secretion system protein E |
Protein accession | YP_430406 |
Protein GI | 83590397 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | [TIGR02533] general secretory pathway protein E [TIGR02538] type IV-A pilus assembly ATPase PilB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.185461 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAGTC GACGACGACT GGGGGACCTG TTGATCGAAG CCGGGATGCT TACCCCGGCC CAGCTGGAAC AGGCCCTGCA GGAACAGAAA CGCAGCGGGG AGCGCCTGGG TAAGGTTTTA ATCCGCCTGG GATTTATCAC CGAGGCCAGC ATGCTGGAGG TCCTGGAGTT CCAGCTGGGG ATCCCCAAGG TGGTCCTGGC TGACTACCAC CTGGATCCGG AGGTGGTCCG CCTGGTGCCG GAAGGCCTGG CCCGGCGCTA CCAGGCCATC CCCATCCGCC TGGACGGCAA CCGCCTCCTG GTGGCCATGG CCGATCCCCT GAACCTCGTG GCCCTGGACG ACCTGCGCCT GGTCACCGGC AAGGAGATTA TGCCGGCTAT AGCCGCCGAG AAGGAAATCG AGGCAGCTTT AAGCCGGTTC TGGCAACGGG AACCCGTTAC GAGCATGAGC GAAGTAGCGG CAGCCGTCGC CGCCGCGGAA TCTGGCGGGC GCGCCGGCGG CACGGAAGGC GCGCCGGCTG TGCGCCTGGT CAACAGTTTT ATCCAGCAGG CCATCCAGAC CCGGGCCAGC GACATCCATA TAGAGCCCCA GGAGGGGGAG GTCCGGGTGC GCCTGCGGGT AGACGGCCTG CTGCGGGAGT TGACCCGCCT GCCCCTGGGG GTTTTAAGTA GCCTGATCTC CAGGATCAAG ATCATGGCCG GCATGGACAT CGCCGAAAAA CGCTTGCCCC AGGACGGCCG TTTTCAGTTT ACCCTGGGTA AACGCAGTGT CGACCTCAGG GTTTCCAGCC TGCCTACTGT TTACGGCGAA AAGATCGTCC TGCGCCTCCT GGACCAGGAG GCCATGCTCC TGCCCCTGGA CGACCTGGGA TTTTTGCCGG CCATAAAAGA ACGCTTTGAG AGTCTCATCC ACAGTTCCTA CGGCATGCTC CTCATTACCG GTCCCACGGG CAGCGGTAAG ACGACGACCC TTTATGCTAC TCTTAACATT TTAAGCTCGC CGGAAAAAAA TATCATTACC ATTGAGGATC CGGTAGAATA CCTGCTGCCC GGCATCAATC AGGTGCGGGT TAACCCCAAG GCCGGCCTGA CCTTTGCTTC AGGGCTGCGT TCCATCCTGC GTCAGGACCC GGATATCATT ATGGTCGGGG AGATTCGCGA CCGGGAGACG GCCGATATCG CCGTCCGGGC GGCGACTACC GGTCACCTGG TCTTAACGAC CCTGCACACC AATGACGCCG CCGGCGCCGT AACCCGCCTC CTGGATATGG GAGTGGAAGG CTACCTGGTC AATTCCTCCC TTATTGGCGT GGTGGCCCAG CGCCTGGTGC GCCGCATCTG TCCCCATTGC CGGGAGATGT ACGAGCCGGA GCCGGGCTCT CCGGAAAGGG CCTGGTTGCC GGGCGCGGAA CGGCTCTGGC GCGGCCGGGG TTGCGAAAAC TGCCATTATA CCGGTTACAC CAACCGGACG GCCATCCAGG AGGTCCTGGT CATGAATGAA GAACTCCGGC GCCTGGTAGC CGCCAAGGCG CCGGCTACGG CCCTGAAGGA GGCAGCGGTG GCCGGCGGTA TGGTTCCTTT GATTGACGAC GGTTTGGAAA AAGCCCGCCA GGGGATCACT ACGGTGAGCG AGGTCCTACG CGTTTCCCTG GGAGGTTTGT AA
|
Protein sequence | MDSRRRLGDL LIEAGMLTPA QLEQALQEQK RSGERLGKVL IRLGFITEAS MLEVLEFQLG IPKVVLADYH LDPEVVRLVP EGLARRYQAI PIRLDGNRLL VAMADPLNLV ALDDLRLVTG KEIMPAIAAE KEIEAALSRF WQREPVTSMS EVAAAVAAAE SGGRAGGTEG APAVRLVNSF IQQAIQTRAS DIHIEPQEGE VRVRLRVDGL LRELTRLPLG VLSSLISRIK IMAGMDIAEK RLPQDGRFQF TLGKRSVDLR VSSLPTVYGE KIVLRLLDQE AMLLPLDDLG FLPAIKERFE SLIHSSYGML LITGPTGSGK TTTLYATLNI LSSPEKNIIT IEDPVEYLLP GINQVRVNPK AGLTFASGLR SILRQDPDII MVGEIRDRET ADIAVRAATT GHLVLTTLHT NDAAGAVTRL LDMGVEGYLV NSSLIGVVAQ RLVRRICPHC REMYEPEPGS PERAWLPGAE RLWRGRGCEN CHYTGYTNRT AIQEVLVMNE ELRRLVAAKA PATALKEAAV AGGMVPLIDD GLEKARQGIT TVSEVLRVSL GGL
|
| |