Gene Moth_1554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1554 
Symbol 
ID3832187 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1596760 
End bp1598421 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content62% 
IMG OID637829486 
Producttype II secretion system protein E 
Protein accessionYP_430406 
Protein GI83590397 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02533] general secretory pathway protein E
[TIGR02538] type IV-A pilus assembly ATPase PilB 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.185461 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAGTC GACGACGACT GGGGGACCTG TTGATCGAAG CCGGGATGCT TACCCCGGCC 
CAGCTGGAAC AGGCCCTGCA GGAACAGAAA CGCAGCGGGG AGCGCCTGGG TAAGGTTTTA
ATCCGCCTGG GATTTATCAC CGAGGCCAGC ATGCTGGAGG TCCTGGAGTT CCAGCTGGGG
ATCCCCAAGG TGGTCCTGGC TGACTACCAC CTGGATCCGG AGGTGGTCCG CCTGGTGCCG
GAAGGCCTGG CCCGGCGCTA CCAGGCCATC CCCATCCGCC TGGACGGCAA CCGCCTCCTG
GTGGCCATGG CCGATCCCCT GAACCTCGTG GCCCTGGACG ACCTGCGCCT GGTCACCGGC
AAGGAGATTA TGCCGGCTAT AGCCGCCGAG AAGGAAATCG AGGCAGCTTT AAGCCGGTTC
TGGCAACGGG AACCCGTTAC GAGCATGAGC GAAGTAGCGG CAGCCGTCGC CGCCGCGGAA
TCTGGCGGGC GCGCCGGCGG CACGGAAGGC GCGCCGGCTG TGCGCCTGGT CAACAGTTTT
ATCCAGCAGG CCATCCAGAC CCGGGCCAGC GACATCCATA TAGAGCCCCA GGAGGGGGAG
GTCCGGGTGC GCCTGCGGGT AGACGGCCTG CTGCGGGAGT TGACCCGCCT GCCCCTGGGG
GTTTTAAGTA GCCTGATCTC CAGGATCAAG ATCATGGCCG GCATGGACAT CGCCGAAAAA
CGCTTGCCCC AGGACGGCCG TTTTCAGTTT ACCCTGGGTA AACGCAGTGT CGACCTCAGG
GTTTCCAGCC TGCCTACTGT TTACGGCGAA AAGATCGTCC TGCGCCTCCT GGACCAGGAG
GCCATGCTCC TGCCCCTGGA CGACCTGGGA TTTTTGCCGG CCATAAAAGA ACGCTTTGAG
AGTCTCATCC ACAGTTCCTA CGGCATGCTC CTCATTACCG GTCCCACGGG CAGCGGTAAG
ACGACGACCC TTTATGCTAC TCTTAACATT TTAAGCTCGC CGGAAAAAAA TATCATTACC
ATTGAGGATC CGGTAGAATA CCTGCTGCCC GGCATCAATC AGGTGCGGGT TAACCCCAAG
GCCGGCCTGA CCTTTGCTTC AGGGCTGCGT TCCATCCTGC GTCAGGACCC GGATATCATT
ATGGTCGGGG AGATTCGCGA CCGGGAGACG GCCGATATCG CCGTCCGGGC GGCGACTACC
GGTCACCTGG TCTTAACGAC CCTGCACACC AATGACGCCG CCGGCGCCGT AACCCGCCTC
CTGGATATGG GAGTGGAAGG CTACCTGGTC AATTCCTCCC TTATTGGCGT GGTGGCCCAG
CGCCTGGTGC GCCGCATCTG TCCCCATTGC CGGGAGATGT ACGAGCCGGA GCCGGGCTCT
CCGGAAAGGG CCTGGTTGCC GGGCGCGGAA CGGCTCTGGC GCGGCCGGGG TTGCGAAAAC
TGCCATTATA CCGGTTACAC CAACCGGACG GCCATCCAGG AGGTCCTGGT CATGAATGAA
GAACTCCGGC GCCTGGTAGC CGCCAAGGCG CCGGCTACGG CCCTGAAGGA GGCAGCGGTG
GCCGGCGGTA TGGTTCCTTT GATTGACGAC GGTTTGGAAA AAGCCCGCCA GGGGATCACT
ACGGTGAGCG AGGTCCTACG CGTTTCCCTG GGAGGTTTGT AA
 
Protein sequence
MDSRRRLGDL LIEAGMLTPA QLEQALQEQK RSGERLGKVL IRLGFITEAS MLEVLEFQLG 
IPKVVLADYH LDPEVVRLVP EGLARRYQAI PIRLDGNRLL VAMADPLNLV ALDDLRLVTG
KEIMPAIAAE KEIEAALSRF WQREPVTSMS EVAAAVAAAE SGGRAGGTEG APAVRLVNSF
IQQAIQTRAS DIHIEPQEGE VRVRLRVDGL LRELTRLPLG VLSSLISRIK IMAGMDIAEK
RLPQDGRFQF TLGKRSVDLR VSSLPTVYGE KIVLRLLDQE AMLLPLDDLG FLPAIKERFE
SLIHSSYGML LITGPTGSGK TTTLYATLNI LSSPEKNIIT IEDPVEYLLP GINQVRVNPK
AGLTFASGLR SILRQDPDII MVGEIRDRET ADIAVRAATT GHLVLTTLHT NDAAGAVTRL
LDMGVEGYLV NSSLIGVVAQ RLVRRICPHC REMYEPEPGS PERAWLPGAE RLWRGRGCEN
CHYTGYTNRT AIQEVLVMNE ELRRLVAAKA PATALKEAAV AGGMVPLIDD GLEKARQGIT
TVSEVLRVSL GGL