Gene Elen_0042 details

Gene Information

Locus tag: Elen_0042
Symbol:
ID: 8414321
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 55597
End bp: 57303
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 65%
IMG OID: 645023017
Product: type II secretion system protein E
Protein accession: YP_003180425
Protein GI: 257789819
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID:
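
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(55597, 57303)
print(length)                  # 1707
print(protein_length(length))  # 568
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.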


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTACA AACGGTTGGG AGACGTGCTG ATAGACGCGG GCCTCATCAC CGAGGACCAG 
CTGGGACACG CGCTCAAACA GCAGAAAGAG ACGAAACGCC GCCTGGGCGA CGAGCTCATC
GCCGAAGGCG TCATCACGGA AGCGGGCCTC ATCGAGGCGC TGCAGATGCA GCTGGGCGTG
GAGTTCGTCG ACCTTTCGGC GATCGACCTC GACCCCGAGC TAAGCCGCGT GATCAGCAAG
AACGTCGCAC GCCAGTACAA CGTGGTGCCC GTGCGCACTT CGCCCGACGA GGTGTGCCTG
GCCATGAGCG ACCCGCTGAA CTTCATGGCC ATCGAAGCGG TGAAGAACGC CACTCGGAAG
CGCGTCATCC CTATGGTGAC CACGCATGAT TCGCTGATGC GCGCCATCAT GACGCTGTAC
GGCAACGAGG GTGCCGCCCG CGCCATCGAG GAGATGAAGC GCGACGCCCG CACGACAGGC
GCCGACGACG CCTCGACCGG ATCGTTCCAA ACCTCCACGC TGGGCGACGA CGCCGACGCG
CAATCGGCTC CCACGGTGCG TCTCGTGAAC AGCATCATCG AGCGAGCCGC CACCGAGCGC
GCCAGCGATA TCCACCTGGA GCCGCGCGAG ATCGACCTGC ATGTGCGCAT GCGTATCGAC
GGCGTGCTGC GCACGATCCT CACGGTGCCG AAGGAGCTGC AGGCTTCGGT CATCTCGCGT
TTGAAGATCA TGGGCGGTAT GAACACCTCG GAGCGCCGCG TGCCTCAGGA CGGCCGCGCA
AACATCCGCT TGAAGAAGCA GGACATCGAC CTGCGCATCA ACACGCTGCC CACCATCCAC
GGCGAGACGG TGGTCATCCG CCTGCTGGAC AAGAGCGAGG CGCTGTTCGA CCCGGCGGGC
ATCGGCCTGG AGGGCGACAA CCTGGAGAAG TACCAGCGGC TCATCGGCTC GAACAACGGC
ATGGTGCTGA TCGTGGGCCC CACCGGCTCG GGCAAAAGCT CCACCATGTA CACGATGATC
CGCCAGCTGA ACACCGATTC GGTGAACCTC GTCACGCTGG AAGACCCTGT GGAATACAAC
ATCGACGGCG TGAACCAGGT GCAGATCAAC GAGAAAACGG GCATGACCTT CGCCAGCGGC
CTGCGCGCCA TCCTGCGCCA GGACCCCGAC ATCGTGGCGG TGGGCGAGAT TCGCGACGGC
GAGACGGCCG AGATCGCCAT GCGCGCCGCC ATCACCGGCC ACCTGGTGCT GTCCACCGTG
CACACCTACG ACGCGGCGTC CACCATCGAC CGTCTCATCG ATATCGGCGT GGAGCCGTAC
CTCATCGCCA GCGGCGTGCG CGGCGTCATC TCGCAGCGCC TCGTGCGCAA GGTGTGCCCC
CATTGCCGCG AGGAGTATCG GCCCAGCCCC GAGGAGTTTG AGGCTATCGG CCTGCGATAC
GATCCCGGCG TGAGGTTCTA CCGCGGAGCC GGGTGCCCCA TGTGCTTCGG CACGGGCTAC
CGCGGGCGCA CGGGCGTGTT CGAGATCCTG GTGATCGACC GCGAGCTGCG CAGTCGCATC
ACGGGCGGCG CCACGCGCGA GGAGCTGAAG GACGCCATCG AGCGCACGGG CTCGTTCAAG
ACGATGGAGG ACAGCTGCCG CGAGCTGGTG CTGACCGGCG TGACCACCGT CGAGGAAGCC
CGCAAAACCA TCACCGCGCT GGAGTAA
 
Protein sequence
MAYKRLGDVL IDAGLITEDQ LGHALKQQKE TKRRLGDELI AEGVITEAGL IEALQMQLGV 
EFVDLSAIDL DPELSRVISK NVARQYNVVP VRTSPDEVCL AMSDPLNFMA IEAVKNATRK
RVIPMVTTHD SLMRAIMTLY GNEGAARAIE EMKRDARTTG ADDASTGSFQ TSTLGDDADA
QSAPTVRLVN SIIERAATER ASDIHLEPRE IDLHVRMRID GVLRTILTVP KELQASVISR
LKIMGGMNTS ERRVPQDGRA NIRLKKQDID LRINTLPTIH GETVVIRLLD KSEALFDPAG
IGLEGDNLEK YQRLIGSNNG MVLIVGPTGS GKSSTMYTMI RQLNTDSVNL VTLEDPVEYN
IDGVNQVQIN EKTGMTFASG LRAILRQDPD IVAVGEIRDG ETAEIAMRAA ITGHLVLSTV
HTYDAASTID RLIDIGVEPY LIASGVRGVI SQRLVRKVCP HCREEYRPSP EEFEAIGLRY
DPGVRFYRGA GCPMCFGTGY RGRTGVFEIL VIDRELRSRI TGGATREELK DAIERTGSFK
TMEDSCRELV LTGVTTVEEA RKTITALE
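
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the two can be compared, the code below strips that grouping, translates the DNA codon by codon, and computes a GC fraction. It assumes the sense-codon assignments of translation table 11 match the standard genetic code (the tables differ in permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
print(translate("ATGGCGTACA AACGGTTGGG AGACGTGCTG"))  # MAYKRLGDVL
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed in the record, and `gc_fraction` on the same input can be compared with the GC content field.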