Gene Elen_0883 details

Gene Information

Locus tag: Elen_0883
Symbol:
ID: 8415173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1080923
End bp: 1082221
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 73%
IMG OID: 645023848
Product: Flp pilus assembly protein CpaB
Protein accession: YP_003181245
Protein GI: 257790639
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3745] Flp pilus assembly protein CpaB
TIGRFAM ID: [TIGR03177] Flp pilus assembly protein CpaB
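
The coordinates and lengths listed above are mutually consistent, which a short Python sketch can confirm. It assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1080923
end_bp = 1082221

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1299 432
```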


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGATA ACCGAGACGA GATCCAGCGA CTTCGCGCCA TGAGGGCCGA GGCCATCAAC 
CGCGGCGACT TCGGCCGCGC CGACGCGATC GGCGCCGACC TCGCGCGCCT CGAGTCCGCC
GCGGGCGGTA CCATCGACCC CATCACGGGC ATGCCCGCGG CATCGGCAAG CGTCCCGTCC
GCACCGCAGG GGGCGCCCTT CCCGCAGCGC CGCGACGAGC GACCCGCGCA GGGCCGTCCC
GCCTACGAGG ACCAGCACGG CCCCGCGCCG CGCCAATCGG CCCCGGCCCC GCAGCCCGCG
TCCCCGTCCG CGCCGCAGGG AGCCCCTGCA CCCCGCCGCG AGGCGCGCGA CGCCCGCCGC
ATGGAGGCCG ACGTCCACCA GCAGGAGAAT ACATACCGCC ACGAGCCCGC GCAGGAGAGC
GCATACCGCC CCGAGCCGAG CCGCCCGGCC TACGCCGGCG ATGACCGCGG TGCGAAAACC
GAGGCCGCTG CCGAGGAGCG CGGCGGTAGC CGCGAGGAGA AGCGCCGCGG GCGCTTCGGC
AAGGCCGAGG GCAAGAAGGA TGCCAAGCCC GCCGAGGGGC GCGATCCCTC GACCGCGCCG
CGGCGCGCGG CGCCCAAGCC CGCCCCCGCC GGCCCCTCGA AGGGCACGCG CGCGCTCACG
GTGGTGGCCG CCGCGGCCAT CGCCGTGTCG GTGGGAGCCA CCGTGTTCTC CGGCATGCGG
GTCGCGGAGT CCTCGGCGAT CATCGCCAAG AACGAGGCGA ATTCCGTTAA CGTCGTCGTG
ACCAACCGCG ACGTCGCCGC CGGCGAGACC ATCACCGAAG CCGACCTCGA GACGCAGGCC
GTCCCCAAGG CGTACTGCCC GACCGACGCC GCGACCAAGG TCTCGGATGT CGCCGGCCAC
ACCTCGCTCA CCACGCAGAC CGCCGGGACC TCCATCTCGC TGTCCTCCCT CCAGGCATCG
AGCTCGCCGG CGCACATCAC GTCGGCCATC GAGGACGGCC ATGTGGCCAT CGCCCTGTCG
CTCGACTCCT CCAAGAGCCT GTCGCCGCTG CTGCGCGTCG GCGACCGCGT CAACGTCATG
GCCGTCGTCT CCGACGGCGC GACGTCGAGC GCCGAGACGG TGTGCGCCAA CGTCAAGATC
ATCGCCCTCG ATTCCGCCCT GTCCGGCTCG CCGGACGCCG GGTACTCGCT CGTGACGCTC
GACGTCACCG AGGACCAGGC CGCGGCCATC GTGGCGAACC CGAACGTGAC GCTCACGGCC
ATCCCGCAGA CCGCCGAGGG GGCCAGCGAT GCTGAATAG
 
Protein sequence
MNDNRDEIQR LRAMRAEAIN RGDFGRADAI GADLARLESA AGGTIDPITG MPAASASVPS 
APQGAPFPQR RDERPAQGRP AYEDQHGPAP RQSAPAPQPA SPSAPQGAPA PRREARDARR
MEADVHQQEN TYRHEPAQES AYRPEPSRPA YAGDDRGAKT EAAAEERGGS REEKRRGRFG
KAEGKKDAKP AEGRDPSTAP RRAAPKPAPA GPSKGTRALT VVAAAAIAVS VGATVFSGMR
VAESSAIIAK NEANSVNVVV TNRDVAAGET ITEADLETQA VPKAYCPTDA ATKVSDVAGH
TSLTTQTAGT SISLSSLQAS SSPAHITSAI EDGHVAIALS LDSSKSLSPL LRVGDRVNVM
AVVSDGATSS AETVCANVKI IALDSALSGS PDAGYSLVTL DVTEDQAAAI VANPNVTLTA
IPQTAEGASD AE
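
The relationship between the two sequences above can be illustrated with a minimal Python sketch that translates a gene sequence and computes its GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of the record. The sketch uses the standard codon-to-amino-acid assignments, which translation table 11 shares; table 11 differs in its set of permitted start codons, which does not matter here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (first base varies slowest, third fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases in the sequence that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as displayed in the record.
first_bases = "ATGAACGATA ACCGAGACGA GATCCAGCGA"
print(translate(first_bases))  # prints: MNDNRDEIQR
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the listed protein sequence and GC content to be cross-checked in the same way.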