Gene Elen_0110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0110 
Symbol 
ID8414393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp146565 
End bp149765 
Gene Length3201 bp 
Protein Length1066 aa 
Translation table11 
GC content66% 
IMG OID645023089 
ProductCna B domain protein 
Protein accessionYP_003180493 
Protein GI257789887 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4932] Predicted outer membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCCCC AGCCAACTGT TCCTACCTGT CCGGTTTCGC GCGCGGGTCG CTTGCGCGCG 
TTCTTCGTTC TGATAGGGCT GGTCGCCCTC TGTCTTCTGG CGTTCCGCTC GCCGGCGTTC
GCCGAGGAAG CCCAGCCCGA TCCGTCCGAC GACGCGCCTA CGCCGCGCGC GGCGATGGAC
GTGACCGACA AGGTGTCCGT CGACGGCGTC AAGCTGCAGA AGAAATCGGG CGACAGCTGG
TTGGACATTC CCGCCGGCAC CACGCTGACC AGCGGCGCGT ACGTGCGCAT CTATATCGAC
TGGAGCATTC CCGATATGAC GGACGTGCAT GCGGGCGACA CGTTCACGTT CACCGTCAAC
GGGGATGATC ACTTCCTGGC CAGCGATTTC GGCCCGGTGA ACCTCGTCGA TCCCACGACC
CAGAAGGTGA TCGGCTCGTA CGTGGTCAAC GGCAACCGCG ATGCGAACGG CAGCATGATC
CCCGGTCAGG ACATCACTAT CGTGACCACG CTTTCCGACG ATGGGGCGCA GTTCCCCTCG
CTGCACAATG GCTTCTTCTC GCTTGAGGGC TACGTGACGG GCCAAGGCAA CGATATCGTG
TTCACGGTTA ACGGCCAAGC GCTTCCGTCT ATCGGCGTCG AACCGCCTAC GACGGGCCCG
CTGCCCGATA CGCCGCTGAT GAAGTACGGC TCGCAGGTAG CCGGCCAGAA CCAGATCGTG
TGGAGCATCG GCGTGAACCT GGACAACATC GTGGACGCGT ACGCGAACTA TGCCGTGGGC
GGTTCCGACC CGTCGCCGCA GCAGCGCAAC CTGTTGCTGA CCGACGACTT GCAGGGCGGG
CAGGCCATCA CGTCGGGCGG CGTGACGGTG TACATGCCCG TGGTGGCCAC CACCGACGCG
GGCGAGGCGC AGACCGAGCA GTACGCGGCC TATCCCATCA CCGACCTGTT CGCGCTGACC
GACAAGTCGG CGGCCGACGG CATGACGCAG GACGAGTTCG CGAGCAGCGT GCGCATGGCA
GCCGTGCCCA CCATCGGCGT GTGGGAGAAC CGCGTGGTGT ACATCGGGTT CGGCAACGTG
CCCACGGGCA ATGGGTCGAG CCCGCTCAAT ATCGGAAACA TCCTGGACGG CCAGGGCGAG
GCGGGACTTT CCTGGCTGCT GGAAGAAAAA GGCGCCACGC CTGCTCAGAA AGCCATCATC
ATGAAGTACT TCAGCGCCTC GGGGCCGAAC AAGGGCGACT TCACGTCGTT CATCGTAGAG
CTGCGCAGCG ACGCGTCCGA AACCGGGCAG TACGAGAACA ACGCCACGCT CACCTATGGC
GACAGCGGGT CGGAAGCGGC GCCCGGAAGC GCGCATTTCA CCGTGATCTC GGGCGGCGTG
GAGGTGAAGG ACGGCAAGGC CGTTCTGAAG AAGGTGGACG CGGGCAATCC GGACGCCACG
CTGCCGGGCG CCGTGTTCAG GTTGGAGAAG ATGCAGCCTG ACAACACCTG GGCGACGGTT
GCGGGTTCCG AGCAGCTGAC TACGGATGGA AGCGGGCTGA TCACGGTGAC GGGGTTGCTG
TTGGGCCAGT ACCGTTTCGT GGAGATGGTG CCGCCCGTCG GGTATGAGAT GACGAGCGAG
TCGGTGGAAT TCGCCATCAC GTCGACCACG CTGAACCATA CCGCGAACGT TACGGCCGAG
AACAGAAGGG CCCCCGTGCT GGGCAAGGTC GTGCTGACGA AGGTGGACGC GGACGACGAA
TCTGTGAAGC TCCCGGGCGC CGTGTTCAAG CTGGAAGAGC AGGCGGCGGA CGGGTCGTGG
GTCGAAGTGC CGGGATACGA GTCGCTTGTC ACCGATGGCA GCGGCTTGAT CGAGGTGGAA
CGTCTCGCGA TGGGGGCGTA CCGCTTCGTG GAGGTTTCGG CTCCCGAGGG CTACGAGTTG
GAGACGGCGC CGGTGGAGTT CACGCTGGCC AAGGACGCGC CCGGGTTGGA GGTGGCCGTG
ACGGCTACGA ATACCAAGAG CCCGGTGTTC GGGAAGGCGG TGCTGAAGAA GGTTGACGCC
GAGGCTCCCG ATGCCGTGCT GCCGGGCGCG ACGTTCAAGC TGGAGCAGCG CCTCGCCGAC
GATTCCTGGG AGATCGTGCC GGGCCACGAT GCGCTTGTCA CCGACGCCGG CGGCCTGATC
GAGGTGGCCG ACTTGGCGGT GGGCAGCTAC CGCTTCGTGG AAACCGCGGC TCCCGAAGGC
TACGTGCTGG ACGACACGCC CCGCGAGTTC GCCATCGACG CCGCCGCGCC GGCGCCCATC
GTCGTGAATC TCACGGCTGC GAACGCCAAG GAACCGCCCG CGCCGCTGGG CAAGGTCGTG
CTGACGAAGG TGGACGCGGA TTCCCCGACG ACGGTGCTCC CGGGCGCCGT GTTCAAGCTG
GAGGCGCAGA CGGCCGACGG AAGCTGGGAG CCCGTGCCGG GTTCCCAGCG CCTGACCACG
GACGCCGACG GTCTGATCGA GGTAACCGAT CTGCCGATGG GCGCGTACCG CTTCGTGGAG
CTTTCGGCTC CCGAGGGTTA CGAGCTGGAG ACGGTGCCGG TGGAGTTCAC GCTGGCCGAG
GACGCGCCCG GCCTTGCGGT GGCCGTGACG GCTACGAACG TGAAGACCCC TGTGCTGGGC
GGCGCGCTGC TCACGAAGGT GGATGCCGAC GATGCGACGA CGGTGCTTGC CGGAGCGGTG
TTCAAGTTGG AGCAGCGTCT GTCCGACGGG ACGTGGGCCG TGGTCGACGG CTTCGACGCG
CTCGCCACGA ACGATGAGGG AATCATCGAG GTGCATGGTT TGCCGGTGGG AAGCTACCGC
TTCGTAGAGA CGGCCGCCCC CGAGGGCTAC GTGTTGGATG AGACGCCGGT TGAGTTCGGA
GTGGAAGTGG GCCAGCCCGA GCAGGTCGAA CTCACTGTGG AGAACGCTCC GGTGCCGCCC
GACCCGGTTG ACCCGGTGGA TCCCAACGAT CCCACCGACC CGACCGACCC GGCGAAACCC
GTCGACCCTG CCGCGCCGGG CACCCCATCG ACCTCGCAAC CGTCGACTCC GAATGCCCCG
GGCAAGACGT CCTCTGCATC GTCGATCGCC CGCACCGGCG ACGCTGTTCC GCTCGCCGTT
CTGGGCGCGC TGGGCGCAGT CGCGATAGGC GCGCTCGCCA CGGCGCTTTT GGTGGCGCGG
CGGCGTGCGG GTCGTCGCTA G
 
Protein sequence
MAPQPTVPTC PVSRAGRLRA FFVLIGLVAL CLLAFRSPAF AEEAQPDPSD DAPTPRAAMD 
VTDKVSVDGV KLQKKSGDSW LDIPAGTTLT SGAYVRIYID WSIPDMTDVH AGDTFTFTVN
GDDHFLASDF GPVNLVDPTT QKVIGSYVVN GNRDANGSMI PGQDITIVTT LSDDGAQFPS
LHNGFFSLEG YVTGQGNDIV FTVNGQALPS IGVEPPTTGP LPDTPLMKYG SQVAGQNQIV
WSIGVNLDNI VDAYANYAVG GSDPSPQQRN LLLTDDLQGG QAITSGGVTV YMPVVATTDA
GEAQTEQYAA YPITDLFALT DKSAADGMTQ DEFASSVRMA AVPTIGVWEN RVVYIGFGNV
PTGNGSSPLN IGNILDGQGE AGLSWLLEEK GATPAQKAII MKYFSASGPN KGDFTSFIVE
LRSDASETGQ YENNATLTYG DSGSEAAPGS AHFTVISGGV EVKDGKAVLK KVDAGNPDAT
LPGAVFRLEK MQPDNTWATV AGSEQLTTDG SGLITVTGLL LGQYRFVEMV PPVGYEMTSE
SVEFAITSTT LNHTANVTAE NRRAPVLGKV VLTKVDADDE SVKLPGAVFK LEEQAADGSW
VEVPGYESLV TDGSGLIEVE RLAMGAYRFV EVSAPEGYEL ETAPVEFTLA KDAPGLEVAV
TATNTKSPVF GKAVLKKVDA EAPDAVLPGA TFKLEQRLAD DSWEIVPGHD ALVTDAGGLI
EVADLAVGSY RFVETAAPEG YVLDDTPREF AIDAAAPAPI VVNLTAANAK EPPAPLGKVV
LTKVDADSPT TVLPGAVFKL EAQTADGSWE PVPGSQRLTT DADGLIEVTD LPMGAYRFVE
LSAPEGYELE TVPVEFTLAE DAPGLAVAVT ATNVKTPVLG GALLTKVDAD DATTVLAGAV
FKLEQRLSDG TWAVVDGFDA LATNDEGIIE VHGLPVGSYR FVETAAPEGY VLDETPVEFG
VEVGQPEQVE LTVENAPVPP DPVDPVDPND PTDPTDPAKP VDPAAPGTPS TSQPSTPNAP
GKTSSASSIA RTGDAVPLAV LGALGAVAIG ALATALLVAR RRAGRR