Gene Elen_2266 details

Gene Information

Locus tag: Elen_2266
Symbol:
ID: 8416590
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2662890
End bp: 2664440
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 70%
IMG OID: 645025252
Product: protein of unknown function DUF92 transmembrane
Protein accession: YP_003182615
Protein GI: 257792009
COG category: [S] Function unknown
COG ID: [COG1836] Predicted membrane protein
TIGRFAM ID: [TIGR00297] conserved hypothetical protein TIGR00297
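
The gene and protein lengths above follow from the start and end coordinates. The short Python sketch below works through that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the record does not state either convention explicitly, but both agree with its numbers. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2662890
end_bp = 2664440

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1551 516
```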


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGGCG ACGTGATGGA GAACCTGATC GGCTTGGGCG TCTCGCTCGC GTACGTACTG 
GCGGTGCTGG GAGCGTCCAG CCTGGCCGCG CGGCGTGGCG CGTCGTCGGA GGCAACCCGC
AAGTTCGTGC ATATCGCGTT GGGCGGCTGG TGGCTCATCG CGGCCCGGTT CTTCGATTCG
CCTCTGTGGG CGGCGGCGCT TCCTGCTGCG TTCATCCTTG TGAACGCGTT TGCGTACCGT
CGGCAGAAGC TGTCGTTCAT GGGACGCGAC GGCGGCGAGG ACACGCCGGG CACGGTGTAC
TACGCGGTGT CGCTGACGGC GCTCGCGCTG TTCTCGTTCG GTATCGGCAC GCCGTACGTG
GGCGCGCTCG GCTTCTTCTG TATGGCGTTC GGCGACGGGT TCGCAGCTGT GCTGGGCAAG
CGGTTCGGAC GACGCGTGCT GGTGGGATGC TGCGGGAAGA CGCTGGTGGG AAGCGCGACC
ATGCTGGCGG TGAGCTTCGC TTCCTGCGCC GTCGTGCTGA TGGCGCCGCC GCCTTTCGGT
GCGGGCGGCA TCCTGGGCGC GCCGGGCGGC GCGTTCGCCC CGCTGGGCTC GCTCGCGGCG
TCGCTTCTGG CCGCGGCCCT GCTGGCCGCG GTCGCCGCTG CCATCGAGGC GTTCTCGGTG
GAGGGGCTCG ACAACCTGTT CGTGCCGCTG GGCGTGTCGG CGCTGTACGC GGTGCTGTTC
CTGCCCGCAG CCGCCTACAC GCCCGCGCTC GCGGGATTGC TGCTGTCGGG CGCGGTGGCG
CTCGCGTCGT TTCGGCTGCG GCTGCTCACC GTGGCCGGCG GCCTCGGCGC CGTGGCGGTG
GGCACGCTCG CGTTCGCTAT CGGCGGGTGG CCGCTGTGGC TGCTGCTCAT GTGGTTCTTC
GGCAGCTCGA ACGTCGCGTC GAAGCTGATG GCGCTTTCGG CGGTCAAGCG GAACGGCGGG
GCGCCCGCTT CGCGTAAGCA CAGCGGCCCG CGTACGTTGC GGCAGGTGCT GGCGAACAGC
GTGCCGTTCC TCGCGTGCGC GCTGGCGTAT ACGGCGACGG GGGAGCCGTG GCTGCTGCTT
CTGGCGTCCG GCGCTCTGGC GGCCAGCACG GCCGACACGT GGGCGTCGGA GGTGGGCGTG
TACAGCCGCC GGCCGCCGGT GAACATCCTC ACGCGCGAGC CCATGCAGCG CGGGCTTTCG
GGCGGCGTGA GCCCGTTGGG TCTCGCGGCC ACCGTGGTGG GAGCCGTAAC CTCGGCGTTT
CTGGCCATGC TGCTGTTCCA TGCGTTCGGC TACGCGATTC CCACCGGGCC CGACGCGTTC
TTCTTCATCA TCGCGTGCGG CGTCGTGGGC TCGCTCGTGG ACAGCGTGCT GGGCGTGGTC
ATGCAAGCGA AGTACCGCTG TCCGAACGAC GCTGAGGGAG GGCTTGTGGA AACGCCGCCG
TGCGGGGCCC AGGCCGCGCT CGTGTCCGGC TACGCCTGGG TCACGAACGA TGCCGTCAAC
CTCATGAGCG GCATCGCCGT CGTGCTGCTC GGCCTGCTCG TAGTGGTGTA G
 
Protein sequence
MMGDVMENLI GLGVSLAYVL AVLGASSLAA RRGASSEATR KFVHIALGGW WLIAARFFDS 
PLWAAALPAA FILVNAFAYR RQKLSFMGRD GGEDTPGTVY YAVSLTALAL FSFGIGTPYV
GALGFFCMAF GDGFAAVLGK RFGRRVLVGC CGKTLVGSAT MLAVSFASCA VVLMAPPPFG
AGGILGAPGG AFAPLGSLAA SLLAAALLAA VAAAIEAFSV EGLDNLFVPL GVSALYAVLF
LPAAAYTPAL AGLLLSGAVA LASFRLRLLT VAGGLGAVAV GTLAFAIGGW PLWLLLMWFF
GSSNVASKLM ALSAVKRNGG APASRKHSGP RTLRQVLANS VPFLACALAY TATGEPWLLL
LASGALAAST ADTWASEVGV YSRRPPVNIL TREPMQRGLS GGVSPLGLAA TVVGAVTSAF
LAMLLFHAFG YAIPTGPDAF FFIIACGVVG SLVDSVLGVV MQAKYRCPND AEGGLVETPP
CGAQAALVSG YAWVTNDAVN LMSGIAVVLL GLLVVV
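
The protein sequence can be reproduced from the gene sequence with the codon assignments of translation table 11, and the GC content can be recomputed from the same string. The following is a minimal Python sketch rather than the database's own method. The helper names `translate` and `gc_content` are hypothetical, and the sketch assumes the sequence is pasted in the 10-base blocks shown above, so whitespace is stripped first. Table 11 uses the same amino acid assignments as the standard code and differs only in its permitted start codons, which this sketch does not model.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# Amino acid assignments for table 11 match the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Unknown codons (for example, ones containing N) become 'X'.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("ATGATGGGCG ACGTGATGGA GAACCTGATC"))  # MMGDVMENLI
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared against the protein sequence and the GC content reported in the record.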