Gene Elen_0904 details

Gene Information

Locus tag: Elen_0904
Symbol:
ID: 8415194
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1102992
End bp: 1104311
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 66%
IMG OID: 645023869
Product: hypothetical protein
Protein accession: YP_003181266
Protein GI: 257790660
COG category:
COG ID:
TIGRFAM ID:
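
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and do not come from the source database.

```python
# Values copied from the record above.
start_bp = 1102992
end_bp = 1104311

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1320
print(protein_length)  # 439
```

Both results agree with the reported Gene Length (1320 bp) and Protein Length (439 aa).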


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACAGCA GGCAGCCCGA GCTCTCGCGG CTCACCCTCA TGTGCGGCGA CCACGAGGTC 
GCCGAGTTCA CCTGGAACCA TGACCGCCAG GCCGTGACCG GCAAGACCCA CGTCCTCGAT
GCGGCGCATG CCCCCCTTAT GGCGACCGAC CCATCGGGCA ACATCACGCG CGACAGGCTC
TCGAGCTGGT TCAAGAACAG GGGCATCCCC GATTTCCGCC CCGATGCCGC GGACCGCCTT
CGCGCCGTCG GCTACCCGTC GGCCGCCTCC CTCATGGCCT CTGGTTTCGG CGCATCGCTG
TCCGACCAAT ATTGGATCCG CCCCGCCGGA TCAGCATCTA CATGGCGCGA CGTCAACTGC
TTCGAGAACG ATTTCAGCGA AGAGCTCGGC GAGCTGCTTC TTCCGCACGA CGCCTCGTCG
GTCCCCTCCC TCATCGAGAA GATCAGGGGC AACGCCGACC TCCTCGCCTC ATCACCCGAT
GCGGCCCTGA ACGGCAACCT GCCGAAACGC TGGACGATCG AGGGCGGGCA GCGGATGCTG
GTCAAGTCCG GGCGATCCTC CGGCAGGTTC CAGGAACCGT TCAACGAGAA GATCGCCAGC
GTCCTGTGCT CGCGGCTGCT CGACGAGGGC GACTATGTCG CCTACGAGCT CGAGGACGGC
GGCTTCATGA AGTGGACCTC CCGCTGCAAA CCCATGACCG ACCAGGTCAC CGAGTTCGTG
CCGGCCTGGG CGCTGCTCTG CTCGTCGAAA CGCCCCTCGG ACCTCGGGCT GCACGATTTC
TACGTGTCCG CCTGCGCCGC CCACGGGCTC GACGTGCGCG AGGACGTGGA GAAGATGCTC
GTCATCGACT ACCTCATGGC GAACTTCGAT AGGCACTGGA ACAACTTCGG CGTGCTCATC
GACAGCGAGA GCAGGGAATG GCTCCGCGCG GCGCCGGTGT TCGACACCGG CGAAGCCCTC
TGGTGCGACC GCGAGCTCTC CCAGCCCTTC GACGGCTACA CGACCCCGCG GGCCGGCATG
ATGCGTCCCT TCGCCCGCAA GATCGACGAA CAACTCGGCA GATATTGCCG CGACCTGTCC
TGGTTCGACC CGTCCGGGCT CAAGGGCTTC TCGGAGGAGG CCTGCGACAT CCTCCTCGGC
AACCCGTTCA TCGCCAACGA GCGCGGCAGA ATCGACAAAA TCAGGGAAGC GATCGACCTG
CGCGCCATGA TGCTGACCAG GCACGCCCAT GAGATCAGCG GCGGGCGCGG GTTCATGGTT
CCCGACATGG GCGCCGTTTG CGCGACCGGA GCGGCGAAGC GGACGGGGCT CCATCTGTAA
 
Protein sequence
MDSRQPELSR LTLMCGDHEV AEFTWNHDRQ AVTGKTHVLD AAHAPLMATD PSGNITRDRL 
SSWFKNRGIP DFRPDAADRL RAVGYPSAAS LMASGFGASL SDQYWIRPAG SASTWRDVNC
FENDFSEELG ELLLPHDASS VPSLIEKIRG NADLLASSPD AALNGNLPKR WTIEGGQRML
VKSGRSSGRF QEPFNEKIAS VLCSRLLDEG DYVAYELEDG GFMKWTSRCK PMTDQVTEFV
PAWALLCSSK RPSDLGLHDF YVSACAAHGL DVREDVEKML VIDYLMANFD RHWNNFGVLI
DSESREWLRA APVFDTGEAL WCDRELSQPF DGYTTPRAGM MRPFARKIDE QLGRYCRDLS
WFDPSGLKGF SEEACDILLG NPFIANERGR IDKIREAIDL RAMMLTRHAH EISGGRGFMV
PDMGAVCATG AAKRTGLHL
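
The gene and protein sequences above are related by translation table 11, and the GC content field is derived from the gene sequence. The sketch below shows one way to reproduce both from the text as displayed (including the spaces between 10-letter blocks). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, not part of any database tool. It relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in the permitted start codons; alternative start codons are not handled here, which is harmless for this gene because it begins with ATG.

```python
# Hypothetical helpers for illustration; not taken from the source record.
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as shown in the record.
first_line = "ATGGACAGCA GGCAGCCCGA GCTCTCGCGG CTCACCCTCA TGTGCGGCGA CCACGAGGTC"
print(translate(first_line))  # MDSRQPELSRLTLMCGDHEV
```

The printed peptide matches the first 20 residues of the protein sequence. Applying `gc_percent` to the full gene sequence gives a value that can be compared with the reported GC content of 66%.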