Gene Elen_1156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1156 
Symbol 
ID8415446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1389086 
End bp1390297 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content63% 
IMG OID645024118 
Productprotein of unknown function DUF1113 
Protein accessionYP_003181515 
Protein GI257790909 
COG category[S] Function unknown 
COG ID[COG4905] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.703838 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAAGC ACGACCGCGT CGAGAACGGA CTCGACCTCG ACGAAGGCGC GCCTGCGTCC 
AATAGCAAGA TCCCCCTTCC GCTCAAGGTG TTCGGCATCC TGTGCATAGT CAGCGGCGCA
GCGCTCGTGC CGGTGCTCGC GCTGCTCATC GTCGGCATGG TGATGGCCCT GCAACAGGGC
GCCATCGTCG AGGAGCTGTC CGCCGCCGCC CTCGTCATCT TCGTCTCCGA CGCGGTGCTC
ATGACGGTTT TGTCCGCCAT GTTCGTCATC CTGGGCATCC GACTGCTGCG CGACAAACGG
CGTCGCACGG CCCAGATCGC CGAGGTCATG ATCGTCATCC TCATCCTCGT GATCCTCTGC
GACATGATGC TGAGCGGCCT CACGCCCGAT CTCATCCCCT ACGGCGTGGT GCTGGTCGTG
CTCGTCGCGC TATCGAGCTA CGTGGACCCG TCGCTAGCCG AAGAGCGCGA GCTGCGCCGC
AAACTGCGCG ATATGGAAAC GCGCGAGGCG GCCGAGGAGG GCACGCTCGG CCGCGACGAG
ACCGGCAAGG GCTTCATCGC GCTCAACTTC TTCAACCTGT TCTGGATCTT CGTCGTGTGC
TGCGTGCTGG GACTCATGAT CGAGACGGTA TACCACTTCC TCGTCGTGAA TCCAGGGCAC
TACCAGGATC GCGCGGGTTT GCTGTTCGGC CCGTTCTCGC CCATCTACGG TTTCGGCGCG
GTGCTGATGA CGGTCGCCCT GAACCGATTC CACGACAAGA ACGTCGTGCT CATCTTCCTG
GTGAGCGCCG TCATCGGCGG GGCGTTCGAG TATCTGACCA GCTGGTTCAT GCAGTTCGCC
TTCGGCATCG TGGCGTGGGA CTACTCCGGC ACGTTCCTGT CCATCGACGG GCGCACGAAC
GGCATGTTCA TGGCCATGTG GGGCGTCTTG GGCGTGGTGT GGATCAAGCT GCTTCTTCCC
TGGATGTTGA AGCTGGTGAA CCTCATCCCT TGGAACTGGC GCTACGCGGT CACCACGGTG
TGCGCCGCGC TCATGATCGT CGACGGGGCG ATGACGCTGC TGTCGCTCGA CTGCTGGTAC
CAGCGCGAGG CTGGAAAACC TCCCGAGACG GCCGTGGCGC ACTTCTTCGC CGAGCACTTC
GACAACCAGT ACATGGAGAA CCGCTTCCAG AGCATGTCCA TCGATCCGGG CAACGCGACC
CGCGCGAAAT AG
 
Protein sequence
MGKHDRVENG LDLDEGAPAS NSKIPLPLKV FGILCIVSGA ALVPVLALLI VGMVMALQQG 
AIVEELSAAA LVIFVSDAVL MTVLSAMFVI LGIRLLRDKR RRTAQIAEVM IVILILVILC
DMMLSGLTPD LIPYGVVLVV LVALSSYVDP SLAEERELRR KLRDMETREA AEEGTLGRDE
TGKGFIALNF FNLFWIFVVC CVLGLMIETV YHFLVVNPGH YQDRAGLLFG PFSPIYGFGA
VLMTVALNRF HDKNVVLIFL VSAVIGGAFE YLTSWFMQFA FGIVAWDYSG TFLSIDGRTN
GMFMAMWGVL GVVWIKLLLP WMLKLVNLIP WNWRYAVTTV CAALMIVDGA MTLLSLDCWY
QREAGKPPET AVAHFFAEHF DNQYMENRFQ SMSIDPGNAT RAK