Gene Elen_0943 details


Gene Information

Locus tag: Elen_0943
Symbol:
ID: 8415233
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1147528
End bp: 1149225
Gene Length: 1698 bp
Protein Length: 565 aa
Translation table: 11
GC content: 45%
IMG OID: 645023907
Product: hypothetical protein
Protein accession: YP_003181304
Protein GI: 257790698
COG category: [S] Function unknown
COG ID: [COG4938] Uncharacterized conserved protein
TIGRFAM ID:
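
The start and end coordinates, gene length, and protein length listed above agree with one another. A minimal sketch of that arithmetic follows. It assumes 1-based inclusive coordinates and a single terminal stop codon, neither of which the record states explicitly. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length_bp = gene_length(1147528, 1149225)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1698 565
```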


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000190881
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.00360431
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTCGCTG GTTTTAAGAT TAAGGAAATT GAGCTCCGTT CAGAAGAAGG AACTCGGTTC 
AGCCCAAAAG CACTAACAAT CATTGTTGGG CCAAATAATG CTGGGAAAAG TAGGTTTCTG
AAAGAAGTTC GGTCTGCCCT TTTGGGTAGA TTGAGCGACG AAGACGGCGG CCTAATCCTG
GGTCGGAAAA TAATAAGCAC TATTGAATTG CTGCTTCCCG AATCGACGGA CGCTCTATTT
GAGTGGTTTG ATCTGGATCG AAAAGTGGTT CGCGATGAAA ATGGAAACTA TGGTGTAAGA
GAGTACTGCA ATACTGGAAT TAATATTAAT CAATATGGTC AAATAGTTCG AGAAGAGTGT
GCGACAAAAT ATCAGGGACA GTGGAAGGAT GTAATCGAAT CTCATTATGC CTCTCCCAAT
GATGCGCGTG CTTTAGATGC GCTTCTCAAT TTCATAGGGC CACTGCTTGT CGGCTATTCT
GGCACAGAAG ATAGATTGAT TCTTTCCGCT GGAGAACCGT ACTACGGCGT GGCAGATTCA
AATACGAATT TTCTTTCACG AGTTCGATCC CAAGATCAAA TCCTTGATGA CTTGTCGGAA
ATTTCTAAAA GATTATTTGG GAAAGATGTC GTACTCGATG ACGTTACAAA AGGTGGAATG
ATCCAATTCA AGACTGGTTC GGATTTTTCG AGTTACAGAA CAAGTGCTCG TGGAACCTCA
GATTTCGAAT TCCTGTTAGA GCAGGGTGTT TCCTTGAAAG ACGAAGGAGA TGGGTTTCGA
AGCTTCGTTT CTGTATATCT TGCTCTTAGG TCGGGAGACA AACCTGTAGT TCTAATTGAC
GAGCCAGAGT CTTTTCTTCA TCCGCCTCAA GCTTACGAAC TGGGGAAGGT GATTGGCTCT
TCAGCTGAAC AATGCAGCCA AATGATCATA GCGACGCATA GTACGCATCT TCTTAATGGC
ATCATGTCAA CATGCGACTG GGATAACTGC GACATTTTAA GACTACAGCG CGACGGAGAC
TCTCTTCGAG CGAACTTGCT TGACCGCGAG GGCTTGGATA GGGTGAAAGG GGATCCTCTG
CTGAGAAGCA CGCGTCTTTT GGAGGGCGTT TTTACGCGTG TCGTAGTTGT AGTGGAGTCG
GAGTCGGATG AGCTTGTGTA TCGGGAAATC CTGAATAAAG TGGGCGTAGC CGACGAGGCG
TTCTTTGTTA ACGTGCACAG CAAGGACAGA ATTGCATTCG CGGTGGAATT TTATAAGAAC
GTTGGCGTTC CCTGCTGCGC AGTGATGGAT TTTGATATTT TGAATGATAA GAATAAGTTC
AAAAGAGTCT TGAAGTGTTT TGAATGTGAC CCTAGCGGAA GGCTTTCCCA GATAGCTCAA
GAGACAAGAG ACGCTATAGA ATGCGATGCG GGAAAGCCCG AAGAGACAAA GCTACGGTAC
AAACGCGATC CTTTGATGTA TCTAGATAAG ATTGAGAATG AAGTTGAAGA GTTGCTAGAT
CGATGCTTGG AGTGCGGTTG TCTTATTGTG AGGACGGGCG AACTCGAAAC CGTTTTTGGA
GAAAAGGTAG CCTATCGATC TTCGAAACGG GCTTGGCTCT CCGAAGCCCT GGATTATCTG
AACCATTTGG AGCCTGGCGA ATTAACTTCT CTTGCGATCG TTTCCGATCT CATAAAGATG
TTGCAGGTTG CGAAATAG
 
Protein sequence
MVAGFKIKEI ELRSEEGTRF SPKALTIIVG PNNAGKSRFL KEVRSALLGR LSDEDGGLIL 
GRKIISTIEL LLPESTDALF EWFDLDRKVV RDENGNYGVR EYCNTGININ QYGQIVREEC
ATKYQGQWKD VIESHYASPN DARALDALLN FIGPLLVGYS GTEDRLILSA GEPYYGVADS
NTNFLSRVRS QDQILDDLSE ISKRLFGKDV VLDDVTKGGM IQFKTGSDFS SYRTSARGTS
DFEFLLEQGV SLKDEGDGFR SFVSVYLALR SGDKPVVLID EPESFLHPPQ AYELGKVIGS
SAEQCSQMII ATHSTHLLNG IMSTCDWDNC DILRLQRDGD SLRANLLDRE GLDRVKGDPL
LRSTRLLEGV FTRVVVVVES ESDELVYREI LNKVGVADEA FFVNVHSKDR IAFAVEFYKN
VGVPCCAVMD FDILNDKNKF KRVLKCFECD PSGRLSQIAQ ETRDAIECDA GKPEETKLRY
KRDPLMYLDK IENEVEELLD RCLECGCLIV RTGELETVFG EKVAYRSSKR AWLSEALDYL
NHLEPGELTS LAIVSDLIKM LQVAK
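
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is illustrative rather than the method used to produce this record. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard genetic code, and it does not model alternative start codons, which is enough here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; translation table 11 uses the same
# assignments as the standard code for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and final 18 bases of the gene sequence in this record.
print(translate("ATGGTCGCTG GTTTTAAGAT TAAGGAAATT"))  # MVAGFKIKEI
print(translate("TTGCAGGTTG CGAAATAG"))               # LQVAK
```

Passing the full gene sequence to `translate` in the same way allows a comparison with the complete protein sequence listed in this record.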