Gene Elen_0816 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0816 
Symbol 
ID8415106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1014055 
End bp1016580 
Gene Length2526 bp 
Protein Length841 aa 
Translation table11 
GC content65% 
IMG OID645023782 
Producthypothetical protein 
Protein accessionYP_003181179 
Protein GI257790573 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGACT CTAAGCTGCA CCTCATGTCA TTCGCACCCG TCCCCGACAC CAAGACGACC 
GTGCACGTCG TCGGCTTGCC CGACAGCCTG CGCGACGCCC TCTTCACGCT CATGCCCCCG
AGGAAAGAGG GCGGCTACCT CAACACCAAA GTCCTCAAGG ACGACCTGCG CTGCTGGCTC
GACCGGGCCG TCGAGCTGAA CCCCGTACGC CCCAACGTGC ATACCGACAG CTGGCTTATC
GCCCTCGCCC CCATTGATCT CGCGAAGCTA TGCAACGTCA TCGCGGTCTG GATCTCCTCC
CGCAAGGACA TCGATACCCA ATCCCCCGCC TACCGCAAGG TGATGGGGAT GCTGCACCCC
GAGACGTTCG AAGAGGCCGT GCGTGGCGAA GAGATCTGCC TGTTCAGCGG CGACGGCCGT
CCCGCGGGCG GGCTCACCTT CCCCGCATTC TCCGCGCAAG TCGCCGACGT CATCGCCGGC
ATCCCCCTCG AACTCGCCAA CGGCATGGTC GAGAACTTCA GCAGAGTCTC ACGCGGAAAC
GGAAACGTCT ACGAGCTGAT TTCCGATATC CACTGGCACA AGGAGGACCC GTGGGCGTTC
GCCCTCAGAT TCCACGTCGA GACGCTGCCG GTCGGCCGCA AGGCAAGGCT CAACATGGAT
GTCGCCGTCC GCCGCTTCAT CGGCAAGCCA TGGCAGGACG ACCCGTTCCT CAAACACGAT
GTCAACGCCT ACGTCCGCAC GGAAGGCGGC ACGCTCCGCG TCGTCCCCTA CGGCTACGAC
AAACAGAAGC GCGACTTGGC CTGGGACCCC GCGGCCCTGG CGAACTACGA GTTCGCCAGC
GGGACCGGAT TGCCCGCAGT CCGCGAGTAT CTTGAGGACA TGGGCAGGTA CGCCCGCGAC
GGCTCGCAGC CGCAGATCCT ATCCCCCTAC GCGATGACCG CCTCCTGGGC ATCGAAGCCC
TCGGTCGCAA GCGGGGCATC CGTCATCGAC AAGGCCATGT TCTTCGAGGC CGTCGCGGCT
CGGCTCAAAG ATATCGCGGA GCCCGTCGGT GCGCTCGACT CGCTTCAACT GACCCATCTC
AAGGCGTCCA TAGAGGAACC CAGGCAGGCG GATTGGGACA AGGACCCCGT CAGCGCACGG
GCGAGACAGG AGGCCTGGGG CCGGGCCAAC CGCGCCAGAC TCGCCCGCTG CACCGGCAGG
GACAGGGCCG TATTCCAGCT GATCGGCAAC CAGGACGATG CCCGGCTCCT CGATATGGCA
AGGGCCGAGA TCTCCCGCTT TCTCGGCGGG GAGGGGGCCG TCGACGACTT CGAGGTCGAG
ATCGATAACA TCCCGGCAAA CGACCTCCTG AACCGCATGG AGAATACCGG CGACAGCCAG
GCGAAGATCC GCTGGCGCAA AGTCGCCGCC GCGCTTCCCG AGGCGACCGA CCCGACCGCG
TGCATAGTGG TCCTGCCGGG CGCCGAGAGC TACAAGCCCA AGGGCAAAGA CGACGGCGGA
GACCCCAAGC GCGCCCTGCG CATCGCGTTC GCGAAAACCG GGCGCCTGAC CCAATTCATC
GAACCCGAGG ACTCGAAGGA CAGTCCCGAG ATCCGCGCGC GCGTCGCCGT CCGCGACCTC
ATGCGCCAGC TCGGTTTCGT ACCCGAACCG GCACGCAATT CGCGCGGCAT CGACACCTCG
ATCCCGGCGA TAGGGTTGAA GGTCTACAAC TCCGGCAACG GCAAGGCGCG GGCGAGCTTC
CCCTATTGCG TCCGACAGGA CATGAGAAGC GGTGCCGTCA CCGTGTACTG CCCCCTCCTA
CCTGACGGAA GCCTACCGTA CTGGCGCGCC CTCATCGAGT TCGCCCGGCT GAGCGGTTCC
GAGGGCTTTC CGGACTCCTG CAAGCGGGCG AACGGAATGG CCCTGAAGAG GATGCTCCAC
GGCATAGTCC GCGCCACCGG CGACAGGCCG GAACTCCTCC TCATCAACTC CTACGGCCGC
ATCCGGCAGA GAGACTGGTG GCCCGGCATC AGCGACTCCG GCCTCGAGTC GGGACCCTTG
AGCTACGGAC CGACCGGTTA TGAGGAGCCG CTTGGCCTCG CGGGCAGCAA GCTGCGCATC
CTGCGTATAC GCTCCGGTCT CAACGGCGAG ATCCCCGACT GGTTCACGGA TGAGGTGGCC
GCGGGCGGCG CAGACGACGG CACCGTGCCG AACAGGCGCG ACAAGCAGGG ACTCTTCAAG
ATGGACGGCT ACTTCCTCGC GTTGGCCCCG CGACCGGGCG ACGCCCAGTA CAAATGGTCC
GCACGCGGAT CGAAGTACGA TTCGCCCACC GCGGCGTTCT GCGAGAAGAC GATCAACGAG
TACTGCCTTC TGTCGCCCGG TGGCGAGGCC GAGGCGCTTG CCAGCGTCAA ATACGCCGAG
GCCCTGCGCG GCTGCATGGT GCAGCTCTAC AAGAACGATA TGAGGGTGAA CCTTCCCGCC
CCGCTCCATC TCGCCGAGCA GGTCGAGGAG TATATCTGGG ACTGGGAGCT CACCGGCAGA
CGGTAG
 
Protein sequence
MSDSKLHLMS FAPVPDTKTT VHVVGLPDSL RDALFTLMPP RKEGGYLNTK VLKDDLRCWL 
DRAVELNPVR PNVHTDSWLI ALAPIDLAKL CNVIAVWISS RKDIDTQSPA YRKVMGMLHP
ETFEEAVRGE EICLFSGDGR PAGGLTFPAF SAQVADVIAG IPLELANGMV ENFSRVSRGN
GNVYELISDI HWHKEDPWAF ALRFHVETLP VGRKARLNMD VAVRRFIGKP WQDDPFLKHD
VNAYVRTEGG TLRVVPYGYD KQKRDLAWDP AALANYEFAS GTGLPAVREY LEDMGRYARD
GSQPQILSPY AMTASWASKP SVASGASVID KAMFFEAVAA RLKDIAEPVG ALDSLQLTHL
KASIEEPRQA DWDKDPVSAR ARQEAWGRAN RARLARCTGR DRAVFQLIGN QDDARLLDMA
RAEISRFLGG EGAVDDFEVE IDNIPANDLL NRMENTGDSQ AKIRWRKVAA ALPEATDPTA
CIVVLPGAES YKPKGKDDGG DPKRALRIAF AKTGRLTQFI EPEDSKDSPE IRARVAVRDL
MRQLGFVPEP ARNSRGIDTS IPAIGLKVYN SGNGKARASF PYCVRQDMRS GAVTVYCPLL
PDGSLPYWRA LIEFARLSGS EGFPDSCKRA NGMALKRMLH GIVRATGDRP ELLLINSYGR
IRQRDWWPGI SDSGLESGPL SYGPTGYEEP LGLAGSKLRI LRIRSGLNGE IPDWFTDEVA
AGGADDGTVP NRRDKQGLFK MDGYFLALAP RPGDAQYKWS ARGSKYDSPT AAFCEKTINE
YCLLSPGGEA EALASVKYAE ALRGCMVQLY KNDMRVNLPA PLHLAEQVEE YIWDWELTGR
R