Gene Elen_1684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1684 
Symbol 
ID8415983 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1987548 
End bp1988603 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content70% 
IMG OID645024651 
Productselenide, water dikinase 
Protein accessionYP_003182039 
Protein GI257791433 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0709] Selenophosphate synthase 
TIGRFAM ID[TIGR00476] selenium donor protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.815145 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGA CAGAGCGCAT TCGTCTCACC CGCCTTACCG AGAAGGGCGG TTGAGCGGCG 
AAGTGGGGTC CGGGAGACCT CGAAGAGATA CTGAAGGACA TCGCGCCGCC GCCCGACGCA
GATTTGCTGC TGGGCTTCGA CACGTCCGAC GACGCCGCCG TCTACCGTCT GAACGACGAC
ACGGCCGCGG TGCTGACGCT CGATTTCTTC ACCCCGGTGG TGGACGACCC CTACGAGTTC
GGCGCCATCG CCGCGGCGAA TGCGCTGTCC GACGTGTTCG CCATGGGGGC GAAGCCTCTG
ACGGCGCTCA ACATCCTCGC GTTCCCGTGC AGCCTGGGCA CCGACGTGGT GGCCGACGTG
CTGCGCGGCG GCGCCGACAA GGTGCGCGAG GCGGGCGCGT TCGTGGTGGG CGGCCACTCC
ATCGAGGACG ACGAGCCGAA GTACGGCCTG TCGGTGTTCG GCACCGTGCA CCCCGACTGC
ATCGTGCGCA ACGGCGGCGC ACAGCCGGGC GACGCGCTGT TCTACACGAA GGTCCTCGGC
TCGGGGATCA TGAACTCGGC GTTCCGGGCC GGCTTCGAAG ACGACGAGGG CATGCGCCCC
GTCATCGCGT CCATGATGGA GCTCAACAAG GCGGGCTCCG AGGCGATGGC GGCCGCGCAC
GTGCACGCCG CCACCGACGT GACGGGCTTC GGGTTGGCCG GGCACCTCCA CGAGATGCTG
GACGCCTCGG ACGCGTCGGC CGAGCTCGTC TGGGACGACC TGCCGCTGTT CGAGGGCGTC
TACCGGTATT CCTGCGACTT CTGCCGCCCC GCCAAGACGT TCGGCATCAT CGACTGGGCG
CGCGCCTTCG TGAGGCAGGG CGGCCTCGGA GACGAGGAGT TCGAGAACCG CATGGGCGTG
CTGTGCGACC CGCAGACGTC CGGCGGCCTG CTGGTGGCCG TGGCGCCCGA CGAGGCCGAC
GAGTTCGCGC GCGCGTTCGA AGCGGCCGCC GGTCGCGCGC CCGCGCTGAT CGGCCATGTT
CGCGACGGCG CGGCGGGCGA GATCAGCATG AAGTAG
 
Protein sequence
MSETERIRLT RLTEKGGUAA KWGPGDLEEI LKDIAPPPDA DLLLGFDTSD DAAVYRLNDD 
TAAVLTLDFF TPVVDDPYEF GAIAAANALS DVFAMGAKPL TALNILAFPC SLGTDVVADV
LRGGADKVRE AGAFVVGGHS IEDDEPKYGL SVFGTVHPDC IVRNGGAQPG DALFYTKVLG
SGIMNSAFRA GFEDDEGMRP VIASMMELNK AGSEAMAAAH VHAATDVTGF GLAGHLHEML
DASDASAELV WDDLPLFEGV YRYSCDFCRP AKTFGIIDWA RAFVRQGGLG DEEFENRMGV
LCDPQTSGGL LVAVAPDEAD EFARAFEAAA GRAPALIGHV RDGAAGEISM K