Gene Elen_1687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1687 
Symbol 
ID8415986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1989955 
End bp1991094 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content73% 
IMG OID645024654 
Productcysteine desulfurase family protein 
Protein accessionYP_003182042 
Protein GI257791436 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01977] cysteine desulfurase family protein 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.682432 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTACT TCGACAACGC GGCCACCACC GCCGTCAAGC CGCCCGAGGT GGCCGAGGCG 
GTGGCGCGGG CCGTCAACAG CTTCGGGGGC GTGGGCCGCG GTGTGCACGA GGCGTCGCTC
GACGCGGGCT ATGCCGTGTT CCGGGCGCGC CAGCAGCTGG CTCGGCTGTT CGGTGCGGCC
GATCCGTCGT GCGTATCCTT CGCCAGCAAC GCCACCGAGG CGCTGAACAC CGCCATCGCC
GGGCTCGCGC GGCCGGGAGA CAAGCTGGTG ACCACGGCCG CCTCGCACAA TTCGGTGCTG
CGCCCGCTGT ACCGTCTGGC GGACGAGCGC GGTTGCGAGG TGGTCGTGGT GCCCCACGAC
GCGCGCGGCG CGCTCGACTA CGACGCGTTG GAGGCGGCGC TTCCGGGAGC GCGGCTGGCG
GCGGTGACCC ATGCTTCCAA CCTCACCGGC GACGTGTACG ACATCGCCCG CATCGCGCGC
CTGTGCCGCG AGCGCGGCGC GCTGCTGGTG GCGGACGCGG CCCAGACGGC GGGCGTCGTG
CCCATCGACA TGGGGCGCGA TGGGCTGGAC GTCGTGGCGT TCACCGGGCA CAAGAGCCTG
TACGGCCCCC AGGGCACGGG CGGCCTCGCC GTGGCGGAAG GCGTGGAGAT CGAGCCGCTG
AAGGTGGGCG GTTCGGGTAC GCACAGCTAC GACCGGCATC ATCCCGCGCG CATGCCCGAG
CGCCTGGAGG CGGGCACGCT GAACGCCCAC GGCATCGCCG GCTTGAGCGC GGGGCTGGCC
TATATCGAGG AGCGCGGCGT GGAGGAGCTG GGCGCGCAGG TGCGCGCGTT GGCCGAGCGC
TTCGAGCGCG GCGTGCGCGG CATCGACGGC GTGCGCGTGC TGGGCGGGGG CGGCGACGCG
GGGCGTTGCG GCATCGTCGC GCTCAACGTG GGAGATGCGG ACTCCGCGGC GATCGGCGAC
GCGCTCAATG CCGAATTCGG CATCTGCACG CGCGCCGGCG CCCATTGTGC GCCGCTCATG
CACGAGGCGC TGGGCACGCA GAGCCAGGGC GCCGTGCGGT TCAGCTTCAG CAGCTTCAAC
ACCGAGGACG AGGTGGACGC CGGCATCGCC GCCGTGGCCG CCATCGCCGA GGGGGCCTGA
 
Protein sequence
MIYFDNAATT AVKPPEVAEA VARAVNSFGG VGRGVHEASL DAGYAVFRAR QQLARLFGAA 
DPSCVSFASN ATEALNTAIA GLARPGDKLV TTAASHNSVL RPLYRLADER GCEVVVVPHD
ARGALDYDAL EAALPGARLA AVTHASNLTG DVYDIARIAR LCRERGALLV ADAAQTAGVV
PIDMGRDGLD VVAFTGHKSL YGPQGTGGLA VAEGVEIEPL KVGGSGTHSY DRHHPARMPE
RLEAGTLNAH GIAGLSAGLA YIEERGVEEL GAQVRALAER FERGVRGIDG VRVLGGGGDA
GRCGIVALNV GDADSAAIGD ALNAEFGICT RAGAHCAPLM HEALGTQSQG AVRFSFSSFN
TEDEVDAGIA AVAAIAEGA