Gene Information |
Locus tag | Elen_1687 |
Symbol | |
ID | 8415986 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1989955 |
End bp | 1991094 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 645024654 |
Product | cysteine desulfurase family protein |
Protein accession | YP_003182042 |
Protein GI | 257791436 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01977] cysteine desulfurase family protein |
|
|
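The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the reported 1140 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not contribute an amino acid.
    return codons - 1 if has_stop else codons


length = gene_length(1989955, 1991094)
print(length)                  # 1140
print(protein_length(length))  # 379
```

Both results agree with the Gene Length and Protein Length fields in the record.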
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.682432 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCTACT TCGACAACGC GGCCACCACC GCCGTCAAGC CGCCCGAGGT GGCCGAGGCG GTGGCGCGGG CCGTCAACAG CTTCGGGGGC GTGGGCCGCG GTGTGCACGA GGCGTCGCTC GACGCGGGCT ATGCCGTGTT CCGGGCGCGC CAGCAGCTGG CTCGGCTGTT CGGTGCGGCC GATCCGTCGT GCGTATCCTT CGCCAGCAAC GCCACCGAGG CGCTGAACAC CGCCATCGCC GGGCTCGCGC GGCCGGGAGA CAAGCTGGTG ACCACGGCCG CCTCGCACAA TTCGGTGCTG CGCCCGCTGT ACCGTCTGGC GGACGAGCGC GGTTGCGAGG TGGTCGTGGT GCCCCACGAC GCGCGCGGCG CGCTCGACTA CGACGCGTTG GAGGCGGCGC TTCCGGGAGC GCGGCTGGCG GCGGTGACCC ATGCTTCCAA CCTCACCGGC GACGTGTACG ACATCGCCCG CATCGCGCGC CTGTGCCGCG AGCGCGGCGC GCTGCTGGTG GCGGACGCGG CCCAGACGGC GGGCGTCGTG CCCATCGACA TGGGGCGCGA TGGGCTGGAC GTCGTGGCGT TCACCGGGCA CAAGAGCCTG TACGGCCCCC AGGGCACGGG CGGCCTCGCC GTGGCGGAAG GCGTGGAGAT CGAGCCGCTG AAGGTGGGCG GTTCGGGTAC GCACAGCTAC GACCGGCATC ATCCCGCGCG CATGCCCGAG CGCCTGGAGG CGGGCACGCT GAACGCCCAC GGCATCGCCG GCTTGAGCGC GGGGCTGGCC TATATCGAGG AGCGCGGCGT GGAGGAGCTG GGCGCGCAGG TGCGCGCGTT GGCCGAGCGC TTCGAGCGCG GCGTGCGCGG CATCGACGGC GTGCGCGTGC TGGGCGGGGG CGGCGACGCG GGGCGTTGCG GCATCGTCGC GCTCAACGTG GGAGATGCGG ACTCCGCGGC GATCGGCGAC GCGCTCAATG CCGAATTCGG CATCTGCACG CGCGCCGGCG CCCATTGTGC GCCGCTCATG CACGAGGCGC TGGGCACGCA GAGCCAGGGC GCCGTGCGGT TCAGCTTCAG CAGCTTCAAC ACCGAGGACG AGGTGGACGC CGGCATCGCC GCCGTGGCCG CCATCGCCGA GGGGGCCTGA
|
Protein sequence | MIYFDNAATT AVKPPEVAEA VARAVNSFGG VGRGVHEASL DAGYAVFRAR QQLARLFGAA DPSCVSFASN ATEALNTAIA GLARPGDKLV TTAASHNSVL RPLYRLADER GCEVVVVPHD ARGALDYDAL EAALPGARLA AVTHASNLTG DVYDIARIAR LCRERGALLV ADAAQTAGVV PIDMGRDGLD VVAFTGHKSL YGPQGTGGLA VAEGVEIEPL KVGGSGTHSY DRHHPARMPE RLEAGTLNAH GIAGLSAGLA YIEERGVEEL GAQVRALAER FERGVRGIDG VRVLGGGGDA GRCGIVALNV GDADSAAIGD ALNAEFGICT RAGAHCAPLM HEALGTQSQG AVRFSFSSFN TEDEVDAGIA AVAAIAEGA
|
| |
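The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in plain Python. This is a minimal illustration rather than the database's own method. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG and ends with TGA even though the gene lies on the minus strand), and it relies on translation table 11 assigning the same amino acids to codons as the standard code. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The helper names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGATCTACT TCGACAACGC GGCCACCACC"
print(translate(first_blocks))  # MIYFDNAATT
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be checked the same way.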