Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0562 |
Symbol | |
ID | 8414847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 715135 |
End bp | 716355 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645023534 |
Product | hypothetical protein |
Protein accession | YP_003180936 |
Protein GI | 257790330 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCAACG TATTCAAAGG CGCACTGCTC GCGCTGGTGC GCGAGAAGAG CGTGTTCATC TGGTCGCTGG CGTTTCCCCT GATCCTGTCC ACGATGTTCG TGTTCATGTT CGCGAACCTG GACGAGGCGG GGCAGTTCGA GCCCATCCCC ACCGCGGTGG TGGCCGACGA GAACTACGAC GCGGCGCCGG GGTTCTCGGA GATGATCGAC ACGCTGGCGG AGCCGGGAGC CGACCAGATG CTCGACGTGG CGCGCGTAGC CACCGAGCAG GAGGCGCGCG ATCTCATGAG CGGAAACGAT ACCGCAGGAG CGGGCTACTT CAACATCTCG GGCGATGGAG CGGCCGGGTA TTTCACGGTT GACGCCGACG GCATGCCCAC CGTGCACGTG AAGGCGGGGG TCACGCCCGA CTCGCTGGAC AGCGCCTACC AGTCCATCTT GAAGACCATC GGCGACGGAT ACGTGCGCAA CGCGGCGCTC ATCGAAGACG TCGCCGCCGA GAACCCCGCC GCGCTGGCCG ACATGGCGGC GGTGGAAAAG CTGCTGGACG CCGGCGATCT CACCGAGAAG ATCGACGTCA CGCAGAACCC GCCCAAAGAA TCCGTGCGCT ACTTCTTCGC ATTGCTGGGC ATGGCGGCAC TGTTCGGTGG GCAGATCGGG ATGATCGCTA TCTGCCGCAC GCAGCCGAAC CTGAGCGCGC TGGGGGCGCG GCGCGCCGTG GGAGCGCTCA GCCGCGCGAA GACGCTGACG GCGACGCTGG CCGCCAGCTG GGTGCTGACG TTCGCCTGCA TCGCCATCGC GTATCTGTAC ATCCGGTTCG TCGCCGGCGT GGATTTCGGC GGACGAGATG CGATATGCAT CGCCGTGATC GCCGCCGCGG CCTTGACGGC GACGGCGTTC GGCACGCTGC TGGGCTCGCT GCCGAAGATC GACGAAAGCG TGAAGGGCGG CATGCTGTCC GGCATCGTGT GCTTCGCCTC GCTGTTCGCC GGGCTGTACG GCTCGCCCAC GATGAAGCTG GCCGATACCG TGAACGCGGC GGTGCCCGCG GCGCAGCTGG TCAACCCGGC CGTGCAGATA TCCCAAGCGT TCTACAGCAT CATGTACTAC GACACCTACC AGCGCACGAT CGAGCACATC CTGATCCTGC TGGCCATGGC TGCGGTACTG TTCGCCGCGT CGGCTCTGTT CATAAGGAGG CAGCGCTATG CAAGTCTTTA A
|
Protein sequence | MFNVFKGALL ALVREKSVFI WSLAFPLILS TMFVFMFANL DEAGQFEPIP TAVVADENYD AAPGFSEMID TLAEPGADQM LDVARVATEQ EARDLMSGND TAGAGYFNIS GDGAAGYFTV DADGMPTVHV KAGVTPDSLD SAYQSILKTI GDGYVRNAAL IEDVAAENPA ALADMAAVEK LLDAGDLTEK IDVTQNPPKE SVRYFFALLG MAALFGGQIG MIAICRTQPN LSALGARRAV GALSRAKTLT ATLAASWVLT FACIAIAYLY IRFVAGVDFG GRDAICIAVI AAAALTATAF GTLLGSLPKI DESVKGGMLS GIVCFASLFA GLYGSPTMKL ADTVNAAVPA AQLVNPAVQI SQAFYSIMYY DTYQRTIEHI LILLAMAAVL FAASALFIRR QRYASL
|
| |