Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2493 |
Symbol | |
ID | 8416817 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2922090 |
End bp | 2923628 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645025474 |
Product | hypothetical protein |
Protein accession | YP_003182837 |
Protein GI | 257792231 |
COG category | [S] Function unknown |
COG ID | [COG3864] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGTCG AAGCCGTCGA GCCGCTTGCC CGCCAAGCGC TCGACCTTGC GAAAAACGCG CTGCTCGTGA ACCTGCGCTT CATGAACGCT GCGTTCGCGC GCCTGCGCCC TTTCCCCGTG CGCGATGCCA CGCTGGCCAC TGACGGCCTT CACATCCGCT TCGACCCCGC AACGCTCGCG CGCCTCTACG CCGCGGAACC GTCCGAACTC ACTCGCGCCT ATCTGCACAT CGTGCTGCAC AACGTGTTCC TGCACCTCTA CCCCGGCCCG CACGTGGACG TTGCGCGCTG GGACGCCGCA TGCGACATCG TCGTGGAACG CACCATCTCC GAACTCGACC TGCCCGCCAC CCGCACCGCT CGCGCCGAGC GCCAGCGCGC GACCCTCGCG CGCATCGACG CGGCGCTGCC GCTGGCAACC GCAGAAACCG TGTACCGGTA CCTGCAGGAT GAGGGGCTCG ACGATGCCGA GCTGGCCGAT CTGCGCGCAC CGTTCTACAT GGACGACCAT GAGCCCTGGT ACCGCCTCGC AGCCGCAGAG GAGGCCCGCA AAACGAATTC GGAAGAGGGC GAACGCGATG AGGGGGCGGA CGCGCAGGAC GGAGCCGCGT CGGCCCAATC CGCCACGAGC GGAACCGACA TGGAGATGCC GTCCGATGCC GACCAGGCGA AGCCGAACCA CAAGTCGCAC CAGGCGGTCA GCCAGGACGA CATCGTGCAG AAAGAGGCTC CCGAACAGGT AGGACGCTCC ATCGACGACC GCTTCGCCGA CACGGTGAAC CTAGATCGGT CCAAGGAGCA GTGGAAGAGC GCGGCCTACG AGATGGGCGT GCAGCTTGAC GCCTACGCGA AGCTGTGGGG CGTCGAGGGC GCGAACCTTG CCATGAACCT GCGCGCCGTG ACGCGCGAGA AGCAGGACTA TCGCGAGTTC CTGCGCAAGT TCGCCCGGAT GGGCGAGCAG ATCCGCGTGA ACGACGACGA GTTCGACTAC GTTTATTACT GCTACGGGCT CAAGCGCTAC GGCAACCTCC CGCTCATCGA GCCGTTGGAA TACGTGGAAG AGCGGCGCAT CCGCGACTTC GTCATCGCCA TCGACACCTC GGCCTCCACC AAGGACGGCC TCGTGCGCCG TTTCATCGAG AAGACGTACG CCATCCTCAG CCAAGAGACC AGCTTCTTCG CCGACATGAA CGTGCTCATC GTGCAATGCG ACGCCGCCAT CACGGACGTC GCGCGCATCT CGAACCTGCG CGATCTGGAC GACTACCTGG ACGGCCTCGA GATCAAAGGA TTGGGCGGCA CGGACTTCCG CCCCGTGTTC GCCTACGTGG ACGACGCCGT GGAGCGCGGC GACCTCGTGA ACCTGGGCGG CCTCATCTAC TTCACCGACG GGCAGGGAAC CTATCCCGCC CGCAAGCCCG ATTACGACAC CGCGTTCGTC TTCGTCGACG ACGCCTCGGC CGCCGCGAGT CCGAACGTCC CTGCGTGGGC CATGAAAGTG GAGCTCGACG AAACCGTCGT ATTGGAAGAA ATGTCCTAA
|
Protein sequence | MAVEAVEPLA RQALDLAKNA LLVNLRFMNA AFARLRPFPV RDATLATDGL HIRFDPATLA RLYAAEPSEL TRAYLHIVLH NVFLHLYPGP HVDVARWDAA CDIVVERTIS ELDLPATRTA RAERQRATLA RIDAALPLAT AETVYRYLQD EGLDDAELAD LRAPFYMDDH EPWYRLAAAE EARKTNSEEG ERDEGADAQD GAASAQSATS GTDMEMPSDA DQAKPNHKSH QAVSQDDIVQ KEAPEQVGRS IDDRFADTVN LDRSKEQWKS AAYEMGVQLD AYAKLWGVEG ANLAMNLRAV TREKQDYREF LRKFARMGEQ IRVNDDEFDY VYYCYGLKRY GNLPLIEPLE YVEERRIRDF VIAIDTSAST KDGLVRRFIE KTYAILSQET SFFADMNVLI VQCDAAITDV ARISNLRDLD DYLDGLEIKG LGGTDFRPVF AYVDDAVERG DLVNLGGLIY FTDGQGTYPA RKPDYDTAFV FVDDASAAAS PNVPAWAMKV ELDETVVLEE MS
|
| |