Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1981 |
Symbol | |
ID | 8416292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2321102 |
End bp | 2322166 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645024958 |
Product | protein of unknown function DUF523 |
Protein accession | YP_003182334 |
Protein GI | 257791728 |
COG category | [S] Function unknown |
COG ID | [COG1683] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGATAG CCGTGAGCGC ATGCCTGCTG GGGGAGCCGT GCCGCTACGA TGGGAAGTCG CGTCCCTGCG AGGACGTGCT GAAGCTGCAC GACGCTTGCG AGATGGTTCC CGTGTGCCCC GAGGTGCTGG GCGGGCTGCC TGTGCCGCAT GCCCCCTGCG AGATCGCCGC CGCCGAGCGC GCGCTGCGCG TGACGGACGC GGACGGGGTC GACGTCACGG ACGCGTTTTT GGCCGGAGCC GCCAAGACGG TGGAGCTGGC GCAGGAGCAG GGGTGCAAGC TGGCCGTGCT CAAGGCGAAG AGCCCTTCGT GCGGATGCGG CCTCGTGTAC GACGGCGCGT TCGCCGGCGA GCTCGTGCCC GGTTACGGCG TGGCGGCGCG CGCCTTGCGC GAGGCGGGCG TGCGGGTGCT CGACGAGGTG CGGTTCGCGG CTTGCGTTCG GGCCGGCGAG GCGCGGCATC CCGGTTGTCC GCCGGCGATT CTGGCGGTGA CGTCGGGGGA GTGCCCTGCG CTCGAGACCG AGCGCCTCGT GCTGCGCCCG TTCGTTTCCG ACGATATCGA CGACGTGTAC GCCTACTGCA GCGACCCCGC CGTCGGGCCC GATGCCGGAT GGGCCCCGCA CCGCACGCGC GAGGACTCGC GCATGTTCGT GGAGGTGATC GCGAGCGAGC CTCATGTGTT CGGCATCTTC GAGAAGACGG GCGCGGGGAC GGGAGCCACG GGGCCGTGCA TCGGGTCGAT CGGCCTCATC CGCGATCCGC AGCGGCGCAA CGTCGACTGC CTCATGCTGG GCTATGCGCT CGCGCGCACG GCATGGGGGC GGGGCTGCAT GACCGAGGCG GCGGACGAGA TGCTGCGCTA CGGCTTCGAG GAGCTGGGGC TTGGCCTGAT CACGTGCACG CACTACACGT TCAACGATCG TTCGCGCCGC GTGATCGAGA AGGCGGGCTT CGTCCACGAG GGCACCATCC ACGGCGCCGA AGCAACCCCC GACGGCCTCA TGCAGGACTT CGAATCGTAC TATCTTCCCC GCGAGCTTTG GGATGAGGCG AAGGGACGGG GCTAA
|
Protein sequence | MPIAVSACLL GEPCRYDGKS RPCEDVLKLH DACEMVPVCP EVLGGLPVPH APCEIAAAER ALRVTDADGV DVTDAFLAGA AKTVELAQEQ GCKLAVLKAK SPSCGCGLVY DGAFAGELVP GYGVAARALR EAGVRVLDEV RFAACVRAGE ARHPGCPPAI LAVTSGECPA LETERLVLRP FVSDDIDDVY AYCSDPAVGP DAGWAPHRTR EDSRMFVEVI ASEPHVFGIF EKTGAGTGAT GPCIGSIGLI RDPQRRNVDC LMLGYALART AWGRGCMTEA ADEMLRYGFE ELGLGLITCT HYTFNDRSRR VIEKAGFVHE GTIHGAEATP DGLMQDFESY YLPRELWDEA KGRG
|
| |