Gene Information |
Locus tag | Elen_2535 |
Symbol | |
ID | 8416859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2967174 |
End bp | 2968379 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 645025516 |
Product | hypothetical protein |
Protein accession | YP_003182879 |
Protein GI | 257792273 |
COG category | [S] Function unknown |
COG ID | [COG4260] Putative virion core protein (lumpy skin disease virus) |
TIGRFAM ID | |
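The coordinate and length fields above are consistent with one another, and the relation can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2967174
end_bp = 2968379

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose final codon is a stop codon,
# which is not translated into an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the Gene Length (1206 bp) and Protein Length (401 aa) fields of the record.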
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.944384 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGACTCA TCAAGGCAAT GGGAGGTTCT ACTCGGGGAG TTCTTGCCGA TTCGTGGAGA GATTTCTTCT ACTGCGAGTC GCTTGATGCC TCCACATTGG CGTCAAAAGG ACAGAAGAAG ACAGGGAACC CTGATCGTTC CTCAAACGCC AAGGGAGACG AAAACGTTAT CTCGAACGGT TCAATTGTTG CCATAAACGA TGGCCAGTGC ATGATCATAG TCGAATCAGG AGCTGTCGTT GACCTTTGCG CCGAACCCGG CGAATACCTA TATGAAACGT CAAGCGAGCC GAGCGTCTTC TACGGCCCGT TGGGCGCAAA CGTCAAGAGC ACGTTCAAAG AGATGCAACG TCGTATAGGA TTCGGCGGCA GTCCCGGGAA AGACCAGCGC GTTTACTACT TCAACATCAA GGAGATCGTC GGAAACAAAT ACGGAACCCC TAACCCCGTT CCCTTCCGCG TCGTCGATGC CAATATTGGC CTTGATATCG ACATAGCCGT ACGCTGCAAT GGCGAATATT CGTACAGGAT AGATAACCCC CTGTTGTTCT ACCGCAACGT TTGCGGGAAC GTTGAAACCA CCTACACAAA GGATCAACTG GATTCTCAGT TGAAAAGCGA GCTATTGACC GCCCTGCAAC CTGCATTTTC CCGCATCTCG GCTGCTGGCG TGCGATACAG CAACGTTCCC GCGCATACTC GCGAACTTGC AGCGCTTCTA AACGAAGAGC TCACTGACAC ATGGCGAAGC CTTCGCGGAA TGTCCGTCGT GTCCTTTGGA ATGAATTCTA TTCGAGCTTC GGAAGAGGAC GAACTCGTTA TCAAGCGACT TCAGAGCGCT GCGGTGATGC GCGATCCGAA TATGGCAGCC GCCAATCTGG TAGCCGCCCA ATCCGACGCC ATGCGCATCG CGGCAGGAAA CGCAAACGGA GCAGCTAACG GCTTTATCGG TTTAGGGCTA GCGAACATGA CAGGCGGAAC GGATGCGGGA CGTTTGTTCA CCGACGCAGC GACCAGCTTT CATCATTCCG GATCCTTCAA TCAGCAGAAC TGGACTTGCT CTTGCGGAGT AAAGAACTCG GGGAACTTCT GCCAAAACTG TGGCAAAGAG CGCTGCAGTG ATTCCGCATG GACTTGCCCT TCATGCGGTA CGAGCAGCGC AGGGAACTAC TGCTCGCAGT GCGGCAAAGC CAGAACGCAG CCCTGA
Protein sequence | MGLIKAMGGS TRGVLADSWR DFFYCESLDA STLASKGQKK TGNPDRSSNA KGDENVISNG SIVAINDGQC MIIVESGAVV DLCAEPGEYL YETSSEPSVF YGPLGANVKS TFKEMQRRIG FGGSPGKDQR VYYFNIKEIV GNKYGTPNPV PFRVVDANIG LDIDIAVRCN GEYSYRIDNP LLFYRNVCGN VETTYTKDQL DSQLKSELLT ALQPAFSRIS AAGVRYSNVP AHTRELAALL NEELTDTWRS LRGMSVVSFG MNSIRASEED ELVIKRLQSA AVMRDPNMAA ANLVAAQSDA MRIAAGNANG AANGFIGLGL ANMTGGTDAG RLFTDAATSF HHSGSFNQQN WTCSCGVKNS GNFCQNCGKE RCSDSAWTCP SCGTSSAGNY CSQCGKARTQ P