Gene Information |
Locus tag | Elen_2083 |
Symbol | |
ID | 8416401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2450385 |
End bp | 2451332 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645025066 |
Product | protein of unknown function DUF552 |
Protein accession | YP_003182435 |
Protein GI | 257791829 |
COG category | [S] Function unknown |
COG ID | [COG1799] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
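The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for Elen_2083.
length = gene_length_bp(2450385, 2451332)
print(length)                     # 948
print(protein_length_aa(length))  # 315
```

Both results match the Gene Length and Protein Length fields reported in the record.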
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.009742 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000000000013242 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGCTGC CAAAGATCAA GAAATCGGAG CACGGAATGC TCGAGGGAAT CAAATCGAAA CTGGGTTTCG CAGACGCCAA CCCGCATTAC GACGACGGCT ACTACGACGA GGGGTTCGAC GACTACAGCG AGGAGTACGG CGAGTACGGT CCCGACTACA ACGAGGACGA TTTCCCCGCC GACGATGCTC CCGGTTCGCG TTATGAGCCC TATGCGCCCG TGACTTCGCG TCCTGCGCGC GCCTCGCACG CGCGCTCCTC GGCGCGCAGC TCGTCCGTGG GATCCGCGAA GCTCGTGTCC ATCGACGACG TGCGCGCGCA CACCCAGGTG CCCGAGAGCC TCAACCGCGA TCCGTTGCCG CCTCGCCGCG TGACGTCGCC TTCAAGCGGC TCCTACCGCG GCGATCGCAC CATGGTGGAA GCGGCGCAGC CCGCCCCGGC GAACACGCCT ATCGCGCGTG CGGCCGCCGC AGCGAACCGC GAGCGCTCCG AGAGCCTGAA CTCGCTGTTC ACCTCCACGT CCGACGATGC GCCGAGCGTT TCTGGGCCTT CTGGCTCGGG CGTCGCGGTG CAAACGGCAA CCACCGCTTC GGGCGCTACC GTGGCAACCG CCACGGCGAC GACTGCGGCG TTCGATCCGT TCGACGCCTA CGCGGGCGCC GGGGCGGTCA AGCACAACCC CTCCCGCTCG GTCACCGTGC TCAAGCCGGC CAGCTACGCC GAGGTCGAGC GCATCGCGAA GGCTCTCAAG GCGGGGGATG TGGTGGTGCT CGCGCTGCGC AACACGCCCG ACAATCTGTC GAAGCGCATC CTCGACTTCT CGTTCGGCGT GTCGAGCGCT CTCGACGCCA GCGTGGACTG CGTGGCCGAC AAGGTGTTCG TCATCTCGCG CGGTGCTGCG CTCACCGATG CCGAGCGCAT GAGCCTGCGC GGGCAGGGCG TGCTGTGA
|
Protein sequence | MELPKIKKSE HGMLEGIKSK LGFADANPHY DDGYYDEGFD DYSEEYGEYG PDYNEDDFPA DDAPGSRYEP YAPVTSRPAR ASHARSSARS SSVGSAKLVS IDDVRAHTQV PESLNRDPLP PRRVTSPSSG SYRGDRTMVE AAQPAPANTP IARAAAAANR ERSESLNSLF TSTSDDAPSV SGPSGSGVAV QTATTASGAT VATATATTAA FDPFDAYAGA GAVKHNPSRS VTVLKPASYA EVERIAKALK AGDVVVLALR NTPDNLSKRI LDFSFGVSSA LDASVDCVAD KVFVISRGAA LTDAERMSLR GQGVL
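The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch of translation under translation table 11. It assumes the gene sequence is listed 5' to 3' on the coding strand (it begins with ATG and ends with TGA even though the gene lies on the minus strand of the replicon) and that the spaces between 10-base blocks are display formatting only. Table 11 uses the same amino-acid assignments as the standard code and differs in which codons may act as start codons, which does not matter here because the gene starts with ATG. The helper names are hypothetical.

```python
# Codon table in NCBI order: bases enumerated as T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and normalize to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in the record.
first_blocks = "ATGGAGCTGC CAAAGATCAA GAAATCGGAG"
print(translate(first_blocks))  # MELPKIKKSE
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would let the Protein sequence and GC content fields be checked the same way.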