Gene Information |
Locus tag | Elen_1934 |
Symbol | |
ID | 8416239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2265914 |
End bp | 2267572 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645024905 |
Product | protein of unknown function DUF88 |
Protein accession | YP_003182287 |
Protein GI | 257791681 |
COG category | [S] Function unknown |
COG ID | [COG1432] Uncharacterized conserved protein |
TIGRFAM ID | |
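The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(2265914, 2267572)
print(length, protein_length_aa(length))  # prints: 1659 552
```

Under these assumptions the computed values agree with the Gene Length (1659 bp) and Protein Length (552 aa) fields of this record.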
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000288494 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.0274849 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGATATCA AGCAATCGTC TGAAAAGCGG TTCGCACTTT TGATCGATGC CGACAACGTG TCGGCGAAAT ATATAAAGCC CATCACCGAC GAGCTGTCGA AGTACGGCAC CGTCACCTAC AAGCGCATCT ACGGCGACTG GACGCTCACG CTCCATGCCA AGTGGAAAGA CGCGCTGCTG GAGAACTCCA TCACGCCCAT CCAGCAGTTC GGCTACACTC AAGGCAAGAA TGCCACCGAC TCGGCCATGA TCATCGACGC CATGGACATC CTGTACACGC GCTCGGTGGA GGGCTTCTGC ATCGTGTCGA GCGACAGCGA TTTCACGCGT CTGGCCAGCC GTATCCGCGA AAGCGGCCTC ACGGTCATCG GCATGGGCGA GAAGAAGACG CCCACGCCGT TCAGAAAGGC ATGCGACATT TTCACCACGC TGGAGCTTCT GCTGGGCGAC ACGGGCGGCA AGTCGGGCGG ACGCAACAGG AACCGTCACG ACCAGGGCTC GTCGTCGAAC GGCCAGGGCG CTGGAACCAC CACCATGAGC AAGGACGAGA TCGAGCAGGC CGTGGTGAAC ATCATCACGG ACAACCAGAA CAACGGCAAG TCGACGGGGC TCGGAGAGGT GGGCAGCCGT CTGCTGAAGC GCTACCCCGA CTTCGACGTG CGCAGCTACG GCACGAACCT GCTGTCGAAG CTGCTCGACG AGTTCGCCAG CGTGCAGATC ATCAAGGACG GCAGCTCGGT GGCCGTGGTG CTGGCTGAGG GTGCGAACGC TCCGAAGGAC GCTTCCCCGG AGGCAGAGCA GGCACCCGAG ACCAAGCAGG CCGACGACGT GAAAGACGCG CCGGTTGCAG AATCTGAAGG ATCGACGGAC GCACAGGGCG CCGCTGAGGC GAAGCCGGTC GAGAAGAAGC CCGCGTCGCG CCGTCAGCCG CGTCGTCGCA AGGATCAGGT CGCGGCGCAG CAGGGATCCG AGGCTACCGA GGAGAAGCCG GTTCAGGAAC CGGAACTCTC AGCGGAGCCC GCAGGCGAGC AGCATGATCG TCTGGCCGAA CCTGTCGTCG AAGCCGAGCC TGCCGAGCAG CCTCCCTCGG ACAACCGTCC CGGACGTGCC GCCCGCATGC GCGCGGCTGC TTCTCGCTCG AGAGGCTCGG AAGGGCGCAA GCAGGCGGGG AAGAAGCAGA CCGAGAAGGG TGAGCGCTCG GACGGCGAGG TGCCTGCCCA GGCCGCCGCG CCAACCGAGG AGCAGAAGCC CGCTGCGAAG CCGAAGCGCA AGCCCGCGAG GGCGAAGGCC CCCAAGGCCG AGCAGCCGGT CGCCGAGGCT ACGGCGACGC AGGAGGAGCC CGTCGGGGAA GCGCCGAAGC GGGAGTCCGA AGCGCCCGCG AAGCGCGCAC CGAAGCGTCC AGCCAAAGCG ACTGCGAAGG CTGTCGCCGA AGGCGCCGCT GCCCCGTCCG ACCCCGAGGC GTTCATCCGC CAGACCGTGG CCGCCGCCGA GCCGGAGGGG ATCGCGCTGT CCGTGCTGGG CAAGCGCGTG CGCGGCAAGT TCCGCACGTT CAAGCTGCGC GATCTGGGCT ACGCGCAGTT CAGGCCCTAT CTCGACGACC TGGACGGCAT CAAGGTGGAG CAGCGCGACG GCCAATCCTA CGCCCGCCTC GACCGATAA
|
Protein sequence | MDIKQSSEKR FALLIDADNV SAKYIKPITD ELSKYGTVTY KRIYGDWTLT LHAKWKDALL ENSITPIQQF GYTQGKNATD SAMIIDAMDI LYTRSVEGFC IVSSDSDFTR LASRIRESGL TVIGMGEKKT PTPFRKACDI FTTLELLLGD TGGKSGGRNR NRHDQGSSSN GQGAGTTTMS KDEIEQAVVN IITDNQNNGK STGLGEVGSR LLKRYPDFDV RSYGTNLLSK LLDEFASVQI IKDGSSVAVV LAEGANAPKD ASPEAEQAPE TKQADDVKDA PVAESEGSTD AQGAAEAKPV EKKPASRRQP RRRKDQVAAQ QGSEATEEKP VQEPELSAEP AGEQHDRLAE PVVEAEPAEQ PPSDNRPGRA ARMRAAASRS RGSEGRKQAG KKQTEKGERS DGEVPAQAAA PTEEQKPAAK PKRKPARAKA PKAEQPVAEA TATQEEPVGE APKRESEAPA KRAPKRPAKA TAKAVAEGAA APSDPEAFIR QTVAAAEPEG IALSVLGKRV RGKFRTFKLR DLGYAQFRPY LDDLDGIKVE QRDGQSYARL DR
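To illustrate how the gene and protein sequences relate, the following minimal sketch translates a DNA string codon by codon. It assumes the usual compact codon layout (bases ordered T, C, A, G) and that an initiator codon such as GTG is read as methionine, as is common for bacterial translation table 11. The `START_CODONS` set is a deliberately small assumption, not the full list of table 11 initiators, and `translate` is a hypothetical helper.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in T, C, A, G order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a subset of initiator codons; each is read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon encodes Met regardless of table
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
print(translate("GTGGATATCA AGCAATCGTC TGAAAAGCGG"))  # prints: MDIKQSSEKR
```

The first ten residues match the start of the protein sequence above. The gene is annotated on the minus strand, but the sequence as listed begins with GTG and ends with TAA, so it appears to be given in coding orientation and no reverse complement is applied here.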