Gene Information |
Locus tag | Elen_1147 |
Symbol | |
ID | 8415437 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1378080 |
End bp | 1379660 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645024109 |
Product | PSP1 domain protein |
Protein accession | YP_003181506 |
Protein GI | 257790900 |
COG category | [S] Function unknown |
COG ID | [COG1774] Uncharacterized homolog of PSP1 |
TIGRFAM ID | |
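The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon; both assumptions agree with the values reported above but are not stated by the record itself. The function names `gene_length` and `protein_length` are illustrative only.

```python
# Values copied from the Gene Information record above.
start_bp = 1378080
end_bp = 1379660


def gene_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1581 526
```

Both results match the Gene Length (1581 bp) and Protein Length (526 aa) fields.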
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.233245 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTCGCA TAGCGCCGAT TAACCTGTAT TACAACCCCA AGACGCTGTG GTTCGACGCC GGCGACCTGG ACGTGCGCGC CGGGGACGGC GTGATCGTGT CCACGGCCCG CGGCACCGAG TTCGGCCGCG CGGCCCACGA CGTGTTCGAG GCCGACGAGG CGCAGATCAA GAAGCTGAAA AGCCCGCTCA AGCCTGTCAA ACGCATCGCG ACGGACGAGG ACGAGGCGCG CGCGGCCGAG CTGGAGGCGA AGAGCCGCGA GGCGCTGCCC GTGTTCAAGG AGATGGCTGC CGAGGGCAAC GGCGACATGC ACCCCGTGTC GGTGGAGTAC CTGTTCGAGG GCGACAAGGC CATCTTCTAC TTCGAGGCGG AGGAGCGCGT GGACTTCCGC GAGCTCGTGC GCAAGCTGGC CGCGCACTTC CGCGTGCGCA TCGACATGCG ACAGATCGGC GTGCGCGATG AGGCCCGCAT GGTGGGCGGC CTGGGGCATT GCGGCCAGGA GCTGTGCTGC AAGCGCCTGG GCGGCGAGTT CTGCCCCGTG TCCATCCGCA TGGCGAAGGA GCAGGACCTC TCGCTGAACC CGCAGAAGAT ATCGGGCGTG TGCGGACGGC TCATGTGCTG TCTGCGCTAC GAGTTCGACG CGTACAAGGA CTTCAAGAGC CGCGCCCCGA AACAGAACGC CACGGTGGAG ACGCCCGATG GGCCGGCGAA GGTGGTGGAT CTCGACGTGC CGCGCGAGAT CGTGTCGCTG AAGATCATGG GCGAGAAGCC CGTGAAGGTG CCGCTGGCCG ACTTCGACCC GCCCGAGGAA GGCTCGAACC GCCCGAACCG CGTAGGCGAG GAGGCGTGGC AGGACGCGAC GACGGCCGAC CCTATCGGAT TTGCGGGCGA GTCGGCGCTG TTCGGCACCA CGACGCAGCT GACCGGGCAG GACAAGCTGG CCGATCCGGG TTCCGTGCGC CGCACGGGCC GCGGCGGTCA GAAGCCGTCG AAGGGCGGCG GCTCGAACGG CGGCCGCGCG GGCGGCGGCC AGAAGGGCGG CGGCAACGGC GGCCAGAAGG GCGGCAAACA GGCCGACGCG CAGGCGCAGA GCGCGCGCAA GCCGCGCAGG AGGCGCTCGA CGAAGGTCGG CGGCGAGGGC GCTGCCGCCC CCGAGGCGGC CGAGACGCAG AAGCGCAAGC AGAAGCAGCA AGGCGGCGGC TCGCCGAAGG GCGGGCAGGG CGGCCAGCAG CAGAAGCGCC GGTCGGGTCA GGGCGGTCAG AGCGGCAACG GCGGCTCCAA GAAGCAGCAG GGCCAGCGCC AGGGAGGCGA GGGCGCGAAG AAGCAGGGGC CGAAGGGCAT GCAGCCCTCG AAGCCTCGTC CCGGCCAGAA GTCCTCAGGC CTGCGCCAGG GCCAGAAGCC GCAGCAGCCG CGCCAGGACA AGGCGCCCCG CCCCGAGCGC TCGGGGGCTC CGAGCGGCGA GGGCGGCCGC CCGACGGGAG ACGGCGGGCA TCGCCGCGCC CGTCGCCGCA GCCACAAGGC GGGCGGCTCG GACGGCGCGG GCGCGCCCGG AGCGGGCGGC GCGGCGCCGA GCGGCGAATA G
|
Protein sequence | MVRIAPINLY YNPKTLWFDA GDLDVRAGDG VIVSTARGTE FGRAAHDVFE ADEAQIKKLK SPLKPVKRIA TDEDEARAAE LEAKSREALP VFKEMAAEGN GDMHPVSVEY LFEGDKAIFY FEAEERVDFR ELVRKLAAHF RVRIDMRQIG VRDEARMVGG LGHCGQELCC KRLGGEFCPV SIRMAKEQDL SLNPQKISGV CGRLMCCLRY EFDAYKDFKS RAPKQNATVE TPDGPAKVVD LDVPREIVSL KIMGEKPVKV PLADFDPPEE GSNRPNRVGE EAWQDATTAD PIGFAGESAL FGTTTQLTGQ DKLADPGSVR RTGRGGQKPS KGGGSNGGRA GGGQKGGGNG GQKGGKQADA QAQSARKPRR RRSTKVGGEG AAAPEAAETQ KRKQKQQGGG SPKGGQGGQQ QKRRSGQGGQ SGNGGSKKQQ GQRQGGEGAK KQGPKGMQPS KPRPGQKSSG LRQGQKPQQP RQDKAPRPER SGAPSGEGGR PTGDGGHRRA RRRSHKAGGS DGAGAPGAGG AAPSGE
|
| |
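The gene and protein sequences above are shown in space-separated blocks of ten characters. A minimal sketch of how one might strip that formatting, compute a GC fraction, and translate the gene follows. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (the tables differ in permitted start codons; this gene starts with ATG, so that difference does not matter here). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses
# the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace between ten-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
prefix = "ATGGTTCGCA TAGCGCCGAT TAACCTGTAT"
print(translate(prefix))  # MVRIAPINLY
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.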