Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0655 |
Symbol | |
ID | 8414945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 832878 |
End bp | 834497 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645023630 |
Product | hypothetical protein |
Protein accession | YP_003181027 |
Protein GI | 257790421 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.507478 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.0435014 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAAGA AGCTGATGGT GGTGACCGCG ATGCTGATGG CGGGATCGCT GTTCGCCGCC GGTTGCGCGC AGCAAGGAGC CGCGGAAGGG GAGAGCTCTT CGACCGAGGT TCAATATCAG AACAGCATCG AGGAGGCCGA AGCGTATCTT GACGGGGTGA ACGAGCGTCA GGACGAGCTG AACAAGGAGT ACGCTCCCGA AATCCGCACC TTGGAAGACG GCACGAAGGT TCAGCGCACG CCGACCGAGT ACCAGTGCTA CCACTGGAAC CGTCCGTACG AAGGCGGCAC GTCGTACAAC AACTACTATC TTGATGCCGA CAACCGCGGT TGCGGGGCTT GCCATGAAGA TCTCGGCGAC GCCCTGGCCA ACATGGAGTA CAGCCATCCG ACCGTCTGGA ACGACGCCCT CGGCAGTAAG ATCACGGTCG ACCAGTGCAT GCTCTGCCAT AGCGCAGATG ACGGCTACGA GATGGGCACC GTCATGCACG GCGTCCACTA CGGCGAACGC AACGGCGAGA ACTTCGAGGA GCGTGGCGGC AAATGCATCA GCTGCCACAA CATGACGGAG AACGAGCAGG GCGCCGAGCT TTGGGACGTG GTCAAGTACG ACCATCTTGA CGGCATCGTC AAGGTTCCCG ACGTCCAGGG CGAGTTCGTG TTCGATCAGG ACACCGTTGT GGCCCCCGAG GACATGTTCA CCTACGACTG GATCCATTCC CGCTACGACA GCCTGATCCA CATCATGGGC AAGAACGTGC TGGATACGGA GCTTCCGAAG GACTTCGTCG ACAATTTCGA GATCACCATG GACGGCCTGG TGAACAAGCC GTTCACGGCG AAGCTGGCCG ACCTGATCTC CGAAGCCGAG GCGGCCGGGG TCACCGTGAC GAAGCTTTCG AAGATCCACT GCGTTGACAA CATGCCCGGT GGCGGCGGCA TCTCGAACGT CGAGATCACG GGCATCCCGC TGACCTGGCT TATCGAGAAG GCGGGCGGCG CGAGCAAGGA CATCACCGGC GTGCTGTTCG ATCGTCAGGA GTTCCGCACC AACGGCGCCC AGACGCACAG CAACCGCGGC GTCCCCGAGG CTTCGTTCGA AGACGTGTAC CTCGTTTACG AGATCGGCGG CAAGCCGCTC GATCCCAGCC AGGGCAGCCC CTGCATCAAC TGGGTCGAGA AGTGCGACGC CCAGTCGTAC GTCAAGCAGT GCGTGGGCTA CAAGCTGACC GACGAGGAAA AGCCGTGGGA AGACTGGATG ATGAACGGTT TCAATTCGTA CGGCGAGGGC CCCTACATCA ACAAGCCGAA TGCGACGGTG CTCAAGGTTC CCGAGGGCAT GATCATCGAG ACCGGCAAGC CCTTCACCTT CGAGGGATAT GCGGACGCCT ACGACGAGGC GGTGGTCAAG CTGGAGTTCT CCATGGACAA CGGCAAGACC TGGACGCCGT ACGATCTGGG CCAGACCGAT CCTAGCAAGT GGGTGTACTG GCACTACACG TGGACTCCTG AAAGCGACGG CAGCTACGTG CTGATGGTGC GGGGCACCAC CGATACCGGC CTCGTGGGAA CCAACATCCA AAAGGTGATG GTGACCGCCA AGAGCAATGT GGAGGGATAG
|
Protein sequence | MRKKLMVVTA MLMAGSLFAA GCAQQGAAEG ESSSTEVQYQ NSIEEAEAYL DGVNERQDEL NKEYAPEIRT LEDGTKVQRT PTEYQCYHWN RPYEGGTSYN NYYLDADNRG CGACHEDLGD ALANMEYSHP TVWNDALGSK ITVDQCMLCH SADDGYEMGT VMHGVHYGER NGENFEERGG KCISCHNMTE NEQGAELWDV VKYDHLDGIV KVPDVQGEFV FDQDTVVAPE DMFTYDWIHS RYDSLIHIMG KNVLDTELPK DFVDNFEITM DGLVNKPFTA KLADLISEAE AAGVTVTKLS KIHCVDNMPG GGGISNVEIT GIPLTWLIEK AGGASKDITG VLFDRQEFRT NGAQTHSNRG VPEASFEDVY LVYEIGGKPL DPSQGSPCIN WVEKCDAQSY VKQCVGYKLT DEEKPWEDWM MNGFNSYGEG PYINKPNATV LKVPEGMIIE TGKPFTFEGY ADAYDEAVVK LEFSMDNGKT WTPYDLGQTD PSKWVYWHYT WTPESDGSYV LMVRGTTDTG LVGTNIQKVM VTAKSNVEG
|
| |