Gene Information |
Locus tag | Elen_1091 |
Symbol | |
ID | 8415381 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1320106 |
End bp | 1321185 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645024054 |
Product | oxidoreductase molybdopterin binding |
Protein accession | YP_003181451 |
Protein GI | 257790845 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | |
|
|
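The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(1320106, 1321185)
print(length, protein_length(length))  # prints "1080 359"
```

Under those assumptions the result agrees with the Gene Length (1080 bp) and Protein Length (359 aa) fields.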
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.0000000000000278852 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTTACA ACCCCAAGAA GATCGCGGGC GGCGTGGCCG GCGGCGCGCT GCTGATGTTC GGAGCAGTGG GCGGTATCGG CGCTACGGCC GGACTACCGC AGACCGCCGA CGCGGACCCG ATCGCCACGA CGCATGCCGT GGGAGACAAC GTCGTGACCG CACAATGGTT CAACGAGGAC GACGTGTCCT ACGTCTCCGT GGCCAACGCC CAGGGAGCGT TCACCTTCAA CCAGGAAGGC GTCACCCCCA ACGACGAGCT GTTCAACGTG TTCGGAACAG CGCTGACCAG CATGTGCTCG AAGCCTGCCA CCGAGCTCGT CGACGTGGAG GGCGGCGTGG CCAACTTCTA CGTGAACGTG GGCGGCAACA TCAAGAAGAA CTTCACCATC GACGTGCGCG ACCTGGCCGA AGATGCCAAC CAGGAGACGA TGATGGCGTG CTCGTGCGCC ACCGGATCGC CCTTCGGCCA GGCGGCCGTC ATGGGCGTGC CGCTGTCGGC CGTGGTGGAG ATGGCCGACC TGGAAGACGG CGTGAACACC ATCACCGCCT ACGGCGCCGA CGGCTTCGGC CAGCCGCTGC CCTTGCGCTA CGCGCTGGAG AAGAACGCGC TTCTGGCGTA CCAGGTGAAC GGCCAGGAGC TGGAGGCGGC CACCGGGTCG AGCCTGCAGC TGTGGATGCC CGAGACCGTG GCGCGCTACT TCACGCGCGA CATCGTGAAC ATCGAGCTGA CGCAGGAGGA TGCGGAGCCC GACGTGCAGC AGGTGGACCC GTGCTACCGC AACAAGATCA ACATCATGAA CTACTCCGAC GACTGCGTGT TCAAGGCAGG CGACGAGATC ACGTTCGAAG GCGTGGCCGA CGATCTGGGA AGCCCCATCG CCGCCATCGA GTTCTCATTC GACAACGGCC GCACATGGAC GTCGTGCGAC ACCGACGGCG CCACGGCCGA CAAGTGGGTG AACTGGCAGT TCACCACGTC GTTCGAGGAA AAGGGCGACT ACCGCATGAC CGTGCGCGCG AAAACCGCCG ACGGCATGGT GTCCCCGCTT GCGGCGACGC TGCTGTTCGA AGTGGCCTAA
|
Protein sequence | MAYNPKKIAG GVAGGALLMF GAVGGIGATA GLPQTADADP IATTHAVGDN VVTAQWFNED DVSYVSVANA QGAFTFNQEG VTPNDELFNV FGTALTSMCS KPATELVDVE GGVANFYVNV GGNIKKNFTI DVRDLAEDAN QETMMACSCA TGSPFGQAAV MGVPLSAVVE MADLEDGVNT ITAYGADGFG QPLPLRYALE KNALLAYQVN GQELEAATGS SLQLWMPETV ARYFTRDIVN IELTQEDAEP DVQQVDPCYR NKINIMNYSD DCVFKAGDEI TFEGVADDLG SPIAAIEFSF DNGRTWTSCD TDGATADKWV NWQFTTSFEE KGDYRMTVRA KTADGMVSPL AATLLFEVA
|
| |
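The gene sequence shown starts with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. As a minimal sketch of how the protein sequence relates to it, the code below translates a DNA string codon by codon. It assumes the amino acid assignments of translation table 11, which match the standard code (the tables differ in permitted start codons, which this sketch does not model). The `translate` helper is hypothetical and written for illustration; it strips the spaces that separate the 10-base groups in the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGGCTTACA ACCCCAAGAA GATCGCGGGC"))  # prints "MAYNPKKIAG"
```

The printed residues match the first group of the protein sequence above; passing the full gene sequence to the same function is one way to check the remaining residues.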