Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0721 |
Symbol | |
ID | 8415011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 907935 |
End bp | 909671 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645023692 |
Product | protein of unknown function DUF344 |
Protein accession | YP_003181089 |
Protein GI | 257790483 |
COG category | [S] Function unknown |
COG ID | [COG2326] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000055361 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGAAA CCGTCGATTT CTCACGCGAA CCGCTCTCGA AGGACGCCTA CAAGGCGCGT CGGGACGAGC TGATGGAGCG ATTGGTGGTG CTGCAGCAGC AGGCGCGCGT GCAGGGCGTC GGCCTGGTGG TGCTGTTCGA AGGATGGAAC GGCGCCGGCA AGGGCAGCCG CATCTCCGAT CTCATGTACC ACCTCGACGC GCGCGCCACC AGCGTGTACG TCACTGAAAA CCTCGACGTG AAAGCCGCGC GCGCGTTCGC GGGCGCGAAG AGCGGCGTAA CGGGCTTCTA TCCCGTGATG CAGGAGTTCT GGAAGAGCCT GGGCCAGCGC GGCACCATCT CGTTCTTCGA CCGCGGCTGG TACACCGCCG CCGTTCAGCA CATGCTGTAC ACCGAGTTCG GCAAGCTCTC CCTGAAAGCC TCCAAGCGCA AGGGCCAGAA AGCCGTCGCG GCCGCCATGG CCGAGGCGCG CGACGAACGC CACATCGACG TGCTGCGCCG CTACCTCACC TCTGCGTCCG ATTTCGAGCG GCAGCTAGCC GACGACGGTT ACCTTGTGGT CAAGTTCTTC GTGCACGTCA CGAAGGAGGC GCAGAAGAAG CGCCTCACGC GCCTGCATGA CGATCCGGCC ACGCGCTGGC GCGTGGGCGA GGACAAGCTG GCCACCATCG GCAACTACGA GGAGGCGTAC CGCCTGTACG ACAACCTGCT GAAGGGCAGC GACTTCTCGT TCGCCCCATG GCATCTCGTG AACGGCGAGG ACAAGCGCCG CGCCAACCTG CAGATCGCCG AGACGCTGGT GAACGCGCTT ACGAGCGCGT TCGAGGCAGC GCCCGACGCC GAAGCCGCCG TAGCGGCGGC CAAGGCGCAG GCCAACTCCG CCGGCGCTCT CGATGAAGCG CCCCTGTTCG GCCGTTCTCC CGAAGAGGAG GCGCGCGTGC GCGAGGAGGC GGAAGCCGCC GCAGCCGCTG CTTCCGCCCG CGCTCCGCGC GTTTCGAGGT TCCGTCAGGT GGACGACCCG CCGTGCCTCG AGAGCGTCGA CCACGCGCTC GCGCTCGACC CCGAGACGTA CAAGGTCGAG CTCAAGGCCC AGCAGGAGCG CCTCAACAGG CTGGAGATGG AGATGTACCA GAAGCGCATC CCGCTCATGA TCATGTACGA AGGCTGGGAC GCCGCGGGCA AGGGCGGCAA CATCAAGCGC GTGGCCCAAG CGCTCGACGC CCGCGCCTAT ACCATTTTTC CCAGTCCCGC CCCCACGAAG CCCGAGCTGC TGCATCCGCA CCTGTGGCGC TATTGGACGC GTCTGCCGAA GGCGGGCCAC GTGGGCATCT ACGACCGCAG CTGGTACGGT CGCGTGCTCG TGGAGCGCGT CGAAGGTTTC GCTTCGGTGT CGGAATGGAC GCGGGCGTAC GACGAGATCA ACGAATTCGA GCGCGATCTG GTGCGGTGGG GCGCCATCCT GCTGAAGTTC TGGGTTGACG TGAGTCCCGA AGAGCAGTTG CGACGCTTTC GCGACCGCGA GCAAGATCCT GCGAAACAGT GGAAGATCAC CGATGAGGAT TGGCGCAACC GCGACAAGTA TCCCCAGTAC AAAGCCGCGG TCGAGGATAT CTTCCGCTTG ACCAGCACGC CGTTCGCCCC CTGGATAATC CTCGAGAGCG ACGACAAGCG CTACGCGCGC GTCAAGGCGC TCAAAATTAT CAACGACGCC CTGGAAGCGC GCTTGCGCGA AAACTGA
|
Protein sequence | MLETVDFSRE PLSKDAYKAR RDELMERLVV LQQQARVQGV GLVVLFEGWN GAGKGSRISD LMYHLDARAT SVYVTENLDV KAARAFAGAK SGVTGFYPVM QEFWKSLGQR GTISFFDRGW YTAAVQHMLY TEFGKLSLKA SKRKGQKAVA AAMAEARDER HIDVLRRYLT SASDFERQLA DDGYLVVKFF VHVTKEAQKK RLTRLHDDPA TRWRVGEDKL ATIGNYEEAY RLYDNLLKGS DFSFAPWHLV NGEDKRRANL QIAETLVNAL TSAFEAAPDA EAAVAAAKAQ ANSAGALDEA PLFGRSPEEE ARVREEAEAA AAAASARAPR VSRFRQVDDP PCLESVDHAL ALDPETYKVE LKAQQERLNR LEMEMYQKRI PLMIMYEGWD AAGKGGNIKR VAQALDARAY TIFPSPAPTK PELLHPHLWR YWTRLPKAGH VGIYDRSWYG RVLVERVEGF ASVSEWTRAY DEINEFERDL VRWGAILLKF WVDVSPEEQL RRFRDREQDP AKQWKITDED WRNRDKYPQY KAAVEDIFRL TSTPFAPWII LESDDKRYAR VKALKIINDA LEARLREN
|
| |