Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1953 |
Symbol | |
ID | 8416263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2292478 |
End bp | 2293617 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 645024929 |
Product | hypothetical protein |
Protein accession | YP_003182306 |
Protein GI | 257791700 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000104092 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCGGTA TCATCACGCA TTCGAATTCG AGCATGAGCA GGCAAGCTCT TCGACAGCAG GCGGAAAAGC ATCGTCTCGT GCGCCTGTTC CGTGGCGCGT ACATGGACGC GCAGGAGTAC GCGGCGCTCG ATATCTCTGG AAGATACCGG GCTCGTGCCC AGGCGTTTCT TGCGACGCAT GCGAAGCTTC GAGCGTGGGG GATCACCGCA GCCGCTCTTG AGGGCGCGCC GGTCCTCGGC GGAGCGCCTT TGCATTTCGG CGGCGCGCGA AGCCACGCCA AGAGCAAGCA GGACGGCTGC GCTTTTCACG AGGCTTCGCT TGAGACGCCA TCGAACCCGG TAGCGCAAAC GCTTTTCGAG TGCGCCTCGA CCTCTCCTTT GCCGGATGCG CTTTTGGCTG CGAATTATCT GTTGCGTCGT TCTTCCGCGA AAGCGCAGGG CGGTCTTGTG GCATGCAGGG ATATTGACGA GAGTACGACC GAAGCGCTCG TATGGGAGCC TGCAACCTCT GGGAGCGGGG AAGCGAGCAT CCGCTCTGCG TTCGATACCC GCATGCTCGA CATGGAAGAA CCTGAATTCC TTGCTAACTA CAGCGCTGCG CGCGTGACCG GATTCGTTTC GCCGGAAGCC GAGCTTCTGT GGCTCGCCTT CGCGCAGCTC TGCTTCGCCA ATGGAGGAAA ACGCGGAATA CGCAGCGCGT TGAAGGCGGG GCTGTACTTT ACCGACCAGG TCGAATCGCC GGCCGAGTCG TTTCTGATCG CCCGTTGCGT CGAACTTGGC TTCGAAATTC CCTATCTGCA GGTCAACATT CTCGACCCTT CGAACGGGAG GCATCTTGGT CGCGTCGACG GGCTTTGGCC TTCTGAAGCC GTACAGAAGA GCCTCTATCG AAGCGATAGC AGGTTCGGGC GCTTTCTCCA ATGCAGGCGG CTTGGAGACA ACGGCTCCAT CGTCATCGAC TTCGACGGCA AGCTGAAGTA CCGGCAGGAT TATGCCGAAA TTTTGGAAAG AGAGCGACAG CGGCAAAATG CCATAGGGAA TCTCGGGTTT CGGTTCGTGC GCATCGGCTG GGACGATCTC ATGCGGCCCG AGCGCTTGCG TTCGATCCTC GAAGCGGCTC GCGTCCCGCG TTGCAGGTGA
|
Protein sequence | MPGIITHSNS SMSRQALRQQ AEKHRLVRLF RGAYMDAQEY AALDISGRYR ARAQAFLATH AKLRAWGITA AALEGAPVLG GAPLHFGGAR SHAKSKQDGC AFHEASLETP SNPVAQTLFE CASTSPLPDA LLAANYLLRR SSAKAQGGLV ACRDIDESTT EALVWEPATS GSGEASIRSA FDTRMLDMEE PEFLANYSAA RVTGFVSPEA ELLWLAFAQL CFANGGKRGI RSALKAGLYF TDQVESPAES FLIARCVELG FEIPYLQVNI LDPSNGRHLG RVDGLWPSEA VQKSLYRSDS RFGRFLQCRR LGDNGSIVID FDGKLKYRQD YAEILERERQ RQNAIGNLGF RFVRIGWDDL MRPERLRSIL EAARVPRCR
|
| |