Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0846 |
Symbol | |
ID | 8415136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1041262 |
End bp | 1042248 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 645023812 |
Product | hypothetical protein |
Protein accession | YP_003181209 |
Protein GI | 257790603 |
COG category | [S] Function unknown |
COG ID | [COG2253] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.22305 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGATTG ACCTTGAAGA GGTTGCCAGC GATATCGCCA GCAAGGTCGA TGGCGTCCGG CTCACGCCCG TCATTGAAAA GGAAATCATC CATTACGAGA TTATCCGGTC ACTCGGGAGG AACGGCCTGC TTCGAGATAT CACCTTTCAA GGCGGGACCT CGCTGCGTCT CTGCTATGGA TCGCAACGAT ATAGCGAAGA CCTCGATTTT GTCGCCGGCG ACAGCTTCGA TTCGCTCCCC TTGGATGATT TTTCCAAGAC TTTGCGCTCC GATTTGCTGA AATCCTACGA TACCGAGGTT AGCGTCAGGG AACCCAAGGT CGTCAATGAT TTCGATGGCG TCGGGATGCG TAGATGGACG GTTTCCGTAA ATACCAACAT CGCGCGCCCC GATTTGCCGA AACAGCGAAT CAAGCTCGAG ATCGCGTCTG TGCCAGCACA TACATCCACG ATTCGACGTG TCGCGGTCAA TTATCCCGAA CTTGCCGGAA TGTATGACGA CCTCACGATC CGGTGCCAGA CACTCGAGGA AATCTTGGCC GACAAGCTCA TCTCGTTCTC CGCCACCGAT TCGCATATCA GGCATCGCGA CCTCTGGGAC ATTCCCTGGA TCGTCCGTGC ACAGGAGATC GATTTCCCTG CCGTAGCCGC ATTTGTCGCG GCAAAGCACG ATGACTACCG CTGTCCCGCC TCCCTTGCCA GCATGATCGC CACCGGGACG CAACGTGCAC GCGTCTGCTA TGTCGACGGG TCATTTACCG GACAGATGCA GCGTTTCCTA TCTCCCGCTG TCTTGAACCG CACGCACGAT TTCGATAATC ACTGCGATGC CCTCAATACA ATCGTCGAAA AGTGCTTCGA CCGCGTCGCC GCCTCCCTCG GCATCTCAGA TGAGGTCGAG CATTCCAGAC GCAAACTCGC AATCGAGATA TCTTCGGGAT CGATTTCGGC CGCCGCCCTA CCGAAAAGGA CTTTAGACCT TTCTTAA
|
Protein sequence | MLIDLEEVAS DIASKVDGVR LTPVIEKEII HYEIIRSLGR NGLLRDITFQ GGTSLRLCYG SQRYSEDLDF VAGDSFDSLP LDDFSKTLRS DLLKSYDTEV SVREPKVVND FDGVGMRRWT VSVNTNIARP DLPKQRIKLE IASVPAHTST IRRVAVNYPE LAGMYDDLTI RCQTLEEILA DKLISFSATD SHIRHRDLWD IPWIVRAQEI DFPAVAAFVA AKHDDYRCPA SLASMIATGT QRARVCYVDG SFTGQMQRFL SPAVLNRTHD FDNHCDALNT IVEKCFDRVA ASLGISDEVE HSRRKLAIEI SSGSISAAAL PKRTLDLS
|
| |