Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0713 |
Symbol | |
ID | 8415003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 890865 |
End bp | 892514 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645023685 |
Product | hypothetical protein |
Protein accession | YP_003181082 |
Protein GI | 257790476 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.235142 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACAAT CCACTACGCC GAAGGCGCTC GCCTTGGGTG TTTCGCTCTC CCTCGCGTGC GGCCTGCTGC CGGGCGCGGC CTTTGCGGAC GAAGCCCCAG AAGACGATGA GTCCGCCAAC GCCGCCGATT CCGCTAGAGG AACGGCCGGC GGCGCCGACT CCGCCCCTTC GGGCGCGGCC GCCAAGCCCG ATGGCTACGA TAAAGCCGAC TTCTACTCCG ACAACCCCGT CGCGCCAATC GCACGGACGC TGGCCGTCCC GCTGGCAGCC AACACGCAGC TGCGCAGCCT CTCGAACGAG ATGAAGTACT TCGCCCGATT CGAGAGCAAC CAGAACTATG ACCAGGGGCT CTCATGGGGC GATGGCTACC ATGCAATGGG CTACTATCAA TTCGACAACC GATACTCGCT CAAAAGCTTT ATTTCCTATT GCTACAATTT CGACCCGGTC AAGTACCGGA TGTTCTCCTG GGTTCCCAGC GCGAACATTT CGGGAGACCT GTACGACGGA AACGCGGATC GGCTGACCGA CATCGGCCAG CAATTGAACG AATCCTGGCA TGCGGCATAC GCCGCCGACC CCGCGGAGTT CTCGGCACTG CAGGACACCT TCGCATACGA CAACTACTAC GTCCCCGCTG AAACGTACCT CGCAAGCCGC GGCATCGACA TCTCGAATCG CGCCGACAGC GTCAAAGGCC TTTGCTGGGG CCTGGCGAAC CTGTTCGGCA CGTCGGGATG GCACAAATTC GTGGGCGGTT GGTCCGACGG CTACGTGAAC GGAACATATT ACAACAGCTA TAACTACCCC GGCGCCGGGC TGACGAACGA CATGACCGAC GAACAGTTCG TCACCACGCT GTGCGATTAC GTGATAGCGC ACGTGGCCGA GTTCTACCAC GGCCAGCCGC AGTACCACGA AGGCTGGACG AACAGGTACA AACAGGAAAA GGAGCTGTGC CTGAGCCTGC TTCCCGAGAA GCCCACCGTG CCTCCTGTCG AGGAAGAGCC CCCTGCGGCG GATTCCCCTG CGGTTCCGCC GGTCGGGAAC ACCCCTCCCG AGGAAGGCAG CGGAGATGCG CCTTCGGGCG AGACGAAACC GCCCGAGAGC GTCCTGCCGC CCGAGGTGAA CCCGCCGGCA GCGCCCGAAG GGAACCCGCC GAGCGCTCCG AACGAACCCG CCCCGCCCGC GAACGACGGC GCCGGACAGG CCGATCCCGA CGAAACCCCG AAGCCGGACG AAGACAGCGC GGCTCCAACT CCGCCTGCAT CGCCGTCGAA CCCCGAAAAC ACGACGCCTC CCGCTGGATC CGAAACGACG CCGCCCGAGG AAACAAAGCC TCCGGAGAGC ACGACGCCGC CCGAAAGCCC GGAAACGACG CCGGGATCAG GTCTGCAAGA GCCGACGAAA GAAGTGGCGA ACAGCAGCAA GCCCTCCCTT GAAGAGCCAA CCGCCCCTGC GCAGGAGCCT GTCAACCTGC CCAAGGCCGA CGGAGACGAG AAGAACGGCA ACGAAACAGC CGCAAAAACC TCGAAGCTCA GCCAGACGGG CGACGGCACC ACGAACGCGC CGCTCGGCGT GGCGCTTGGA GCAGCCGGCA TCGCCGCGAT CGCAGGCGCC GCGCTCGCCA CACGACGCAT CCTCAAATAG
|
Protein sequence | MGQSTTPKAL ALGVSLSLAC GLLPGAAFAD EAPEDDESAN AADSARGTAG GADSAPSGAA AKPDGYDKAD FYSDNPVAPI ARTLAVPLAA NTQLRSLSNE MKYFARFESN QNYDQGLSWG DGYHAMGYYQ FDNRYSLKSF ISYCYNFDPV KYRMFSWVPS ANISGDLYDG NADRLTDIGQ QLNESWHAAY AADPAEFSAL QDTFAYDNYY VPAETYLASR GIDISNRADS VKGLCWGLAN LFGTSGWHKF VGGWSDGYVN GTYYNSYNYP GAGLTNDMTD EQFVTTLCDY VIAHVAEFYH GQPQYHEGWT NRYKQEKELC LSLLPEKPTV PPVEEEPPAA DSPAVPPVGN TPPEEGSGDA PSGETKPPES VLPPEVNPPA APEGNPPSAP NEPAPPANDG AGQADPDETP KPDEDSAAPT PPASPSNPEN TTPPAGSETT PPEETKPPES TTPPESPETT PGSGLQEPTK EVANSSKPSL EEPTAPAQEP VNLPKADGDE KNGNETAAKT SKLSQTGDGT TNAPLGVALG AAGIAAIAGA ALATRRILK
|
| |