Gene Information |
Locus tag | Elen_2638 |
Symbol | |
ID | 8416963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 3059303 |
End bp | 3061495 |
Gene Length | 2193 bp |
Protein Length | 730 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645025616 |
Product | phage minor structural protein |
Protein accession | YP_003182978 |
Protein GI | 257792372 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01665] phage minor structural protein, N-terminal region |
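The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the terminal stop codon; neither convention is stated in the record. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3059303, 3061495)
print(length)                     # 2193
print(protein_length_aa(length))  # 730
```

Under these assumptions the values agree with the Gene Length (2193 bp) and Protein Length (730 aa) fields.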
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0220022 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCTAT TCGTCTGCGA CCGCTGGGAG AACTACAAGG GCGCGATCAA GACGCTCGCC TCGTGCATCG ACACCCGCGA GCTGAACGGA GAGAACAGCC TCGAGATCGT CTCGTTCGCG GTGCTCGACA AGGGCGACCG CATCGTTTGG CGCGACCTCA AGGGCCGTTG GCGCGAGAAC ATCGTCGGAA GCTGCGACGA GAGCCACGCC GACACGGGCG TGGAGCGCAC GCACTACTGC CCGAATTCCG CCATCGAGCT GCGCGGCGAC TACATCGAGG ACAAGCGCCC CGGCAACTGC TCGGCGGCGA CCGCGCTCGC CTCGGCGCTA TCGTCGTCGC GGTGGACGGT CGGCCAGGTC GATGACCTCG GCACGGCACA GGTCAACTTC TACCACGCGA GCGCGTGGCA GGCGATACAC GACGTGGCCG ACGCCTTCGG CGGCGAGCTG CGCTTCGATA TCGCGGTTTC CGGCTCGCGC GTCGTATCGC GCCGCGTGAG CCTGCTCGCG CACGTCGGGG CCGACACGGG CAAGCGGTTC ACGTACCGGA AGGACCTATC GGAGTGCAGG CGCACCGTGT CGGAGGACGA CGTGTGCACG GCGCTCTACG GCTGGGGCGC GTCCCTTGAG ACGACCGACG ATGACGGCAA CCTCACGGGC GGGTACTCCC GCAAGCTATC GTTCGCGGAC GCGAACGGCG GGGTCAAGTG GATCGGAGAC GACGCGGCCC GCGAGAGGTG GGGCCGCCCC GACGGCAAGG GAGGCAAGGC CCATGTCTTC GGCGAGGCGA CGTTCGACAA GTGCGAGGAC CCGGCGGAGC TGCTGCGCCT CACGAGGGCG GAGCTTGCCA TGCGCAGCGT GCCCAAGGTG AGCTATGCGG CGTCCGTCGC CGCGACCCGG AACGCAGGCG TCGGGTTCGA GGGGTCAGAC GAGGGCGATG CCGTGGCCGT CATCGACGGC GACCTCGGCG TGCGCGTGAT GGCCCGCATC ACGAAGATCA AGGAAGACCA GCTCGAAGAG GAGAACACGA CCTACGAGTT CGGAAACTTC GGCGACCTCG CGGACGTGTT CAAGGCCCAG AAGAGCGCGA TCCGCGAATC ATCGGATTCG GTGGCCTCCT ACGCGCAGAA CGCCGTCAAC GCCTCGAACG CGCAGACGAA CAGGAACATG GGCGCGCTCG GCGACAGGTT CGAGCAGGAG GTGAACGAGG CGATCAAGCA CGGCGACGAC CAGGTGGCCG CGCTCAAGGC CGAAATGGAC AAGATACCCT CCGATATCCG TGAACAGCTC ATAGACATGA TCAACGAGGA GGTTAACACG ACGGGCGGCT GGGTGTACGA GGAACCCGGC AAGGGCATAT TCGTGTACAA CTCGCGCCCG GAGAGCGCGA CGAAGTGCGT GAAGATCGGC GGCGGCATCG TGGCCGTGGC GAACAGCAAG TACAGCAGCG GCAATTGGTA CTTCAAAACC GCGATGACGG GCGACGGCAT CGTGGCCGAC CGCATCTATA CCGGCCGCAT AACCGGCGGC AGCTCGTACT TCGACCTCGA CAGCGGCACT ATCAACATGC GCAACGGCAT CATCAACATC ACCGACACCA ACGGGAACAC CGTAACGGTG TCGCCGTCGC TCGGGTTCCA AGTGCGCGAC AAGAACGGCT CCATGATCGC CGGAACGGTG ATAGCGAACG GCAAGGCGTT CTTCATGAGC CGCGCGGTTG GCGTGTCGTC GTCGCTCTAC GTGACGACAG GGACGACAGC GGCGGGGAAT CCAGGCGCTT CGTTCGTCAA CTCGCAGGGC 
AACTACCTCG ACGTCGAGGC GCTTCGCGCA TCGGCCGACC CGAGCAACAG GACGACGGGT GCGGGCATGG CCTGCTTCAA CAAGCCATTC CTGCACACGA GCACGTACTA CAAGCAGCTA TGGCTCCATC CGCCGTATTT CGACGACACG TATATGCAGC AGCCGAACGA GCAGCTATTC CTTCGCTGCG GAGGCAACAG CCTCGGCGAA TCGCCGCTCG TGAAGCTGCA ATACAGCAGT TCGCAAGGGC TGTTCTTCGG GAGCGGGTAC GTAACGCTCA AGCTCGACGA TTCGCACTAT GTTCGCATCT CGTCGGACGG CGTGCAATGC CGCTGCGGCT CGAAGGGCTT CGGATGGCTC AATGGCGAGT TCCACCAAAC GCTCGTCTGG TAA
|
Protein sequence | MQLFVCDRWE NYKGAIKTLA SCIDTRELNG ENSLEIVSFA VLDKGDRIVW RDLKGRWREN IVGSCDESHA DTGVERTHYC PNSAIELRGD YIEDKRPGNC SAATALASAL SSSRWTVGQV DDLGTAQVNF YHASAWQAIH DVADAFGGEL RFDIAVSGSR VVSRRVSLLA HVGADTGKRF TYRKDLSECR RTVSEDDVCT ALYGWGASLE TTDDDGNLTG GYSRKLSFAD ANGGVKWIGD DAARERWGRP DGKGGKAHVF GEATFDKCED PAELLRLTRA ELAMRSVPKV SYAASVAATR NAGVGFEGSD EGDAVAVIDG DLGVRVMARI TKIKEDQLEE ENTTYEFGNF GDLADVFKAQ KSAIRESSDS VASYAQNAVN ASNAQTNRNM GALGDRFEQE VNEAIKHGDD QVAALKAEMD KIPSDIREQL IDMINEEVNT TGGWVYEEPG KGIFVYNSRP ESATKCVKIG GGIVAVANSK YSSGNWYFKT AMTGDGIVAD RIYTGRITGG SSYFDLDSGT INMRNGIINI TDTNGNTVTV SPSLGFQVRD KNGSMIAGTV IANGKAFFMS RAVGVSSSLY VTTGTTAAGN PGASFVNSQG NYLDVEALRA SADPSNRTTG AGMACFNKPF LHTSTYYKQL WLHPPYFDDT YMQQPNEQLF LRCGGNSLGE SPLVKLQYSS SQGLFFGSGY VTLKLDDSHY VRISSDGVQC RCGSKGFGWL NGEFHQTLVW
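The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched as follows. This is a minimal illustration, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons). The helpers `translate` and `gc_fraction` are hypothetical names, and only the first 30 bases of the record are used as input.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
# Amino acid assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


first_30_bases = "ATGCAGCTAT TCGTCTGCGA CCGCTGGGAG"
print(translate(first_30_bases))  # MQLFVCDRWE
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the complete protein sequence and the GC content field to be checked in the same way.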