Gene Information |
Locus tag | Elen_0245 |
Symbol | |
ID | 8414529 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 333943 |
End bp | 335802 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645023223 |
Product | membrane-flanked domain protein |
Protein accession | YP_003180626 |
Protein GI | 257790020 |
COG category | [S] Function unknown |
COG ID | [COG3428] Predicted membrane protein |
TIGRFAM ID | |
| |
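The coordinates and lengths listed above agree with each other if positions are read as 1-based and inclusive. The following is a minimal Python sketch of that arithmetic. It assumes inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(333943, 335802)
print(length)                  # 1860
print(protein_length(length))  # 619
```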
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGACT ATCGCCAACC GCCCCAACCT CCGCAGCCCC AGCCTCGGCC CGCTGCGCCC CAACCCCAGC CTGCTGCGCC GCAGCCTGAA GGGCCGCGGG GGAGCCATGT CCATCATAGC TATATCTGGT TGGGCAGTCT GCGCACGGCG TTCATGCTGC TGGCCATCGT GGTCTTCTCC TCGTTCTCGG CTATCATCGG CGCCATCTCC GAGGGCGAAG CCATCACGCG CGGCGACATC CCCATGCTTT TCATCGTCAT CGGATCCGTG ATCGCCGGTA TCGTCGTGCT GGTGGCGCTT GTGGCGGTCT ACCAGGTGAT CTCGTACAAG CATCTGTACT ACGAGCTAGG GCCCGAGGAA TTCAACCTGT ATTCGGGCAT ACTCAACAAG AAGCGCGTGC ATGTGCCCTA CCAGCGCATC CAGTCCGTCG ACCAGCACGC CACGCTCATC CAGCGCATCT TCGGCGTGTG CAGCGTCAGC ATCGACACGG CGGGCGGCGC GGCGAACAAG GCCGTCATCG TGCCGTATGT GCAGAAGACG CAAGCTGAGG AGCTGCGCCG CGAGCTGTTC GCGCGCAAGC AGTACGCCGT AGCCGTGCGC AACGGTGCCG CGCCCGATGC CGCCGTTGCG GCGATGGCCT CTGCAGCGGG CGTTCCCGCG CAAGCCTTGC ACGAGGGCGC CAATGTGCTG GACGCTCCCG CCGAGATCTG GCAGGACGTG CGCGGCGTGT TCGGTGGCGC CGCGGTGGAT ACGGGCCGCG TGACCTACGA GTACGGCATG TCCAACAAGG AGCTCGTGTT CACCGGCCTG TCGAACAACA CGGCGTTCTT CGTCGTGGTT GTCGGCATCG TCGGCGCCGT TTCCCAGTTC ATGGGTCAGA TGGCGCCCAT CCTCTCCGGT TCGATGGAGC CGCTCGTTGG CAACGTTGTA GCCACGAGCG TTCGGCTGTT CGGGGGCAGC CTCATCGCCG CGGGCGTGGC GACCTTCCTC GCTGCGTCGC TCGTGTTGTG GCTGCTGTCT GCCATCGGCG CTTGCGTCTC GTACGGCGGC TTCCGCGCAT GTCGCCGCGA CAACCGCATC GAAGTCGAGC ACGGCCTGCT GCAGCACCGC TTCCAGGGCG TCGACGTCGA CCGCGTGCAG TCGGTGATGG TGAAGCAGAG CTTCATCCGC CGTCTGCTTG GCTACTGCGA GCTGTCGCTG GGCAAGATCG ATGCGGCGGC TGAAAACTCC GACGACCAGC AGAAAGGCCT CAGCCAGCAA GGGCTCGTGA TCCATCCCTT CGTGAAGATG TCCCGCGTGC CCGAGATTCT CGCGGGCATC GTTCCCGAGT TCGCCGACGT TCCCACCGAG AACATCCCGG TGGCGCCCGT GGGGCTGCGT CGCGCCCTCA TCCGCCGATG CATCATCCAG GGGACGGGCT TCTGGTTGGC CGTCCTCGTG GCGGCAGGGC AGATCGCGGT GAACCTCCTG GCGGATCCGG CCGTGCCTGA CGGAGCCATG ACGTTGTTCT TCGTCAACAA CGGCGCGCTG TTCGGCTATG CGCTTGCGGT CGTGCTGCTC GTTCTCGACG CGGTAGGAGC CGTGCTGTGG TTCCGTGGCT CGGGGTTTGC GTACAACGAG CGTTTCATGC AGGTGAGCAA CGGCGGGTTC GCTCGCGAGA CCATCAGCTT CCCGCGCAAA AAGATACAGT TCGGTTACAC GAAGACGAAT CCTTTTCAAC GCAATGCCGG CACCGCTACG GTCAACGCGC GCACTGCAGC AGGGGTTGGA GGCACTACCA TCAGGCTTAT CGATGCCCGA 
GAGGATGATG CCCGTGCGTG GCTTGCTTGG CTCAAGCCCC ACGGAAATGT GATACAGTAG
|
Protein sequence | MTDYRQPPQP PQPQPRPAAP QPQPAAPQPE GPRGSHVHHS YIWLGSLRTA FMLLAIVVFS SFSAIIGAIS EGEAITRGDI PMLFIVIGSV IAGIVVLVAL VAVYQVISYK HLYYELGPEE FNLYSGILNK KRVHVPYQRI QSVDQHATLI QRIFGVCSVS IDTAGGAANK AVIVPYVQKT QAEELRRELF ARKQYAVAVR NGAAPDAAVA AMASAAGVPA QALHEGANVL DAPAEIWQDV RGVFGGAAVD TGRVTYEYGM SNKELVFTGL SNNTAFFVVV VGIVGAVSQF MGQMAPILSG SMEPLVGNVV ATSVRLFGGS LIAAGVATFL AASLVLWLLS AIGACVSYGG FRACRRDNRI EVEHGLLQHR FQGVDVDRVQ SVMVKQSFIR RLLGYCELSL GKIDAAAENS DDQQKGLSQQ GLVIHPFVKM SRVPEILAGI VPEFADVPTE NIPVAPVGLR RALIRRCIIQ GTGFWLAVLV AAGQIAVNLL ADPAVPDGAM TLFFVNNGAL FGYALAVVLL VLDAVGAVLW FRGSGFAYNE RFMQVSNGGF ARETISFPRK KIQFGYTKTN PFQRNAGTAT VNARTAAGVG GTTIRLIDAR EDDARAWLAW LKPHGNVIQ
|
| |
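The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The sketch below shows one way to strip the spacing, compute the GC fraction, and translate the coding sequence. It is an illustration under stated assumptions, not the database's own pipeline. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is written out by hand. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted, which does not matter here because the gene begins with ATG.

```python
# Codon table in TCAG order; amino-acid assignments shared by
# translation tables 1 and 11 ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Assumes the sequence starts with ATG; alternative start codons
    (allowed in table 11) are not special-cased in this sketch.
    """
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two display blocks of the gene sequence, trimmed to whole codons.
prefix = clean_sequence("ATGACGGACT ATCGCCAA")
print(translate(prefix))  # MTDYRQ
```

The translated prefix matches the first six residues of the protein sequence in the record. Passing the full gene sequence through `clean_sequence` and then `gc_fraction` or `translate` would be the way to check the listed GC content and protein sequence.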