Gene Information |
Locus tag | Elen_1469 |
Symbol | |
ID | 8415767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1757256 |
End bp | 1759409 |
Gene Length | 2154 bp |
Protein Length | 717 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645024438 |
Product | RNA binding S1 domain protein |
Protein accession | YP_003181827 |
Protein GI | 257791221 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
|
|
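The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that Gene Length includes the terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information table for Elen_1469.
gene_len = feature_length(1757256, 1759409)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)
```

Under these assumptions the computed values agree with the listed Gene Length (2154 bp) and Protein Length (717 aa).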
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCATCCA TCCAAACCAC CATCGCCCGG GAGCTGAACC TGACCGCCGC GCAGGTCGCG GCCGTCATCG ACCTCGTAGA CCAAGGGAAC ACCATCCCGT TCATCGCTCG CTACCGCAAG GAGGCCACCG GCGGCATCGA CGATGCGACG CTGCGCGATC TGGACGAGCG CCTGACCTAC CTGCGCAACC TCGAGGCTCG CAAGGACGAG GTGCTGCGCG CCATCGAGGA GCAGGGCAAG CTCACGGCCG ACCTGCGCGC GAAGATCGAC GAGGCCACGG TCATGCAGCG CGTCGAGGAC CTGTACAAGC CTTATCGGAA GAAGCGCGCC ACCCGCGCCT CGAAGGCGCG CGACGCCGGG CTCGAGCCGC TCGCGCTGCT CATCCTCGCG CAAGACCGCA GCGCCAAGGA CCCGCTGGCC GTGGCCTCCG GCCTCGTGAA CCCGGAGGCG GGATACCCCA CGCCTGAGGA TGCGCTGCAA GGCGCGCAGG ACATCGTGGC CGAGGTCGTC GCCGACGACG CCGAGCACGT CGCCTGCCTG CGCGCGGCCA CCAAGCGAAA CGGCGCGCTT TCCGTCGAAG CCGTCGACGC GTCCGAGAAA ACCGTCTACG AGGCGTACTA CGACTTCTCC GAACCCCTGT CGCGCATCCC CAACCACCGC ATCCTCGCCG TCAACCGCGG CGAGAAGGAG AAGAAGCTCA AGGTGAAAGT GCGCACCGAC GCCGATGCGG CCATCAGCCA GCTGGAACGC CGCATCCTGC GCGGCGACGG TCCCTTCGCA GCCCCTTTGA AAGCCGCAAT CGCGGACGGC TACAAGCGCC TGATCGCGCC GTCGGTGGAG CGCGACCTGC GCGCCGAGCT CACCGAGCGC GCCGAGACCG ATGCCATCCG CGTGTTCGCC AAGAACACCG AGAACCTCCT GCAGCAGCGT CCCGTGCGTG GGGCGCGCAT CATCGCGCTC GACCCCGGCT ACCGCACGGG CTGCAAGGTG GCCGTGCTGG ACGAGTACGG CAAGCTGCTC GACCACACCA CGGTCTACCC CACCCCGCCG CGCTCCCAGG TAAAGGAAAC GCAGGCCCAG CTGGCCGCCT ACGTCGAGAA GCACCGCATC AACGTCATCG TCATCGGCAA CGGGACGGGC AGCCGCGAAA CCGAGGAGGT GGTGGCCGAC TACATCGCCC GGTCGAAGGC GCCCGTGCGC TACACCATCG TGAACGAGGC CGGCGCCTCG GTGTACTCGG CCTCCAAGCT GGCAAGCGAG GAGTATCCCG ACCTCGACGT CACCACGCGC GGCGCCATGA GCCTGGGGCG CCGCCTCCAG GATCCGCTGG CCGAGCTCGT GAAGATCCCG CCCCAGGCCA TCGGCGTGGG CCAGTACCAG CACGACCTCA ACCAGGCGGC GCTCGAGCGC GCGCTGACGG GCGTGGTGGA GAACGTGGTG AACCGCGTGG GCGTGGACCT CAACACCGCC AGCGCGAGCC TGCTGGGCTA CGTGTCGGGT ATCAGCGCCG CAGTGGCCAA GAACATCGTC GCCTACCGCG AGGAACACGG CGCGTTCACC GACCGGGGTC AGCTGAAGAA GGTGCCGAAG CTGGGCGCGA AGGCCTTCCA GAACTGCGCG GGCTTTCTGC GCATCTCGGA CGGCAAGAAC CCGTTGGACG CCACGAGCGT GCACCCCGAA AGCTACGCCG TCGCAAACGA ACTGCTCAAG CGCGCGAAGG TGAAGCCCGA AGCGCTCGCC GACGGCGGCG TCCCCGACTT CGCGAGCCGC CTCGGCGACG TGGACGCGCT GGCGGCCGAG 
CTGGGCGTTG GCGCACCCAC GCTGCGCGAC ATCGTCTCCG AGCTGGAAAA GCCGGGCCGC GACCCCCGCG ACGATGCGCC GGAGGTCGTG TTCAGCGAGG GCGTGCGCGA CTTCGACGAC CTGACCGTGG GAATGGAGCT TACCGGCACG GTGCGCAACG TCGTCGACTT CGGCGCGTTC GTGGACGTCG GCGTGAAGCA GGACGGCCTC GTGCATGTCT CCAAGATGGC CGACCGCTTC GTGCGCCATC CGAGCGAAGT CGTGGCCGTG GGCGACACGG TCACGGTGTG GGTGACGGGC ATCGACAAGG ATCGCGGCAG GATCTCGCTG TCCATGGTGA AAGGCAAGGC GTAG
|
Protein sequence | MPSIQTTIAR ELNLTAAQVA AVIDLVDQGN TIPFIARYRK EATGGIDDAT LRDLDERLTY LRNLEARKDE VLRAIEEQGK LTADLRAKID EATVMQRVED LYKPYRKKRA TRASKARDAG LEPLALLILA QDRSAKDPLA VASGLVNPEA GYPTPEDALQ GAQDIVAEVV ADDAEHVACL RAATKRNGAL SVEAVDASEK TVYEAYYDFS EPLSRIPNHR ILAVNRGEKE KKLKVKVRTD ADAAISQLER RILRGDGPFA APLKAAIADG YKRLIAPSVE RDLRAELTER AETDAIRVFA KNTENLLQQR PVRGARIIAL DPGYRTGCKV AVLDEYGKLL DHTTVYPTPP RSQVKETQAQ LAAYVEKHRI NVIVIGNGTG SRETEEVVAD YIARSKAPVR YTIVNEAGAS VYSASKLASE EYPDLDVTTR GAMSLGRRLQ DPLAELVKIP PQAIGVGQYQ HDLNQAALER ALTGVVENVV NRVGVDLNTA SASLLGYVSG ISAAVAKNIV AYREEHGAFT DRGQLKKVPK LGAKAFQNCA GFLRISDGKN PLDATSVHPE SYAVANELLK RAKVKPEALA DGGVPDFASR LGDVDALAAE LGVGAPTLRD IVSELEKPGR DPRDDAPEVV FSEGVRDFDD LTVGMELTGT VRNVVDFGAF VDVGVKQDGL VHVSKMADRF VRHPSEVVAV GDTVTVWVTG IDKDRGRISL SMVKGKA
|
| |
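The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last few codons. This is a minimal sketch, not the database's own pipeline: it builds the codon-to-amino-acid mapping by hand (translation table 11 assigns the same amino acids as the standard code and differs only in permitted start codons), and the `translate` helper is a hypothetical name. The gene sequence is listed in coding orientation, so no reverse complement is applied even though the gene lies on the minus strand.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are shared by
# NCBI translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace is ignored, '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 nt and last 24 nt of the gene sequence listed above.
first_residues = translate("ATGCCATCCA TCCAAACCAC CATCGCCCGG")
last_residues = translate("TCCATGGTGA AAGGCAAGGC GTAG")
print(first_residues, last_residues)
```

The two fragments correspond to the start (`MPSIQTTIAR`) and end (`SMVKGKA`, followed by the stop codon) of the listed protein sequence.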