Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2208 |
Symbol | |
ID | 8416530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2593582 |
End bp | 2595240 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645025193 |
Product | ErfK/YbiS/YcfS/YnhG family protein |
Protein accession | YP_003182558 |
Protein GI | 257791952 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.827056 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACGCCAG AACACGAAAC GCGCGGCAAG CACGAAGCGC AGCAACCGGC GCCGCAGCCA GCGCCGCGCC CGGCACCGCA GCGAACGCGG CGCATCGACT CGATCGACCC TCCCTTGGCC GACGGCGCGC CGGAAAGCTA CGTGTCGTTC GTGCCCGCGT CGCGGGGCGG CCGACCGGGG CCGGTTCGCG GCGGGCGCCC GGAGTCGGCG GACGGCGGCC GACAAGGGTC GTCGAACCGG AAGCCTCCGT TCGCGAAGAA GACCGGGCTC ATTGCGGGCG GCGCGCTCGT CGCCGTGATC GCGATCGTCT ACCTGGCGGG CGCGCTCGTG TTCATGGACC GCTTCATGCC CAACACCACG ATCATGGGCA AGGACGTGTC GTTGAAAACC ACCGCCGAGG TTCAGGATCT GTTGACCGAC GTGGCGAAAA GCTACCAGCT GAGCGTGTCG GGCGAGGGGT TCTCGCTGAC GCTCGCCTCG TCCGACATGG GCACGGCCGT CAACAGCAGC AGCGTGACGG ATGCCATGCA CGCCGATGCC AGCCCGTGGG CCTGGCCGGT CGAGGTATTC ACCGTGCACG ACGAAACCGA CAAGCTGGTC ACTTCGAACG GCAAGCTGGA CGAAGCCGTG CGGAAGGCCG TCGAGGCGTT CAACGAGCAG GCCGAAGCGC CGCGCAACGC CGGGCTCGAC TACGACAGCG GCGGCTCTCG GTTCGTCGTG CGCGCCGAGA CCGTCGGCAC TGCGCTCGAT GCCGATAAGG TGGCCGAGAC GGTGAACGCG GCCGTCGCTG CGATGGGGTC GAGCGCGACG CTGTCCGAGG ATGCGTTGCA GCAGCCCACG CTGCTGTCCG ACGATGAGCG TCTGGCGAAG GCGGCCGATG AGGCGAACAA CCTGTTGAAA GCCGATTTCT CGCTGAAGCT GGGCGATACG CCCGTGGCGC AGGTGAATGC CGATGCCATC GCCGGTTGGG TGCGCTTGCA CGACGACGTG ACGGTGGGCG TCGACGAGGC GCTGGTGGCA GCCTGGGTGC AGGATCTGGC TTCGGCGTGC AACACCTACC AGGCGCGTCG CACGTTCACG CGCGCCGACG GCAAGGAGGT GACCGTGTCA GGCGGCGTAT ACGGCTGGAT CATCGACAAG GGCAAGCTGC AAGAGGCGGT GACGAACGGC GTGGGTTCCG CGCAGACCGG CGACATGGCC ATCCCTTGCG AACAGGAGGC GGGCGCCTAC GACGGGCTGC ACGGACGCGA TTGGGGCAAG CGCTACGTCG ACGTCGATCT GACGGAGCAG CACGCGCGCT TCTACGACGA CGAAGGCTCG CTCGCGTGGG AGTCCGACGT GGTCACCGGC ACGCCGGACG GCGAGCACGA CACCCCCGAG GGCGTCTACG TAATCAACGG CAAGGAGAGC CCGTCGAAGC TGATCGGCCA GATGAAGCCC GAGACGGGCG AGCCGGAATA CCAGACCGAG GTGAAGGCCT GGATGCCGTT CGTAGACAAC TACATCGGGT TTCATGATGC CGATTGGCAG CCCGCGTTCG GCGACTCGCG GTACAAGTCG GGCTACGGCA GCCACGGCTG CGTGAACCTG CCGCCGGAGA AGGCGGTGGA GCTGTACGAC CTTATCAAAG TGGGCGACGT CGTGGTCAGC CACTGGTAG
|
Protein sequence | MTPEHETRGK HEAQQPAPQP APRPAPQRTR RIDSIDPPLA DGAPESYVSF VPASRGGRPG PVRGGRPESA DGGRQGSSNR KPPFAKKTGL IAGGALVAVI AIVYLAGALV FMDRFMPNTT IMGKDVSLKT TAEVQDLLTD VAKSYQLSVS GEGFSLTLAS SDMGTAVNSS SVTDAMHADA SPWAWPVEVF TVHDETDKLV TSNGKLDEAV RKAVEAFNEQ AEAPRNAGLD YDSGGSRFVV RAETVGTALD ADKVAETVNA AVAAMGSSAT LSEDALQQPT LLSDDERLAK AADEANNLLK ADFSLKLGDT PVAQVNADAI AGWVRLHDDV TVGVDEALVA AWVQDLASAC NTYQARRTFT RADGKEVTVS GGVYGWIIDK GKLQEAVTNG VGSAQTGDMA IPCEQEAGAY DGLHGRDWGK RYVDVDLTEQ HARFYDDEGS LAWESDVVTG TPDGEHDTPE GVYVINGKES PSKLIGQMKP ETGEPEYQTE VKAWMPFVDN YIGFHDADWQ PAFGDSRYKS GYGSHGCVNL PPEKAVELYD LIKVGDVVVS HW
|
| |