Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0723 |
Symbol | |
ID | 8415013 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 910012 |
End bp | 911085 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645023694 |
Product | sortase, SrtB family |
Protein accession | YP_003181091 |
Protein GI | 257790485 |
COG category | [S] Function unknown |
COG ID | [COG4509] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03064] sortase, SrtB family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000699088 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGAGT ACCGACAGCA TCCCGGCCCT GCTCCAAGCG GGCGACCTGC GCAGCAGCGC GTGCCGAGGG GCGCCGCGCC CGACGTGCGG GCAGGGGCGC AACCGGGCGC CTACCGGCCG GCCACCGCGC AGCATGCCGC CTACCGCTCC GCTCAGCCTG CCCAGCGCAG ATCGGCTGGA GGCGGAGGCT ACCCGTATCG ACAGCAACCG ACCTATCAGC AGCCCGGTTC CGGACGGCCT CGGAAGAAGA GCGGGCCGTG GCGCGTGGTG TTCTGGATCG CGCTCGTGGT GTTCGTCGTG GCGCTGGCCG CGCTCGGTGC CATCGGCTTC AGTTACTGGC AGGGTCAGCA AACCTATAAC GATGTCGCGC GGGAGGGTTT CACGCCGCCC GACGATCTGT CGGCCACGTC GCTCGCCGAC TTCACGGTGG ACTGGGATGC GCTCAAGGCC ATCAATCCCG ACACGGTGGG GTGGATCTAC ATTCCCGGCA CCGTGGTGAA CTACCCCATC GTGCAAGCGG CCGACGACGA GAAGTACCTT ACGCACGACT TCAAGGGATC GGAAGGCTGG ATCGCCACGT TCGGCGCCAT CTTCCTGGCG GCCGAGAACA GCTCCGACTT CTCCGACCCG AACAACATCA TCTACGGCCA TCATCTGAAC GACGGCTCGA TGTTCGCCTG CGTCGCCGAT TTCAGCGATG CCGCCCAGTT CAACGACCAT CGCACGGTGT ACATCCTCAC GCCTGAGGGC AACTACGAGC TGAGCACGTT CGCGCTCGTG CATGTGGCGG CCGACGATCC GTTGGCACAG ATGCGGTTCG CCGATGAGGA CGAACGCGTG GCCTACGTGC AGGACAAGAT CGACCGATCG GTGGTGCCCG CTTCCGACAT ACCTGCTGCT TCCGATATCA AACACACCTT CGCCCTTGCC ACGTGCGACA ATCTGCCGTC GGACGGGCGT TACGTGCTGT ATTCGTACGT GAAGGCCAGC ACGGTGGGCG AGGGCGACGG CGACGTCATC GACCCCGATG CCGTCGCGGC CATCGATTCC GCAGAACAGG AGATCGCTTC ATGA
|
Protein sequence | MSEYRQHPGP APSGRPAQQR VPRGAAPDVR AGAQPGAYRP ATAQHAAYRS AQPAQRRSAG GGGYPYRQQP TYQQPGSGRP RKKSGPWRVV FWIALVVFVV ALAALGAIGF SYWQGQQTYN DVAREGFTPP DDLSATSLAD FTVDWDALKA INPDTVGWIY IPGTVVNYPI VQAADDEKYL THDFKGSEGW IATFGAIFLA AENSSDFSDP NNIIYGHHLN DGSMFACVAD FSDAAQFNDH RTVYILTPEG NYELSTFALV HVAADDPLAQ MRFADEDERV AYVQDKIDRS VVPASDIPAA SDIKHTFALA TCDNLPSDGR YVLYSYVKAS TVGEGDGDVI DPDAVAAIDS AEQEIAS
|
| |