Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0119 |
Symbol | |
ID | 8414402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 158817 |
End bp | 162161 |
Gene Length | 3345 bp |
Protein Length | 1114 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645023098 |
Product | M6 family metalloprotease domain protein |
Protein accession | YP_003180502 |
Protein GI | 257789896 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03296] M6 family metalloprotease domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.211845 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACGCT TCGCTCGCTC AAAGGCCGCG CGTTGGGCCG CGTTTTGGCT GACCGCGCTT CTGGCAGCAG CCTTCGCAGG TGCGCTCGCG CCCGCGTCGG CCCATGCGCA GGACGCGGCG CACTCGCCCT GCATCGCCAC CGTCGAGCAA CGCGCCGCCT ATGCCGCAGA CGGCACGCTG GACGAACGAG AAGCCTTCCA GGAAAGCCTC GGCAACGACG AGCCCTCCCC CGGCCTCGTC CAACAGGCTC TCGCGCGCGA GCAGGCGCAA AACGGCATCG CCCCGAACGC GGTTCCCGGC GACGACGGCG GCATGGCCGC TATCGGCAAC GCACACGTGG TGGCGCTGCG CGTGTCGTTC CCCGACCGCC CCTTCGCGGA AGGAGATACG CTGGAAGCGC TTCAGGCGCT CATAGGCCCG CGCGCCGTCG GCGAGGCCGC GCTGCCGAGC GCCGGCGGCT TTCCCTACGA GAACCTGCAC GACTACTACC TCCGCGCGTC GTACGGCGCG CTTACCGTCA CCGGCGAAGC GTTCGACTAC GCGGCCCAGC ACGAGCGCGA TTTCTACACG GCGAACATCG GGCAACTGTA CAAGGAGGCG CTCGACCACC TCGAGGCATC CGGCATCGAT CTCGCGCGAT TCGACGCGAA CGGCGACGGG CGCATCGATG CCGTCTACCT GCACTTCGCC GGCGGCGACA CCGGCTGGGG CAGCGTCTGG TGGAGCAACG AGCAGGTGCT CGACGTCCCC GACGCCGTGT ACGCTGACGG CACAGTGCGC CTATGGAACG CCGTGGCGCT GTCGAATCCG TGCGACCAAC CGTGGGCCGC GCAGACCATC ATCCACGAAA CGGGTCACGT GCTGGGCCTG CCCGACTACT ACCAGTACGC CAGCCAGCAA GGAGGCTCCG CCGACCGCAC GGGAATCCTC ACCTTCGACA TCATGATGCA GAACCAGGGC GATCACAATG GATTCTCGAA ATGGATGCTG GGCTGGCTGC CCGACAGCAA GATCACCCGC ATCTTCGCGA ACGAGACGGG CATCGACGTC AAACGCGACG GCAAGGTGGT GCAACACGTG GACGCAACCG CCGACGGCAG CTCGTCCGTA GAAGCGGCTT TAGAAGCGTT CGCTTCGAAC GACATCGATG AAACCGGCGG TATCGTCGTG GTGGCGAACC AAGACGCAAG CATGTTCTCG TCCTACTACC TGCTGCAGTA CGACCGCTTC GCCGGCAACC AAAGCGTTCG CTACGAGCAA GATGGCCAAC CCGCCGAGCT TCCGTCGGGA TTTCGCCTCT TCAGAGTTCA GGCCGACCTA GGATCGGGCG GACCGTATTT CGTTCATTCG AACACGTACG GAACCGTGCA CAACCAGCTG ATCGAGCTGG TCGACCCTGA TATGAACGAG CCGCACGCGG AAAGCACCGA CCTCGTCCCA GCCGCAATCG GCTCGAGAGA ATACGGCTGC ATGCTGTACG CAGGCGACGA GGTGTCTCCC CGAGGTTACC CTTCAACCAA CTTTTTCGAA AACGCCAACG TCGGATTCAC CGGCCTGACC ATCGCCGTCA CCGAAAGCCG TGCGGATGGC GGCACGCTGA ACATCTCGTA CTCGAACGCC GGCAAGCCGG AACCGCCCGA CGATTTCACG CTTACGCCCC TGTTCGACAG CGTCTCGAAC ACGGGGACGC TGTCGTTCGA AGCGTCCACG AAGCCTGCCC TCGCCACGCC GCTGCCCGCG GCAACGCAGC TCATCGTGGA CGGCCAGCCC CACGCCCTGC TGAACATCGA GACGGACGGC ACCACGGTCA GCCTTCCGTA CTTTCTCGAT GCCGGCGATA TCGGGCCTTC GTCAACGTGC GAGATCGTGT TCCCTGCCGG CATGTTCGTC TTGGCTCAAA CAGGCGGCAC GACTGCATAT TCTCCCGAAG TGCGCCTCGA GCTGAAAGCA AGCCCGCTGC TCACCCCCGT CGACTGCGCG GGCGGCTACC TGGGCACCGA GTACACCGAA GATTCCACGG TGCTCAGCAA CGTGTTCGCC TGCCCCGACG GAAGCAAACG GTTCTTCCAG ATAGCCGACG GCCAGCTCAA GCTGCACGCC ATCGACCCCG CCGATGTCGC ACGCGTATCC TCGCGCACGG TCGAGGGCGT TCCCGTTCCC GCCGTCTTGG ACTACCGTCC TTCGCTTCGC GTCGTGCCGC TATCGAACGA GACCGCGTTC GTGCTCATGC CTGCAGCGGG AGGCGAAAGC GCGGGATACT GGATCGATCT CCAGCGCGGA ACCGTTATCG CGTCCTATCC CTTCGACCAA GCCGCTTCCC TCTCGCTTGC AGGCAGCGGG TCCACCGTAT TAGCGGCGTC TTTCTATCCA GGCGGCGGCC GAACGATCGC CGCGCTCACC CCGCTTGAAG ACGGAACCGT GGAAGCGCGC TACGGATGGA CCCAGGCCGA AACCGTGGTT CCCGTCGACG CCGACGCGTT CGCCTTCGGG TTCATGGATG ACGCAGGCGC TTTGACGGAG GAAGCGCGCA TCCTGCCCGC CGACGCCATC GTCGCGGCGC TGCGCGATGG CGCAACCCTT CTCGAGCATG CCTCCGACAA TGGCGAGCTG TGCACGCCCG AGCGCGCCGC GGCCACGCTG CCTCTGCGCG CCAGCCGCAT GCTGCTGGCC GTCGAGCGCG CGAAGGACGG CTTCTACGCG CTGCTTTCTT CGGACTACTT GGATCCCGCG AAGCTGACGA ACCTGCTGAT CGAGTTCGAC AGCACGGGAG CGGAACGGGC ACGATCTCCT ATCGAGGTGG ACGGATCCGA CGTATACATG CAACTTCGGG TCGCTCGCCA CGGCACCGTG GCGCTGTCGC GGCGCATCGA CATGACGAAA CCGCCCATCT CCCGCAGCGA CGTTCTGTTC TGCGATGCGA ACCTGCAACC GCTCTCCGTG CTTACTACAG CATCCACCGG CTTGGGCGCA TGGCTCGACG ACGGACGCTG GCTCGACGTG GGAATGAGCG TTGCGGCGGC AAACGGGCCG AAGGGGTCGC CCGTCGCAAC GGAATACGGC GGCGCAACCG AAGGCGGCGA GGTCGACGAA TCCAAGCGCG CCCGATACGT GGTAACGACG CAACTTGACA AAACTCCTGC AAACCCAGGC GATGACCCCG CCGATCCCGG CATTCAACCG CAACCGCTCC CGCAACCAGC CGACGACCAG CCCGACAGCC CAAGCAAGCT CGCGCCTACC GGCGACCACG CGAATGCGCT CGAGCTTGCC GCCCTGGCAA CGGCAGCGCT GGCCACCGCG CTCTTCGCAC TCATGAGGCA AAATGAGGCA AAGGAATGGG GTTGA
|
Protein sequence | MQRFARSKAA RWAAFWLTAL LAAAFAGALA PASAHAQDAA HSPCIATVEQ RAAYAADGTL DEREAFQESL GNDEPSPGLV QQALAREQAQ NGIAPNAVPG DDGGMAAIGN AHVVALRVSF PDRPFAEGDT LEALQALIGP RAVGEAALPS AGGFPYENLH DYYLRASYGA LTVTGEAFDY AAQHERDFYT ANIGQLYKEA LDHLEASGID LARFDANGDG RIDAVYLHFA GGDTGWGSVW WSNEQVLDVP DAVYADGTVR LWNAVALSNP CDQPWAAQTI IHETGHVLGL PDYYQYASQQ GGSADRTGIL TFDIMMQNQG DHNGFSKWML GWLPDSKITR IFANETGIDV KRDGKVVQHV DATADGSSSV EAALEAFASN DIDETGGIVV VANQDASMFS SYYLLQYDRF AGNQSVRYEQ DGQPAELPSG FRLFRVQADL GSGGPYFVHS NTYGTVHNQL IELVDPDMNE PHAESTDLVP AAIGSREYGC MLYAGDEVSP RGYPSTNFFE NANVGFTGLT IAVTESRADG GTLNISYSNA GKPEPPDDFT LTPLFDSVSN TGTLSFEAST KPALATPLPA ATQLIVDGQP HALLNIETDG TTVSLPYFLD AGDIGPSSTC EIVFPAGMFV LAQTGGTTAY SPEVRLELKA SPLLTPVDCA GGYLGTEYTE DSTVLSNVFA CPDGSKRFFQ IADGQLKLHA IDPADVARVS SRTVEGVPVP AVLDYRPSLR VVPLSNETAF VLMPAAGGES AGYWIDLQRG TVIASYPFDQ AASLSLAGSG STVLAASFYP GGGRTIAALT PLEDGTVEAR YGWTQAETVV PVDADAFAFG FMDDAGALTE EARILPADAI VAALRDGATL LEHASDNGEL CTPERAAATL PLRASRMLLA VERAKDGFYA LLSSDYLDPA KLTNLLIEFD STGAERARSP IEVDGSDVYM QLRVARHGTV ALSRRIDMTK PPISRSDVLF CDANLQPLSV LTTASTGLGA WLDDGRWLDV GMSVAAANGP KGSPVATEYG GATEGGEVDE SKRARYVVTT QLDKTPANPG DDPADPGIQP QPLPQPADDQ PDSPSKLAPT GDHANALELA ALATAALATA LFALMRQNEA KEWG
|
| |