Gene Elen_0119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0119 
Symbol 
ID8414402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp158817 
End bp162161 
Gene Length3345 bp 
Protein Length1114 aa 
Translation table11 
GC content65% 
IMG OID645023098 
ProductM6 family metalloprotease domain protein 
Protein accessionYP_003180502 
Protein GI257789896 
COG category 
COG ID 
TIGRFAM ID[TIGR03296] M6 family metalloprotease domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.211845 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACGCT TCGCTCGCTC AAAGGCCGCG CGTTGGGCCG CGTTTTGGCT GACCGCGCTT 
CTGGCAGCAG CCTTCGCAGG TGCGCTCGCG CCCGCGTCGG CCCATGCGCA GGACGCGGCG
CACTCGCCCT GCATCGCCAC CGTCGAGCAA CGCGCCGCCT ATGCCGCAGA CGGCACGCTG
GACGAACGAG AAGCCTTCCA GGAAAGCCTC GGCAACGACG AGCCCTCCCC CGGCCTCGTC
CAACAGGCTC TCGCGCGCGA GCAGGCGCAA AACGGCATCG CCCCGAACGC GGTTCCCGGC
GACGACGGCG GCATGGCCGC TATCGGCAAC GCACACGTGG TGGCGCTGCG CGTGTCGTTC
CCCGACCGCC CCTTCGCGGA AGGAGATACG CTGGAAGCGC TTCAGGCGCT CATAGGCCCG
CGCGCCGTCG GCGAGGCCGC GCTGCCGAGC GCCGGCGGCT TTCCCTACGA GAACCTGCAC
GACTACTACC TCCGCGCGTC GTACGGCGCG CTTACCGTCA CCGGCGAAGC GTTCGACTAC
GCGGCCCAGC ACGAGCGCGA TTTCTACACG GCGAACATCG GGCAACTGTA CAAGGAGGCG
CTCGACCACC TCGAGGCATC CGGCATCGAT CTCGCGCGAT TCGACGCGAA CGGCGACGGG
CGCATCGATG CCGTCTACCT GCACTTCGCC GGCGGCGACA CCGGCTGGGG CAGCGTCTGG
TGGAGCAACG AGCAGGTGCT CGACGTCCCC GACGCCGTGT ACGCTGACGG CACAGTGCGC
CTATGGAACG CCGTGGCGCT GTCGAATCCG TGCGACCAAC CGTGGGCCGC GCAGACCATC
ATCCACGAAA CGGGTCACGT GCTGGGCCTG CCCGACTACT ACCAGTACGC CAGCCAGCAA
GGAGGCTCCG CCGACCGCAC GGGAATCCTC ACCTTCGACA TCATGATGCA GAACCAGGGC
GATCACAATG GATTCTCGAA ATGGATGCTG GGCTGGCTGC CCGACAGCAA GATCACCCGC
ATCTTCGCGA ACGAGACGGG CATCGACGTC AAACGCGACG GCAAGGTGGT GCAACACGTG
GACGCAACCG CCGACGGCAG CTCGTCCGTA GAAGCGGCTT TAGAAGCGTT CGCTTCGAAC
GACATCGATG AAACCGGCGG TATCGTCGTG GTGGCGAACC AAGACGCAAG CATGTTCTCG
TCCTACTACC TGCTGCAGTA CGACCGCTTC GCCGGCAACC AAAGCGTTCG CTACGAGCAA
GATGGCCAAC CCGCCGAGCT TCCGTCGGGA TTTCGCCTCT TCAGAGTTCA GGCCGACCTA
GGATCGGGCG GACCGTATTT CGTTCATTCG AACACGTACG GAACCGTGCA CAACCAGCTG
ATCGAGCTGG TCGACCCTGA TATGAACGAG CCGCACGCGG AAAGCACCGA CCTCGTCCCA
GCCGCAATCG GCTCGAGAGA ATACGGCTGC ATGCTGTACG CAGGCGACGA GGTGTCTCCC
CGAGGTTACC CTTCAACCAA CTTTTTCGAA AACGCCAACG TCGGATTCAC CGGCCTGACC
ATCGCCGTCA CCGAAAGCCG TGCGGATGGC GGCACGCTGA ACATCTCGTA CTCGAACGCC
GGCAAGCCGG AACCGCCCGA CGATTTCACG CTTACGCCCC TGTTCGACAG CGTCTCGAAC
ACGGGGACGC TGTCGTTCGA AGCGTCCACG AAGCCTGCCC TCGCCACGCC GCTGCCCGCG
GCAACGCAGC TCATCGTGGA CGGCCAGCCC CACGCCCTGC TGAACATCGA GACGGACGGC
ACCACGGTCA GCCTTCCGTA CTTTCTCGAT GCCGGCGATA TCGGGCCTTC GTCAACGTGC
GAGATCGTGT TCCCTGCCGG CATGTTCGTC TTGGCTCAAA CAGGCGGCAC GACTGCATAT
TCTCCCGAAG TGCGCCTCGA GCTGAAAGCA AGCCCGCTGC TCACCCCCGT CGACTGCGCG
GGCGGCTACC TGGGCACCGA GTACACCGAA GATTCCACGG TGCTCAGCAA CGTGTTCGCC
TGCCCCGACG GAAGCAAACG GTTCTTCCAG ATAGCCGACG GCCAGCTCAA GCTGCACGCC
ATCGACCCCG CCGATGTCGC ACGCGTATCC TCGCGCACGG TCGAGGGCGT TCCCGTTCCC
GCCGTCTTGG ACTACCGTCC TTCGCTTCGC GTCGTGCCGC TATCGAACGA GACCGCGTTC
GTGCTCATGC CTGCAGCGGG AGGCGAAAGC GCGGGATACT GGATCGATCT CCAGCGCGGA
ACCGTTATCG CGTCCTATCC CTTCGACCAA GCCGCTTCCC TCTCGCTTGC AGGCAGCGGG
TCCACCGTAT TAGCGGCGTC TTTCTATCCA GGCGGCGGCC GAACGATCGC CGCGCTCACC
CCGCTTGAAG ACGGAACCGT GGAAGCGCGC TACGGATGGA CCCAGGCCGA AACCGTGGTT
CCCGTCGACG CCGACGCGTT CGCCTTCGGG TTCATGGATG ACGCAGGCGC TTTGACGGAG
GAAGCGCGCA TCCTGCCCGC CGACGCCATC GTCGCGGCGC TGCGCGATGG CGCAACCCTT
CTCGAGCATG CCTCCGACAA TGGCGAGCTG TGCACGCCCG AGCGCGCCGC GGCCACGCTG
CCTCTGCGCG CCAGCCGCAT GCTGCTGGCC GTCGAGCGCG CGAAGGACGG CTTCTACGCG
CTGCTTTCTT CGGACTACTT GGATCCCGCG AAGCTGACGA ACCTGCTGAT CGAGTTCGAC
AGCACGGGAG CGGAACGGGC ACGATCTCCT ATCGAGGTGG ACGGATCCGA CGTATACATG
CAACTTCGGG TCGCTCGCCA CGGCACCGTG GCGCTGTCGC GGCGCATCGA CATGACGAAA
CCGCCCATCT CCCGCAGCGA CGTTCTGTTC TGCGATGCGA ACCTGCAACC GCTCTCCGTG
CTTACTACAG CATCCACCGG CTTGGGCGCA TGGCTCGACG ACGGACGCTG GCTCGACGTG
GGAATGAGCG TTGCGGCGGC AAACGGGCCG AAGGGGTCGC CCGTCGCAAC GGAATACGGC
GGCGCAACCG AAGGCGGCGA GGTCGACGAA TCCAAGCGCG CCCGATACGT GGTAACGACG
CAACTTGACA AAACTCCTGC AAACCCAGGC GATGACCCCG CCGATCCCGG CATTCAACCG
CAACCGCTCC CGCAACCAGC CGACGACCAG CCCGACAGCC CAAGCAAGCT CGCGCCTACC
GGCGACCACG CGAATGCGCT CGAGCTTGCC GCCCTGGCAA CGGCAGCGCT GGCCACCGCG
CTCTTCGCAC TCATGAGGCA AAATGAGGCA AAGGAATGGG GTTGA
 
Protein sequence
MQRFARSKAA RWAAFWLTAL LAAAFAGALA PASAHAQDAA HSPCIATVEQ RAAYAADGTL 
DEREAFQESL GNDEPSPGLV QQALAREQAQ NGIAPNAVPG DDGGMAAIGN AHVVALRVSF
PDRPFAEGDT LEALQALIGP RAVGEAALPS AGGFPYENLH DYYLRASYGA LTVTGEAFDY
AAQHERDFYT ANIGQLYKEA LDHLEASGID LARFDANGDG RIDAVYLHFA GGDTGWGSVW
WSNEQVLDVP DAVYADGTVR LWNAVALSNP CDQPWAAQTI IHETGHVLGL PDYYQYASQQ
GGSADRTGIL TFDIMMQNQG DHNGFSKWML GWLPDSKITR IFANETGIDV KRDGKVVQHV
DATADGSSSV EAALEAFASN DIDETGGIVV VANQDASMFS SYYLLQYDRF AGNQSVRYEQ
DGQPAELPSG FRLFRVQADL GSGGPYFVHS NTYGTVHNQL IELVDPDMNE PHAESTDLVP
AAIGSREYGC MLYAGDEVSP RGYPSTNFFE NANVGFTGLT IAVTESRADG GTLNISYSNA
GKPEPPDDFT LTPLFDSVSN TGTLSFEAST KPALATPLPA ATQLIVDGQP HALLNIETDG
TTVSLPYFLD AGDIGPSSTC EIVFPAGMFV LAQTGGTTAY SPEVRLELKA SPLLTPVDCA
GGYLGTEYTE DSTVLSNVFA CPDGSKRFFQ IADGQLKLHA IDPADVARVS SRTVEGVPVP
AVLDYRPSLR VVPLSNETAF VLMPAAGGES AGYWIDLQRG TVIASYPFDQ AASLSLAGSG
STVLAASFYP GGGRTIAALT PLEDGTVEAR YGWTQAETVV PVDADAFAFG FMDDAGALTE
EARILPADAI VAALRDGATL LEHASDNGEL CTPERAAATL PLRASRMLLA VERAKDGFYA
LLSSDYLDPA KLTNLLIEFD STGAERARSP IEVDGSDVYM QLRVARHGTV ALSRRIDMTK
PPISRSDVLF CDANLQPLSV LTTASTGLGA WLDDGRWLDV GMSVAAANGP KGSPVATEYG
GATEGGEVDE SKRARYVVTT QLDKTPANPG DDPADPGIQP QPLPQPADDQ PDSPSKLAPT
GDHANALELA ALATAALATA LFALMRQNEA KEWG