Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1535 |
Symbol | |
ID | 8415833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1827978 |
End bp | 1830881 |
Gene Length | 2904 bp |
Protein Length | 967 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645024503 |
Product | protein of unknown function DUF214 |
Protein accession | YP_003181892 |
Protein GI | 257791286 |
COG category | [V] Defense mechanisms |
COG ID | [COG0577] ABC-type antimicrobial peptide transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.960064 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.0795907 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGGCA TATTCACGCG TTTCACCCTG CGCTCGCTCG CTCAGAACCG CGTGCGCACG GCCGTGACCG TCGTCGGCAT CGCGCTGTCC ACGGCGCTTC TGGCCGCCGT GCTGACCAGC GTGGCAAGCG TGCAGCAGGG CTTGCTCGAG CGCACGATGG TAACTGAAGG ATCATGGCAC GTGTTTTCCC CCGACGTGCC CGCTCAGGGC ATCGATGCGC TCGTGGAAAG CGACGCCGTG ACCGACCTCG CCACCTTCCG CGACCTCGGA TCGGCCGCAC TCGCGCCCGA CGACGCCAAC CGTCTGGGTG CGTTCATCGC GCTGAAGACG ACGCCGACCA CCGTCAAGGG CGTCGACGAG CCGGGCGGCG CGCCGTACTC CCTGATGCCC GAGCTGGCAA GCGGGCGATG GCCCGAGACA GCGGACGAGA TCGTGCTGCC CGACTATCTC CAAGGCGAGG AGCTGGGCGC GGGAGGCGCC GAGGGCGTTG TTTCGAACGG CCCCCTTGAA ACGGGTTCCA CCCTCACGGG AGGTTTTGGA AACCTCGCAG CCGCCGAAGA CGGCGCGCGC CTCGCCGACG GAAACGAGAC GCGCGTCGAG AACGCCCGCG AGCGCACGCT GACGGTTACC GGATTCTACA AGCGACAAGC GCCTTTCCTC GCCAACAACT ACACGGCCTC TTCCGCGCCC AGCGTCGCCC TCACCGTGGC CGACGCGTCT GAGGAAGGAG CGGCTGCGGG CGCGTATCTG GTCACGCAAG GTCTGGGAAC CCTCGACGAG ATGAAGGCCT TCTTCGCCGA CGCGACCGGA TTGGACGATA CGGCCGCCAC TCTGTACCAC ACCAGCCTGT TCTCGTACTT GGGCATCTCC GACGGCAGGC CCATCTGGGG AAGCCTCTGG GCGGTTGCCG CCGTGTTGGC CATCGTCATC GTGGCGGCGT CCGCATCGCT GATCTACAAC GCATTCGCCA TCTCGGTGGC CGAGCGTACG CGGCAGTTCG GACTGCTGGC CTCTCTCGGC GCTTCGAAGC GGCAACTGCG CAGCACCGTG CTGGCAGAGG CGCTGCTGCT GGGAGCCGTC GGCGTGCCGA TTGGCCTCTT AGCGGGCGTG GCGGGCACGG CAGGCGCGTT CTCGTCGTCG CAGGAGGCGT TCGCTGCGAT GTTGGGATCG GGATCGGCGG GGCTCGCGGT GCACGTGGAT GCCGGAGCGT TGGCTGCGGC CGCGGCACTG TCGATAGCCA CTTTGCTGGT GAGCGCCTGG GTGCCGTCGG CCCGCGCCGC CCGCGTCTCA GCCGTCGACG CCATCCGCCA AACGCAGGAC GTGCGCCTCT CGAAGCGCGC AGAACGGTCG GCACCATCGC ACGCGCCCGA CGTCGGGCGC GTGAAGCTGG GCATCGCGGG CAGGCTGTTC GGCATCCCCG GGTTCGTCGC ACACCGCAAC CTGTCGCGCT CGGCGACGCG CGGGCGCACC GTGGTGGCAT CGCTCGCGGT CAGCGTGACC CTCGTGGTGG TCGCCGGCAG CACGGCGCTG TACCTCGCGC CGCTGTCCGA CCGCGCCAGC AGCTCGCGCG GCGCTGGGAG CGGAGCCGAC ATCGTGGTGT CGGCGCATCC CGACTACACC ACCCCGCGCG AAGGGAACGA CCTGTCCGAC TACGCCGCCG AATACGACGA GTTCCTGGCG CGGGCGAGTG AGATCGAGGG CTTGCAGCTC ATAGGCTCGT GCCGCCAAGG TCAGGCCGAA AGCGTCGTCG ACGGTCGCAT GATATCGCAA GAGGCGCGCG CTGCTCGGCA GGCTTACAAC TCCCAGACGA GCGCCGACTG GGTGCCCGAC AGCTTCGGCG AGGACGGCGA TTACTACGGG GCCCTGTACA CCTTCTTCGT CGACGATGCC TCGTGGCGCG CGCTGCTGGA CGAGCTCGAC CTGGACGTAG CCGCCTACAC CGACCCGGAG AACCCGCGCG CCATCGGCCT GAACACCTAT CAGGACAGGA TGCCCGACGG CACGTACGTG TCCACGAAGC CCTTCGCGGG CACCGGCGCC GTCGACCTGT ACGTCACCGA GGAGCGCGAA GGATTCTCCA ACATGGGCTT GCAGGAAGGC CCGGACGGAC GGCCCCTGGT GGGGTACCTC GACCGGGAGG CCGGGGTCAG CGGGACGGCC GACATCGTCA CCGCCCCGAT CGACGAGGCG GCCACGGCTT TCCGCATCGA GATAGGAGCG CTCTGCGACG AGGAGCCGGC CGTGCTGAAC GCCATCGCGG CCAACAACCA CTTCCCCTCG ATCATCCTGC CGGAGAGCGT TGCGGAGAGA GCCGCAGGGC TGGGCGACTA CCACTCGAAC CCGTACACGT ACTCGTTCGC CGGAGCCTCG TTCACGGCGG AGGATCACGC GAAGGCTTCC GACGAGCTGG AGGCGCTGGC GCGCGACTTG GGCGACGTCG TGGTCAACGT CAGCGACATC GAGAATGCGG CGCGCCAGAA CCGCCTGATC GCGCAGGCCT TCCAGCTGTT CGTGCTGTGC TTCTCCGTCA TCACGACGCT CATCGCCGTG GCCAACGTGT TCAACACGCT GGCGAACGGG ATCATCCTGC GCACGCGCGA GTTCGCCGCG CTGCGTTCCA TCGGCATGGG CAACCGCGCC TTCGCCCGAA TGCTGGCCTA CGAGTGCGCG AGCTACGCCC TGCGCGGTTT GGGGATAGGG CTGGCGGCAG CCGTAGCGGT TACATTCGCG CTGTTTGCAG CAACGTCCAT GGCGTTCGCC GGCCTCGAGT TCACGCTGCC GTGGGACTAC GTGGCTCTCG CCGTGGCCAT CGTGGCCGTC GTGCTGGCGC TCAGCGTGGC CTACGCGCTG CGCCGCTCGC ACGCCTCCAA CATCGTGGAG GCTCTGAGGT CGGACGCGAT CTAG
|
Protein sequence | MAGIFTRFTL RSLAQNRVRT AVTVVGIALS TALLAAVLTS VASVQQGLLE RTMVTEGSWH VFSPDVPAQG IDALVESDAV TDLATFRDLG SAALAPDDAN RLGAFIALKT TPTTVKGVDE PGGAPYSLMP ELASGRWPET ADEIVLPDYL QGEELGAGGA EGVVSNGPLE TGSTLTGGFG NLAAAEDGAR LADGNETRVE NARERTLTVT GFYKRQAPFL ANNYTASSAP SVALTVADAS EEGAAAGAYL VTQGLGTLDE MKAFFADATG LDDTAATLYH TSLFSYLGIS DGRPIWGSLW AVAAVLAIVI VAASASLIYN AFAISVAERT RQFGLLASLG ASKRQLRSTV LAEALLLGAV GVPIGLLAGV AGTAGAFSSS QEAFAAMLGS GSAGLAVHVD AGALAAAAAL SIATLLVSAW VPSARAARVS AVDAIRQTQD VRLSKRAERS APSHAPDVGR VKLGIAGRLF GIPGFVAHRN LSRSATRGRT VVASLAVSVT LVVVAGSTAL YLAPLSDRAS SSRGAGSGAD IVVSAHPDYT TPREGNDLSD YAAEYDEFLA RASEIEGLQL IGSCRQGQAE SVVDGRMISQ EARAARQAYN SQTSADWVPD SFGEDGDYYG ALYTFFVDDA SWRALLDELD LDVAAYTDPE NPRAIGLNTY QDRMPDGTYV STKPFAGTGA VDLYVTEERE GFSNMGLQEG PDGRPLVGYL DREAGVSGTA DIVTAPIDEA ATAFRIEIGA LCDEEPAVLN AIAANNHFPS IILPESVAER AAGLGDYHSN PYTYSFAGAS FTAEDHAKAS DELEALARDL GDVVVNVSDI ENAARQNRLI AQAFQLFVLC FSVITTLIAV ANVFNTLANG IILRTREFAA LRSIGMGNRA FARMLAYECA SYALRGLGIG LAAAVAVTFA LFAATSMAFA GLEFTLPWDY VALAVAIVAV VLALSVAYAL RRSHASNIVE ALRSDAI
|
| |