Gene Information |
Locus tag | Elen_0480 |
Symbol | |
ID | 8414764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 612262 |
End bp | 613482 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645023451 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_003180854 |
Protein GI | 257790248 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
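The coordinates, gene length, and protein length listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 612262
end_bp = 613482

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1221
print(protein_length)  # 406
```

Both values agree with the "Gene Length" and "Protein Length" rows of the table.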
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCTGCA TCGGGATAGA CATCGCCAAG AGAAGCCACG TCGCAAGCGC CGTCGACGAG GACGGGGGCA CCGTCGTCGA GTCGTTCAAG TTCAACAACA CGGCGAGCGG CTTCGCGAAG ATGCTGAAGA GGCTTCGCAA GGCTGACGCG ACCCCGGAGA ACAGCTTGGT CGGAATGGAA GCCACGGGGC GCTACTGGGT GGCCCTGTTC GACTTCCTCT CCTCCCACGG CTACGAGGCC GCGGTGATCA ACCCCGTCCA GACCGACGCC TTCCGCGACG TCTGGACGGT CCGCAAGGTC AAGACCGACG CCCTTGACGC CGCGATGATC GCCGACCTCG TCCGCTACAA GCGCTACTCG CCCTCGGCGC TCGGCGACGA GGCGACCGAG GAGCTGCGCA ACCTCGCCCG CTACAGGATG TCGCTCGTGG AGCGCTCCAC AGCCTTGAAG AACAGGGCGA CGGCCATCCT CGATAGGACG TTCCCCGAGC TTGCAGGCCT GTTCAGCAAC GCGTACTGTC CGACCCGCCG CGAGCTGCTG CAGCACTGCG CCACGCCCGA CCAGGTGCTC GGAACCGACG TCGGGACGCT CGAGCGCATC CTCAGGGAGG CTTCGCGCGG GAGGTGCGGG CGAGCCAAGG CCGAGAGGCT CAAAGCGCTT GCCAAGGAAT CCGTGGGCGT CGGGTTCGGC TCCAAGACGC TCGCCTTCGA GGCGAGGCTT TTGATGGAGG AGCTCGACTT CGTGCAGGAC CAGGTCAAGG AGGTCGAAAG GGAGCTGGCC CGCCTGCTGG AGCGGACGCA GGGCAAGTGG CTCGTCACCA TCCCCGGCAT AGACGTCGCC CTCGCCTCCG TCATCGCGGG CGAGATCGGC GACCCGAACG CCTTCGACGA CCCCCACAGG CTCATGGGTT ATGCCGGCAT GGACCCCACG AGAAGCGAGT CCGGCGAGAC CGTGAGGTCC GACGGGCGCA TGTCCAAGCG AGGGCCCGGG GCGCTGCGAT GGGCGTTCAT GCAGGCCGCC GACTGCGCCA GGAAGAACGA CCCCTACTTC GGAGACTATT ACGACAGGAA GAGGGGCGTG GACGGGAAGC ACCACTACGT CGCCCTCTCG GGCGTCGCCC GCAAGCTCAT GGGCGTGGCG CTGGCGGTGA TGAAGGAAGG GCGGCCCTAC GAGCCGGCCC CTCCCCGCAA CCACCGCCCC GGCCACCTGA AGGAGGCGTG A
|
Protein sequence | MICIGIDIAK RSHVASAVDE DGGTVVESFK FNNTASGFAK MLKRLRKADA TPENSLVGME ATGRYWVALF DFLSSHGYEA AVINPVQTDA FRDVWTVRKV KTDALDAAMI ADLVRYKRYS PSALGDEATE ELRNLARYRM SLVERSTALK NRATAILDRT FPELAGLFSN AYCPTRRELL QHCATPDQVL GTDVGTLERI LREASRGRCG RAKAERLKAL AKESVGVGFG SKTLAFEARL LMEELDFVQD QVKEVERELA RLLERTQGKW LVTIPGIDVA LASVIAGEIG DPNAFDDPHR LMGYAGMDPT RSESGETVRS DGRMSKRGPG ALRWAFMQAA DCARKNDPYF GDYYDRKRGV DGKHHYVALS GVARKLMGVA LAVMKEGRPY EPAPPRNHRP GHLKEA
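The relationship between the gene sequence and the protein sequence above can be illustrated with a short Python sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand) and that whitespace between the 10-character groups is only formatting. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first 30 bases of the record are used as input.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11
# assigns the same amino acids to codons as the standard code; it
# differs only in which codons may serve as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove whitespace between sequence groups and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base groups of the gene sequence in this record.
prefix = "ATGATCTGCA TCGGGATAGA CATCGCCAAG"
print(translate(prefix))    # MICIGIDIAK
print(gc_fraction(prefix))  # 0.5
```

The translated prefix matches the first ten residues of the protein sequence. Note that this sketch does not handle alternative start codons, which table 11 permits; that is not an issue here because the gene starts with ATG.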