Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2242 |
Symbol | |
ID | 8416565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2634977 |
End bp | 2636101 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645025228 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_003182592 |
Protein GI | 257791986 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000922808 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.737465 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTATG TTACCTCAAT CGGCCTCGAC GTACACGCAC GTTCGATAAC AGCTGTCGCA TTCGATCCTG CAACAGGAGA AGTGCGTTCG AAGAGATTCG GCTATGCTCC GGCCGAAGTC GCTGCATGGA CGCTAGCCTT CGATTCCCCT AAAGCGGTTT ACGAATCCGG CGTGACCGGA TTCCACCTTG TCCGCGCCTT GCGAGGTCTT GGGGTCGACT GCATCGTCGG CGCCGTGTCC AAGATGCACA AGCCTGCAGC CGACAAGCGC AAGAAGAACG ATCGCAACGA TGCCGAGTTC TTGGCGCGTC AGCTTGCCGC CCGCAACATC AAAGAGGTCT GGATACCCGA TGAGGAGTGC GAGGCGGCAC GCGACCTCTC CCGCGTCTTG GAAGACGTCC GCGAAGACCT CACGCGCGCG AAGCATCGCC TAACCCACTT GCTCATACGG CATGGCTACG CCTTCGACGA GATCGGGCCG GACGGCAAGC GCAAGGGCAA GTGGACGAGG GCGCATTGGG AGTGGATACG AAGCATCGAG CTTTCCGAAG CGGCGGCGCG CGACGCCCTT GACTTCTACG TTTCCGAAGT GCGCCATATC GAGGCCCAGA AGAAGGCCAT CGAAAAAGAA ATCGCCGACT ACGCCCGTGC CGATCGTTGG CGTGGGCGGG TGGAAGCCCT TCGCCGTCTC AAAGGCATAG AGACCATGAC GGCGTTTTCC CTTGTGGCGG AAGCGGGGGT GTTCTCTCGC TTCGAGAGCG CCCGCGCATT CTCGTCCTGG CTCGGGCTCA CTCCTTCCGA GCATTCCAGC GGAGAGAGGG TGAGCCGCGG CGGCATTATC AAAGCCGGCA ACACCCATCT TCGAAAGCAG CTCGTAGAGT CGTCGTGGCA CTACACGAGG GCTACGACCA AGCGCAAGAG GGATCAGCGG GGAGAAGAGG TGCCCCTTGC AATCGAAAAC CATGCCGCCC GGGGGCGTGA AGCGCCTTGC AGAGAGGCGG GCTCACTTTG CAAGGCGCAA TCTTCGCCCG GCGGCGGCCA ACGCGGCCAC CGCAAGAGAG CTCTCCTGCT GGGTATGGGT TGTGGGCTGC ATGTCGGAAG GGGCTCTCGT ATAGAAACTC GTTAA
|
Protein sequence | MSYVTSIGLD VHARSITAVA FDPATGEVRS KRFGYAPAEV AAWTLAFDSP KAVYESGVTG FHLVRALRGL GVDCIVGAVS KMHKPAADKR KKNDRNDAEF LARQLAARNI KEVWIPDEEC EAARDLSRVL EDVREDLTRA KHRLTHLLIR HGYAFDEIGP DGKRKGKWTR AHWEWIRSIE LSEAAARDAL DFYVSEVRHI EAQKKAIEKE IADYARADRW RGRVEALRRL KGIETMTAFS LVAEAGVFSR FESARAFSSW LGLTPSEHSS GERVSRGGII KAGNTHLRKQ LVESSWHYTR ATTKRKRDQR GEEVPLAIEN HAARGREAPC REAGSLCKAQ SSPGGGQRGH RKRALLLGMG CGLHVGRGSR IETR
|
| |