Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0729 |
Symbol | |
ID | 8415019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 917566 |
End bp | 919200 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 645023700 |
Product | Integrase catalytic region |
Protein accession | YP_003181097 |
Protein GI | 257790491 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2801] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTACG GCAAAGAATA CAAGGAGGAG GTCCTGAGCA GGTTCCACGC GAGCGGCATG TCGATGCGCG CGGCGTGCTC GAGCCTCGAG GGGTTCCCGT GCGCGGCGAC GCTCTCGGCC TTCGCGCGCG AGGAGGGGGC CGGCCTCCTG CGCCCGCCCG CGCTCGCGGT GCCGGGCAGG TGCGAGGGCC GCCGGGCATG GGAGCCCTAC CCCCTCGGGA CGAAGCGGGA GGCGATGCGG CTCCTCGCGG GCGGCATGGA GCCGCGCTTC GTCGCCGGGC GCCTCGGGAT CGCGAGCGCC GCCCCCGTCC GCCTGTGGGC CTCCAGGCTG GGGCGCCTCG ACGGGCTCGA CGGCCGGTCC CGTCCCGACG GCGAGCGCGA GCGCGGGGAA GCTGTTACGA TGTTCGCGCG AGGGTCGTCC GTCTCCGAGA TAGCCGGACG GATCGGCGCG GACAGGAGGA CCGTGCGCCG CTGGCTGGAC AAGGCCGGCG TCGAGCGGAA GCGCGCGACG AAGGGGAAGG GCGGCGAGGG CGTGGCGAAG GAAGAAGGCG GCGAGCGGGG CGAATGGTCC CGCGCATGGG GCGACCTCCC CGAAGGCGAC CCCGTCGAGC GGGCGCGGCT GGCCGAGGTC AGGCTCGCGG AGGCGCTGGC GGTGTTGGAC GTCCTAAAAG CACCAGGCCC GGGCTCTTTG AGCAATTCGG AGAAGCGCCG GGCGGGCGAG AGGGCGAGGG CGATGGCGGC GAGGGCGAGG GTCGATGACG TCCTGAGGGA TTTCCGCATC GCCAGGAGCA CGTACTTCTC GCAGGCGGCG ATGGCGGCCA GGCCCGACAG GCACGCGGCC CTGCGGGCGC GCGTGCGCGC GGCCTTCGAG GGCTCGAAGG GCCGCTACGG GTCGCTGAGC GTGTGGGCGG CCCTGCGGCG GGGCGAGGGC GCGCCCGTGC GCGCCCGCGA CCTCGCGCCC GGGGACATGG AGGCCCCCGT CGTCGTCTCC GAGAAGGTCG TGCGCCGGAT CATGCGCGAG GAGGGGCTCG TCCCGGTCCA GGTCAAGGAG CGCCGGCGCC ACAGCTCCTA CGCGGGCGAG ACCGACGAGC GCCCCGCGAA CCTGCCGCTT CGAGAGGACG GGACGCACGG CTTCCGCGCC GACGCGCCGG GCAGGCTCGT CGTGACCGAC GTGACCGAGT TCGACCTCGG CGGCCTCAAG GTCTACCTCT CCCCGATCAT AGACTGCTTC GACGGCTGCC CGGTGGCGTG GCGGACGTCG ACGCGCCCGG ACGACGAGCT GACGGCGGGC TCGCTGGAGG ACGCGCTCGG GCGCCTGGAG GAGGGCTGCG CCGTCCACAC CGACGGCGGC GGCAACTACC GCTCCGCCAG ATGGAAGGGC GTCTGCGAGG CCAACGGCCT CGTCAGGTCG ATGTCGCGCA AGGCCAAGAG CCCCGACAAC GCGAGGGCGG AGGGCTTCTT CGGGACGCTC AAGCAGGAGT TCTTCTACGC GAGGGACTGG AAGGGGACGA CGAAGGGGAG CTTCGTGCGG GCCCTCGACG AGTACATCGT GTGGTATCGT GACGAGAAGA TCAAGAGATC GCTCGGATGG AAGACGATAG CGGCCCATAG GGCGGCGCTC GCCGCAGCCG CGTAG
|
Protein sequence | MAYGKEYKEE VLSRFHASGM SMRAACSSLE GFPCAATLSA FAREEGAGLL RPPALAVPGR CEGRRAWEPY PLGTKREAMR LLAGGMEPRF VAGRLGIASA APVRLWASRL GRLDGLDGRS RPDGERERGE AVTMFARGSS VSEIAGRIGA DRRTVRRWLD KAGVERKRAT KGKGGEGVAK EEGGERGEWS RAWGDLPEGD PVERARLAEV RLAEALAVLD VLKAPGPGSL SNSEKRRAGE RARAMAARAR VDDVLRDFRI ARSTYFSQAA MAARPDRHAA LRARVRAAFE GSKGRYGSLS VWAALRRGEG APVRARDLAP GDMEAPVVVS EKVVRRIMRE EGLVPVQVKE RRRHSSYAGE TDERPANLPL REDGTHGFRA DAPGRLVVTD VTEFDLGGLK VYLSPIIDCF DGCPVAWRTS TRPDDELTAG SLEDALGRLE EGCAVHTDGG GNYRSARWKG VCEANGLVRS MSRKAKSPDN ARAEGFFGTL KQEFFYARDW KGTTKGSFVR ALDEYIVWYR DEKIKRSLGW KTIAAHRAAL AAAA
|
| |