Gene Information |
Locus tag | Elen_0839 |
Symbol | |
ID | 8415129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1035183 |
End bp | 1037243 |
Gene Length | 2061 bp |
Protein Length | 686 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645023805 |
Product | Relaxase/mobilization nuclease family protein |
Protein accession | YP_003181202 |
Protein GI | 257790596 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
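The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends with a single stop codon that is not counted in the protein length. The record does not state either convention explicitly; both are assumptions that happen to agree with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1035183
END_BP = 1037243
GENE_LENGTH_BP = 2061
PROTEIN_LENGTH_AA = 686


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under those assumptions, 1037243 - 1035183 + 1 = 2061 bp, and 2061 / 3 = 687 codons, of which 686 encode residues.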
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCATGC TCAAGACGCT CGCCCGGAAG TTCGGCGGCT GCAAGCAGAT AAAGGACTAC CTCGCGCGCA ACGGCCGCGC CGTGACGTTC CGCTCAAGCT CAATCGATCT CGATAGCGCC GGCTGGGACG GGCAGATGGA CGCGACGCGC CAGGTGTTCG GCAAGGACAC CGGGCGAAAG TACTACCATT TCGTAATCTC ACCCGATCCC GAGGACGGCC TGGATGCCAA GGCGGTAGAC GCATTGGCAT GCGATTGGGT GCGCGAGCGC TACAGCGGAT ACCAGTGGGT GATCGAGACT CATATAGATA ACGGCATTCC GCATGCACAT ATCGTCGTCA ACTCCGTCAA CCCCGTTGAC GGGAGGAAGA TACACCTCGA CGACGATGAC GTGCAGGCCG ACGCGATGGA GTTGCAGAGA ATCTGCCGCG ACTACGGCTA CACCGCCTTC GACAACTTCA AGTTCACGAG GGACGAGGAC GGTTCCTGGT ACGCACGCAC GCCGCGCCCC GACCGCCGGC GCGAGGTCGT CTTGCAGGAG GCGCGCCCGG CGCGCCGTCA TGTGACGGAC GCCCAGCGCC GCGCCAGGTA CAGCGGCAAG AAACTCTGGA CCGATGCGAT GCACGACGAT ATCGAGTCGG CTTTGCAGGG GTGCCGGACA TGGCCCGCGC TCGAGAGGCG TTTGGCGGAG AAGGGCTACC GCATCAATAT CAACCGCCGC GGAGTGCTGA CCTTCTACCC GCCCGAGGGC AAGGGCCATC CCGTGAAGGG TTACAAGCTG GACGACTCCT ATACGGTCGA GGGATTGCGC GAGCGCCTTG CCGTGCGACT CGAGGGACAG CGCCCGAACG TCAGGATGTG CACCGAGGAC CTTGTCCCGG ACGTCGTCCT GCCGCCCATG ACTTTCGAGG CGGTCGCATC CAACAATCTC GAGCGGTCTA AGAGGATCGA GCGCTCCGCC GCCCGCCTTG CCGCCGCGGC CGACGCCGTG AACATTATCC GCGAGCGCGG CTACCGCTGC TATGCGGACA TGGCGGCCGA TGCCAGGCGC CTCGCCGCCG AGGTCGACGC GCTCGACGCG AGGGTCGACG CGCTCAAGCT GGAGGCCGAG CAGACCGCCG AGGCCATCCG CCAGGTCGAG GTCTACACGT CTTGCGTCGC GCAGTTGACC GCGCGCCCGG AAGGAGCCGA AGCGCTCGCG GCATGGCGCG AGGACAATGC AGACGAGCTT CGCGAGATGG AATCGATCCG CAAGTGGATG GCCGAGCGCG GAATCGTGCT CACCGATATG ACATACACGG ACCTCCTCGC ACGCCGCGAC GGCGTCAGGG CCGACGTGGC GAACCTCGCT TCCATGGCAG TCGATATCCG CAACCGCTCG CGCCGACTTT CCGCCGCTGT CGATGTCCTG GACTCATCGC AGATGCCGAT TGGCCCCACA GATGTACGGG GCCGCGGCGA GAGGCAGAAG GGCGCCGGTC ACGGCCTGCG CGACGCCAAG GTCATCACCG CGAACCAGAC CGCCGAGCTG ACGCGCGAGT ACACGCGCTA TCGCAAGGCC TATTCGGGGG CCATCGCGGC GGGCGCCCTC ACGCCGGATA AGATCAAGCA GATTGAGCGC GACCACGAGC GGGCCAAGGC GAAGATCATC GGCGAGCAGC GCGAGGAGGA CCTGCGCGCG GAACGCTCGG CGTACGGGAT GGCCGGAAGC GAATGGGCGC AGGATGCTCC CGTGACGGTC GCCGCACCCG GCGGGACGGA CAAGAGGTCG AAGCCCGGAT GGAAGCCGTA CCTCCAACGC 
CCCGCGACCG AGAAGCAGAT GAAATATCTC GAAGACCTCG TAGAGTCCGG CATCATCGCG CAGGCCGACA TCGACAGCCT CGGCGGCGAG CCGACCATCG CGGACGTCAA CGCGCTCCTG AACTCGCATC CCAAGGTCAA GAACCTGGAG CTTGAGAAGA TGGACGGCGA CGGCGCGGAC GGCACACGGA CGTACACGCA GACCCGAGAC GATATCGAGG ACGACGGCTA TACGCCGGCG AACTACACGA GGAGGAGATA G
|
Protein sequence | MAMLKTLARK FGGCKQIKDY LARNGRAVTF RSSSIDLDSA GWDGQMDATR QVFGKDTGRK YYHFVISPDP EDGLDAKAVD ALACDWVRER YSGYQWVIET HIDNGIPHAH IVVNSVNPVD GRKIHLDDDD VQADAMELQR ICRDYGYTAF DNFKFTRDED GSWYARTPRP DRRREVVLQE ARPARRHVTD AQRRARYSGK KLWTDAMHDD IESALQGCRT WPALERRLAE KGYRININRR GVLTFYPPEG KGHPVKGYKL DDSYTVEGLR ERLAVRLEGQ RPNVRMCTED LVPDVVLPPM TFEAVASNNL ERSKRIERSA ARLAAAADAV NIIRERGYRC YADMAADARR LAAEVDALDA RVDALKLEAE QTAEAIRQVE VYTSCVAQLT ARPEGAEALA AWREDNADEL REMESIRKWM AERGIVLTDM TYTDLLARRD GVRADVANLA SMAVDIRNRS RRLSAAVDVL DSSQMPIGPT DVRGRGERQK GAGHGLRDAK VITANQTAEL TREYTRYRKA YSGAIAAGAL TPDKIKQIER DHERAKAKII GEQREEDLRA ERSAYGMAGS EWAQDAPVTV AAPGGTDKRS KPGWKPYLQR PATEKQMKYL EDLVESGIIA QADIDSLGGE PTIADVNALL NSHPKVKNLE LEKMDGDGAD GTRTYTQTRD DIEDDGYTPA NYTRRR
|
| |
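The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal illustration, not the pipeline that produced this record. It builds a codon table from the usual compact 64-letter string (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons), then translates the first three 10-base blocks and the last 21 bases of the gene sequence. The helper name `translate` and the variable names are hypothetical.

```python
from itertools import product

# Codon order: first base varies slowest, each position cycling through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 21 bases, copied from the Gene sequence field.
first_bases = "ATGGCCATGC TCAAGACGCT CGCCCGGAAG"
last_bases = "AACTACACGA GGAGGAGATA G"

print(translate(first_bases))  # MAMLKTLARK
print(translate(last_bases))   # NYTRRR*
```

The two outputs match the start (MAMLKTLARK) and end (NYTRRR) of the listed protein sequence, with the final TAG codon acting as the stop.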