Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Emin_0959 |
Symbol | |
ID | 6263881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | - |
Start bp | 1058326 |
End bp | 1059516 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642611439 |
Product | nuclease |
Protein accession | YP_001875849 |
Protein GI | 187251367 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0863] DNA modification methylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.00000273915 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGATAT TAAAAATATC GCTTAATAAG ATTACTCCCT ACATCAATAA TGCTAAGGAG CATCCTCAGA GCCAAATAGA CCAGATAAAA GCAAGTATTC TGGAGTTTGG CTTCAATGAT CCTATTGCCA TTGACGAGAA CTTTGTAATC ATTGAAGGCC ACGGCAGATA TGAGGCATTA AAACAACTTG GTCATAAAGA AGTTGAAGTT ATCCAACTCT CCCACCTTTC AAAAGTTCAA AAGAAACAAT ATATTTTGGC GCATAATAAA ATTGCTTTAA ATACCGGGTT TGATATTGAG AAGTTAAAAC TAGAAACAGC AGCCATTATT GAACTTGGCG GCAAACTGGA CATTCTAGGC TTTACCGATA TTGATGAAGT TCAGATGCCG GAAACAATCG TATTAGAAGA AAACATAGAT GATTTACCCA GCATAGACAA TGCTCCTGCT GTCACTAAGA CCGGGAATGT TTGGCTTTTG GGTAAGCACA GATTATTATG TGGTGACAGC ACAAAAAAAG AAAGTTTTGA CGCAATCTCC GCCAAAGAAG CTGATTTTAT ATTTACAGAC CCTCCTTATG GGATAGATAT AGCCAAGAGT GGCGCAATAG GGAGTAGCGG TAAAAAGTAT AAGCCGATAA TCGGAGATAA TGACACCGCC ACAGCAAGAG CATTTTATGA GTTGGCAAAA GAACTAAACC TCAAAGATAT GTTGATTTGG GGTGCAAACT ATTTTGCAGA CTTTCTCCCA GTAAGCAGAA GATGGCTTGT ATGGAATAAA AGGGGCGAAA TGGATTCTAA CAACTTTGCT GATGGAGAGA TAGCTTGGGT ACGAAGTGAT GGCAACCTGC GTATATTCAG CCATGTGTGG AGTGGTTATA CAAGAGAAGG CAGCCATAAA GAAGAATTAA AGACACGCAT CCACCCAACA CAAAAGCCTG TCGGCGTATG CATAGATATC TTTAAAGAAC TAGAACCCTT TGAAGTTGTC TTTGACGCTT TTATGGGTAG TGGCAGTACC TTAATAGCTT GTGAGAAGAT GAAGAAGGTT TGCCTGGGCA TAGAGATTGA TCCTAAATAT TGCGACCTGA TTATTGAACG CTGGCAGAAC TATACCGGAG AAAAGGCCGT ACTGAAGAAC ACAGGAAAGA CTTATGAAGA AGAAAAAAAA GACAGCAAAA AAGGGAACTA G
|
Protein sequence | MQILKISLNK ITPYINNAKE HPQSQIDQIK ASILEFGFND PIAIDENFVI IEGHGRYEAL KQLGHKEVEV IQLSHLSKVQ KKQYILAHNK IALNTGFDIE KLKLETAAII ELGGKLDILG FTDIDEVQMP ETIVLEENID DLPSIDNAPA VTKTGNVWLL GKHRLLCGDS TKKESFDAIS AKEADFIFTD PPYGIDIAKS GAIGSSGKKY KPIIGDNDTA TARAFYELAK ELNLKDMLIW GANYFADFLP VSRRWLVWNK RGEMDSNNFA DGEIAWVRSD GNLRIFSHVW SGYTREGSHK EELKTRIHPT QKPVGVCIDI FKELEPFEVV FDAFMGSGST LIACEKMKKV CLGIEIDPKY CDLIIERWQN YTGEKAVLKN TGKTYEEEKK DSKKGN
|
| |