Gene Emin_0959 details

Gene Information

Locus tag: Emin_0959
Symbol:
ID: 6263881
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1058326
End bp: 1059516
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 40%
IMG OID: 642611439
Product: nuclease
Protein accession: YP_001875849
Protein GI: 187251367
COG category: [L] Replication, recombination and repair
COG ID: [COG0863] DNA modification methylase
TIGRFAM ID:
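
The coordinates and lengths listed above agree with one another. The following is a minimal sketch of that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the reported protein length excludes the stop codon; the record states neither explicitly.

```python
# Values copied from the Gene Information fields above.
start_bp = 1058326
end_bp = 1059516

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the reported protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, protein_length, remainder)  # 1191 396 0
```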


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.00000273915
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAGATAT TAAAAATATC GCTTAATAAG ATTACTCCCT ACATCAATAA TGCTAAGGAG 
CATCCTCAGA GCCAAATAGA CCAGATAAAA GCAAGTATTC TGGAGTTTGG CTTCAATGAT
CCTATTGCCA TTGACGAGAA CTTTGTAATC ATTGAAGGCC ACGGCAGATA TGAGGCATTA
AAACAACTTG GTCATAAAGA AGTTGAAGTT ATCCAACTCT CCCACCTTTC AAAAGTTCAA
AAGAAACAAT ATATTTTGGC GCATAATAAA ATTGCTTTAA ATACCGGGTT TGATATTGAG
AAGTTAAAAC TAGAAACAGC AGCCATTATT GAACTTGGCG GCAAACTGGA CATTCTAGGC
TTTACCGATA TTGATGAAGT TCAGATGCCG GAAACAATCG TATTAGAAGA AAACATAGAT
GATTTACCCA GCATAGACAA TGCTCCTGCT GTCACTAAGA CCGGGAATGT TTGGCTTTTG
GGTAAGCACA GATTATTATG TGGTGACAGC ACAAAAAAAG AAAGTTTTGA CGCAATCTCC
GCCAAAGAAG CTGATTTTAT ATTTACAGAC CCTCCTTATG GGATAGATAT AGCCAAGAGT
GGCGCAATAG GGAGTAGCGG TAAAAAGTAT AAGCCGATAA TCGGAGATAA TGACACCGCC
ACAGCAAGAG CATTTTATGA GTTGGCAAAA GAACTAAACC TCAAAGATAT GTTGATTTGG
GGTGCAAACT ATTTTGCAGA CTTTCTCCCA GTAAGCAGAA GATGGCTTGT ATGGAATAAA
AGGGGCGAAA TGGATTCTAA CAACTTTGCT GATGGAGAGA TAGCTTGGGT ACGAAGTGAT
GGCAACCTGC GTATATTCAG CCATGTGTGG AGTGGTTATA CAAGAGAAGG CAGCCATAAA
GAAGAATTAA AGACACGCAT CCACCCAACA CAAAAGCCTG TCGGCGTATG CATAGATATC
TTTAAAGAAC TAGAACCCTT TGAAGTTGTC TTTGACGCTT TTATGGGTAG TGGCAGTACC
TTAATAGCTT GTGAGAAGAT GAAGAAGGTT TGCCTGGGCA TAGAGATTGA TCCTAAATAT
TGCGACCTGA TTATTGAACG CTGGCAGAAC TATACCGGAG AAAAGGCCGT ACTGAAGAAC
ACAGGAAAGA CTTATGAAGA AGAAAAAAAA GACAGCAAAA AAGGGAACTA G
 
Protein sequence
MQILKISLNK ITPYINNAKE HPQSQIDQIK ASILEFGFND PIAIDENFVI IEGHGRYEAL 
KQLGHKEVEV IQLSHLSKVQ KKQYILAHNK IALNTGFDIE KLKLETAAII ELGGKLDILG
FTDIDEVQMP ETIVLEENID DLPSIDNAPA VTKTGNVWLL GKHRLLCGDS TKKESFDAIS
AKEADFIFTD PPYGIDIAKS GAIGSSGKKY KPIIGDNDTA TARAFYELAK ELNLKDMLIW
GANYFADFLP VSRRWLVWNK RGEMDSNNFA DGEIAWVRSD GNLRIFSHVW SGYTREGSHK
EELKTRIHPT QKPVGVCIDI FKELEPFEVV FDAFMGSGST LIACEKMKKV CLGIEIDPKY
CDLIIERWQN YTGEKAVLKN TGKTYEEEKK DSKKGN
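
Both sequences are printed in space-separated blocks of ten characters, so the spacing has to be removed before they can be used. The sketch below shows one way to do that and to translate the gene sequence so it can be compared with the protein sequence. The helper names `clean` and `translate` are illustrative and not part of any database tooling. The codon table is the standard one written out in NCBI order; translation table 11 is understood to assign the same amino acids and to differ only in which start codons it permits, so treat that as an assumption if alternative start codons matter.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first, second and third base each cycle
# through T, C, A, G, with the third base varying fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the listing above.
first_bases = "ATGCAGATAT TAAAAATATC GCTTAATAAG"
print(translate(first_bases))  # MQILKISLNK
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` allows the same comparison over the whole protein.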