Gene Emin_0042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0042 
Symbol 
ID6263794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp42942 
End bp44132 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content43% 
IMG OID642610505 
Productexodeoxyribonuclease VII, large subunit 
Protein accessionYP_001874947 
Protein GI187250465 
COG category[L] Replication, recombination and repair 
COG ID[COG1570] Exonuclease VII, large subunit 
TIGRFAM ID[TIGR00237] exodeoxyribonuclease VII, large subunit 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones109 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGAAT TCGGCAATAA AATTTTTACC GTTACAGAAA TAAGCTCATC CATAAAGGAA 
ATGCTTGAAG GCGTGCTTAA CGACGTGCGC GTGGAAGGGG AAATAAGCGG CCTTAAAAAA
GCGGCAAGCG GGCACATTTA TTTTGATATT AAAGATGAAA ACGCCCTTAT AAGCGCTGTT
TTATTTAAAG GCTACGCCCT TAGAACAGCC GACCTTAAAG ACGGGCTTAA AGTGCTTGTA
CGCGGCGACC TTTCTTGCTA TATCAAACAG GGCAGATACC AAATAATAGT AAAAAGCCTT
GTGCCCACAA GCGTGGGGGA TTTGTATTTG GAATTTGAAA GACTTAAACA AAAGCTTGCC
GCCGAAGGTA TTTTTGACGA AGAACGCAAA CGTCCTTTGC CGGAGTTCGC TTCAAGAATA
GGAGTTGTAA CTTCACCCAC AGGGGCGGCT TTACAAGATA TTTTGAGCGT GCTTAAAAGA
AGAAGCCCAA ATTTAGAAGT AATTATTTCA CCTTCGCTTG TGCAGGGGCA GGAAGCTCCC
GCCCAAATAG TAAAAGCTAT TGAACGGCTT AATAAAATAA ACCCCGCGCC TGATGTTATT
TTAGTGGGGC GAGGCGGCGG AAGCATGGAA GACCTTTGGT GTTTTAATGA TGAAGCCGTA
GCCAGGGCAA TTTATAAATC CAAGATACCC GTGGTTTCCT GCGTGGGGCA TGAAACGGAC
TTTACTATTG CCGACTTTGC GTCCGACCTG CGCGCGCCTA CGCCAAGCGC CGCGGCCGAA
ATTGTGGCGC AAAACAGCGC CGGCGTGGCA AATTACGTAA CCCAGCTTGT TAAACGTATG
ATTAACACGC AAAACCTGCT TATTTCTTTA GCTCAAAACA GGTTAAACAT TGCCATGTCT
AACAAATTTT TAAAAGACCC TCTTTTTTAC CTAAACCAGC GCGAGCAGGA AACCGACGAT
TTAACCGCGC TTCTTGACAG GGCATTTAAA GATAAAATTA AAGCAGCGGA TAATGCCCTT
TCAATGCTTA CGCATAAACT TAACGCTTTA AGCCCCGGGG CTGTTTTAAA AAGAGGTTTT
AGTATTGTAA GAAAACAAGG CCGCCCGGTA AAAAATGCCG AAGAAGTAAA TAAAGGCGAT
ATTTTAGATA TAGAACTTTA TAAAGGAAAT ATACAAACGG AGAAGATTTA A
 
Protein sequence
MSEFGNKIFT VTEISSSIKE MLEGVLNDVR VEGEISGLKK AASGHIYFDI KDENALISAV 
LFKGYALRTA DLKDGLKVLV RGDLSCYIKQ GRYQIIVKSL VPTSVGDLYL EFERLKQKLA
AEGIFDEERK RPLPEFASRI GVVTSPTGAA LQDILSVLKR RSPNLEVIIS PSLVQGQEAP
AQIVKAIERL NKINPAPDVI LVGRGGGSME DLWCFNDEAV ARAIYKSKIP VVSCVGHETD
FTIADFASDL RAPTPSAAAE IVAQNSAGVA NYVTQLVKRM INTQNLLISL AQNRLNIAMS
NKFLKDPLFY LNQREQETDD LTALLDRAFK DKIKAADNAL SMLTHKLNAL SPGAVLKRGF
SIVRKQGRPV KNAEEVNKGD ILDIELYKGN IQTEKI