Gene Emin_0426 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0426 
Symbol 
ID6262501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp455493 
End bp457154 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content40% 
IMG OID642610896 
ProductDNA repair protein RecN 
Protein accessionYP_001875320 
Protein GI187250838 
COG category[L] Replication, recombination and repair 
COG ID[COG0497] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00634] DNA repair protein RecN 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones78 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTAAAAA ATTTAAGTAT AAAAAATTTC GCTATTTTAG ATGATATTAA TTTAGAACCG 
GCGCCCGGCC TGAATGTGTT TAGCGGTGAA ACAGGCGCGG GCAAATCAAT TATTATAGAA
GCTTTGGGTT TTGTCTTGGG CGCGCGCGGA GGAAGCAGTT TAATTAAAGA GGGCGCGGGC
AAAATGTCGG TTTCCGCGTC TTTTGATTCA TCTTTTATAC CCAAAAATGT GGCCGCTAAA
TATAATATAT GTGGCCCGGT GATTAACATT AAAAGAGAAC TCGACATAAA AGGCAAAGGC
AAAGGATGGA TTAACAATAC CGTAGTTCCT GCCGGCGCCT TGGCTGAATT GGGCGAATTT
TTAGTTGACT TTCACGGCCA GCACGACCAT CATACGCTTT TAAAAACCTC CACACATTTA
GATTTGCTAG ATAAGTTTGC AAAAAACCAT AAACATTTGG AAGGCATTGC GCTTTACTAT
AATAAAATGC GCGACATAAT GTCTAAAATA GAAGCTCTTC ATATGAGCAA AACGCAAAAA
GAAAAGCTTC TTGATCTTTA TAGGTTCCAA TATAAAGAAA TTATGGACGT TAACCTTAAA
CCAGGTGAAG ATATTGAAAT CGACAACGCT TTGCCAAAGC TTAAACACTC CGGCAAATTA
AAAGAATTGG CTAAAGATGC TTATAACCTG CTGTATGAAG GCGAAACCGC GGGTATAGAT
TTAATATCCA AGGCGGAAAA AAACATTTCA GAAATGTCCG AACTGGACGA GGCTTTAGCC
CCTGTACTGG AAGAGGTAAC ATCATCTCTG CGCAATTTGG AAGACGCTGC CCAAACGCTT
TACACTTATA ATGAAAACAT TGAGGCGGAC CCCTCGATAT TGGATGCTAT GCTGAGCAGA
CAGCAAAAAA TTAACAATCT TAAATTAAAG TACGGCCCTG AAATACAGGA TATTTTGAAT
AACGCCGAAA TATTTTTAAA ACAAATTGAA AGCCTGTCCT ACTCAGAAGA AAAGGAGCAG
GAATTACAAC AGGAACTTGC CGAAGTAAAA GAAAAGCTTA TGAAAGGCTG TATGGTTTTG
CATGAATCGC GCGTAACGGC AGCATCAAAG CTGGATTCTT TGATTACCTT AGAAATAAGC
AAGTTGGGAT TTAATGACGT TAAATTTAAA ACGGATATTA TTTTTGAGGA AGAGTCTTTG
GGTCCTAAAG GCGCCAATAT AGTTGAGTTT TTATTCTCGC CAAACCCGGG GCAAAGCCTT
AGACCTTTAA AAAACATAGC TTCCGGCGGA GAAATGAGCC GCGTTATGCT GGGCCTTAAA
ACCGTTTTGG CCGGTAACAC GCCTGTTATG GTATTTGACG AAATTGACGC GGGCATAGGC
GGACATACAG GCCTTTTGGT AGGACAAAAA CTTAAAAAAG TTTCTTTAGG TAAACAAACG
CTTTGCGTTA CACATTTGGC GCAAGTTGCC GTTTACGGGG ACAAACATTT TAACGTAAGT
AAAACAAGCG ATAAAAAAAA TACCCGCGTA ACAGTAACAC AACTCGACAG TGACGCAAAA
ACTTTAGAAA TTGCTAGAAT GTTAGGTAGT ACGCACGGCA AGGATTCCGC GGGTTTTAAA
CACGCCCAGG AGCTCATTAA ACACAGCCGT GAAACGGCGT AA
 
Protein sequence
MLKNLSIKNF AILDDINLEP APGLNVFSGE TGAGKSIIIE ALGFVLGARG GSSLIKEGAG 
KMSVSASFDS SFIPKNVAAK YNICGPVINI KRELDIKGKG KGWINNTVVP AGALAELGEF
LVDFHGQHDH HTLLKTSTHL DLLDKFAKNH KHLEGIALYY NKMRDIMSKI EALHMSKTQK
EKLLDLYRFQ YKEIMDVNLK PGEDIEIDNA LPKLKHSGKL KELAKDAYNL LYEGETAGID
LISKAEKNIS EMSELDEALA PVLEEVTSSL RNLEDAAQTL YTYNENIEAD PSILDAMLSR
QQKINNLKLK YGPEIQDILN NAEIFLKQIE SLSYSEEKEQ ELQQELAEVK EKLMKGCMVL
HESRVTAASK LDSLITLEIS KLGFNDVKFK TDIIFEEESL GPKGANIVEF LFSPNPGQSL
RPLKNIASGG EMSRVMLGLK TVLAGNTPVM VFDEIDAGIG GHTGLLVGQK LKKVSLGKQT
LCVTHLAQVA VYGDKHFNVS KTSDKKNTRV TVTQLDSDAK TLEIARMLGS THGKDSAGFK
HAQELIKHSR ETA