Gene Emin_1534 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1534 
Symbol 
ID6263334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1626867 
End bp1628864 
Gene Length1998 bp 
Protein Length665 aa 
Translation table11 
GC content42% 
IMG OID642612022 
Productexcinuclease ABC, B subunit 
Protein accessionYP_001876418 
Protein GI187251936 
COG category[L] Replication, recombination and repair 
COG ID[COG0556] Helicase subunit of the DNA excision repair complex 
TIGRFAM ID[TIGR00631] excinuclease ABC, B subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000060622 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.0539100000000002e-18 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGGTATAT TTAAATTAAA AGCGCCTTTC TCTCCTTCGG GGGACCAACC GCAGGCAATT 
AAAAACCTTT GCCAAAATAT TAAAAACGGA CAAACAAGGC AAACCTTGCT GGGCGTTACC
GGCTCGGGCA AAACATATAC CATGGCAAAC ATAATAGCTC AAACGGATAT GCCCGCGCTT
ATAATGTCGC CGAACAAAGT TTTAGCGGCG CAGCTTTACG CGGAGTTTAA ACAATTTTTT
CCGGAAAATT CGGTTGAATA TTTTATTTCT TATTACGACT ATTACCAGCC CGAAGCCTAT
ATACCGCAAA CAGACACCTA TATAGAAAAA GATTCTTCCA TCAATGAGCA TATTGAGCAA
ATGCGTCTTA AGGCAACAAC ATCTCTTTTA ACAAGAAACG ACGTTATTGT GATAGCCTCG
GTATCAAGTA TTTATAACAT AGGTTCGCCT GATAATTTTG CCGAAATGTG TTTATATGTA
AAAAAAGGAA TTCCTTTAAA CCGGGTGGCG GTAACATCTC TTTTAATTAA AAACCAATAT
GAAAGAAGCG AAATGGAATT TACACCGGGC AAATTCCGCC TGCGCGGCGG AAATATAGAT
ATTTTGCCCC CGTACAGAGA AACGGGCATC CGCATAGAAA TGGGCCCGCA GGCAGTAAAC
GCCCTTTACA AAATACACCC CATTACGGGC GACGTTATTG AAGAAGTTGA TGAGGAATTT
ATTTACCCTG CCAAACACTT TGTTGTAAAA GAATCAGACA TTGACCGCGC CATTAAAGAA
ATTAATGAGG AAAAAGAAGG GCGCGTGAAA GAACTTGAAG CGATTGGAAA ACCTTTGGAA
GCTTACCGCC TTAAACAAAG AACCGAATAC GATATGGAAA TGCTTAAACA GACGGGTTTT
TGCAAAGGTA TTGAAAACTA TTCAAGACCT TTGGCGGGAA GGGAACCCGG GTCCAGACCG
GACTGTTTGT TTGATTATTT TAGAAAGCAT GAGAATTTTT TAGTTTTTAT AGATGAGTCC
CACGTGGCCG TGCCGCAGGT GCGCGGCATG TATAACGGAG ACAGGAGCAG AAAACAAATG
CTTATAGATT TTGGCTTCCG CCTGCCTTCA GCCTTAGACA ACAGGCCTTT AAAATTTGAC
GAGTTTGAAA AAATTTTACC TTCCACCGTT TTTGTGTCCG CCACGCCAGG CCCTTATGAG
TTAACCGTAA GCGCGAACAA CATTGTTGAG CAAGTTATCC GCCCTACAGG CCTTGTGGAC
CCGCAGGTTT CCATACACCC TACAGCAGGC CAAATAGGCC ACTTAATAAG TAAAATTGAA
GAGCGTATTA AAAAAGGACA GCGCAGTTTA GTTCTTTCTT TAACTAAAAA AACGGCTGAA
GACCTTACCG TATTTTTTGA CGAGAAAGGA ATAAAAGCCC GATACTTGCA TTCTGATATA
GAATCTTTGG AAAGGGTTGA AATTTTACAA AAATTCAAGC AAGGGGTTTT TGACGTGCTT
GTGGGTATCA ATCTTTTAAG AGAGGGGCTT GATATACCGC AGGTAGGTCT TGTGGCGATA
CTGGGCGCCG ACAATGAAGG GTTTTTAAGA AACGAAACCA CTTTAATACA AATTTCCGGC
CGCGCGGCTA GAAACATTGA CGGCGAAGTT GTGTTATACG CGGACAGAAA AACAGATTCC
ATTAAAAACG CCCTTGCCGA AATGGACCGC AGGCGTGAAA AGCAAACCGC TTATAACAAA
GAACACCATA TAACCCCGCA GTCTATTATA AAAGCCGAAA TTGAATTTAA AGATTTTGAA
AACACTGCCA AAACCGAAGG GTTAAGAGCG TTGCACAATT TTACTGATAT TCCCAAACCA
GACAACCTGC CTAAAATGAT AAAAGAAATC GAAAGGCAAA TGAAAGACGC GGCCGACAAT
CTTAATTTTG AGCTTGCGGT CGACCTGCGT GACAGAATGT TAGAACTTAA AAGCATGAGA
GTAAAAACAA AAAAATGA
 
Protein sequence
MGIFKLKAPF SPSGDQPQAI KNLCQNIKNG QTRQTLLGVT GSGKTYTMAN IIAQTDMPAL 
IMSPNKVLAA QLYAEFKQFF PENSVEYFIS YYDYYQPEAY IPQTDTYIEK DSSINEHIEQ
MRLKATTSLL TRNDVIVIAS VSSIYNIGSP DNFAEMCLYV KKGIPLNRVA VTSLLIKNQY
ERSEMEFTPG KFRLRGGNID ILPPYRETGI RIEMGPQAVN ALYKIHPITG DVIEEVDEEF
IYPAKHFVVK ESDIDRAIKE INEEKEGRVK ELEAIGKPLE AYRLKQRTEY DMEMLKQTGF
CKGIENYSRP LAGREPGSRP DCLFDYFRKH ENFLVFIDES HVAVPQVRGM YNGDRSRKQM
LIDFGFRLPS ALDNRPLKFD EFEKILPSTV FVSATPGPYE LTVSANNIVE QVIRPTGLVD
PQVSIHPTAG QIGHLISKIE ERIKKGQRSL VLSLTKKTAE DLTVFFDEKG IKARYLHSDI
ESLERVEILQ KFKQGVFDVL VGINLLREGL DIPQVGLVAI LGADNEGFLR NETTLIQISG
RAARNIDGEV VLYADRKTDS IKNALAEMDR RREKQTAYNK EHHITPQSII KAEIEFKDFE
NTAKTEGLRA LHNFTDIPKP DNLPKMIKEI ERQMKDAADN LNFELAVDLR DRMLELKSMR
VKTKK