Gene Hmuk_1072 details


Gene Information

Locus tag: Hmuk_1072
Symbol: uvrC
ID: 8410591
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 1020537
End bp: 1022264
Gene Length: 1728 bp
Protein Length: 575 aa
Translation table: 11
GC content: 71%
IMG OID: 645019408
Product: excinuclease ABC subunit C
Protein accession: YP_003176906
Protein GI: 257387133
COG category: [L] Replication, recombination and repair
COG ID: [COG0322] Nuclease subunit of the excinuclease complex
TIGRFAM ID: [TIGR00194] excinuclease ABC, C subunit
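
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record itself. The function name `check_record` is hypothetical and used only for illustration.

```python
def check_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the coordinates and lengths of a CDS record agree.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon is a stop codon that is not counted
    in the protein length.
    """
    span_bp = end_bp - start_bp + 1            # inclusive span on the replicon
    if span_bp != gene_length_bp:
        return False
    if gene_length_bp % 3 != 0:                # a CDS should be whole codons
        return False
    codons = gene_length_bp // 3
    return codons - 1 == protein_length_aa     # subtract the stop codon


# Values copied from the Gene Information fields above.
print(check_record(1020537, 1022264, 1728, 575))
```

With the values above, the span is 1022264 - 1020537 + 1 = 1728 bp, which is 576 codons, or 575 residues plus one stop codon.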


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.353425
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.0879577
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGCGA CCGGGGTCCG CGAGGCCGCC GACGTTCTCC CCCGAGAGCC CGGCGTCTAC 
CACTTCGTCG CCGACCGCGT GCTGTACGTC GGCAAGGCCG TCGACCTGCG CGACCGGGTC
CGCTCCTACG CCGATCCGCG GTCCGCGCGG ATCGCACAGA TGGTCGAGCG CGCCGAGTCG
ATCGAGTTCA GCGTCACCGA CACGGAGACG CAGGCGCTCT TGCTGGAGGC GAACCTGATC
AAACGCCACC AGCCGCCGTA CAACGTCCGG CTCAAAGACG ACAAGTCGTA TCCGCTCGTC
CAGCTCACCG ACCACCCGGT GCCCCGGATC GAGGTCACCC GCGATCCAGA GGAGGGCGCG
ACCGTCTACG GCCCGTTCAC CGACAAAGGC CGGGTCGAGA CCGTCGTCAA GGCCCTGCGC
GAGACCTACG GGCTGCGGGG GTGTTCCGAC CACAAGTACG AGGGGCGCGA CCGACCGTGT
CTGGACTACG AGATGGGGCT CTGTACCGCG CCCTGTACCG GCGAGATTTC TGCTGTCGAC
TACGCCGAAG ACGTGGAGAG CGTCGAGCGG TTCTTCGGCG GCGAGACCGG CGTGCTGGCC
GATCCGCTCC GCCGGGAGAT GGCCGCCGCG TCGGAGGCCC AGGAGTTCGA GCGCGCGGCC
AACTGCCGGG ACAAGCTCGA AGCCGTCGAG GCGTTCCACG GCGACGCCGA CGACGCGGTC
CAGACGACCC GCGACGAACG GGCCGTCGAC GTGCTGGGGG CCGTCCGTGA GGGCGAACGC
GCGACCGTCG CCCGCCTGCA CGCCGCCGAC GGCCAGCTGA TCGATCGCGA GCGCCACGGG
CTCGACGCGC CCGACGGCGA CGGCGTCGGC GAGGTACTCT CGGCGTTTAT CACCCAGTAC
TATGCCGAGC GCGAGTTTCC GGAGGCGGTG CTGTGCTCGG AACGCCCCGG CGAGGACGTG
GTCGAGTGGC TCGCGGGCGA GGGCGTCGAC GTGCGCGTCC CCGGTGCGGG CCGAGAGGCG
ACACTCGTCG ATCTCGCGCT GAAGAACGCC CGCCGTGGCG GGCCGGCACG CGACGACACC
GCCGCGCTGG CCGACGCGCT CGCTCTCGAC TCGGCCGATC GGATCGAGGG GTTCGACGTG
AGCCACGCGC AGGGTCGGGC CGTCGTCGGG TCGAACGTCG CTTTCGTCGA CGGCGACGCC
GCCAAGCGCG ACTACCGCCG CAAGAAGCTC ACGGAGCGCA ACGACGACTA CGCCAACATG
CGGGAGCTGG TTCGCTGGCG GGCGAAACGA GCGATCGAGG ACCGGGACGA CCGCCCCGAC
CCCGACCTCC TCCTGATCGA CGGCGGCGAC GGCCAGCTCG GCGCGGCCCG GGACGCGCTG
GCCGATACCG GCTGGGACGT GCCGGCCGTG GCACTCGCCA AGGACGAGGA ACTGGTCGTG
ACTCCCACTG GCACGCACGA CTGGGACGAC GACGACCCGA AGCTCCACCT CTGTCAGCGA
GTTCGCGACG AGGCCCACCG CTTTGCCGTC CAGTACCACC AGACGGTCCG CGACGAGGTC
ACGACGGCGC TGGACGAGGT CCCCGGCGTC GGTCCCGAGA CCCGCAAGCG ACTGCTCAGG
CGTTTCGGGA GCGTCGACAA CGTTCGGGCG GCCTCGACGG AGGAGCTGCT GGCCGTCGAG
GGCGTCGGCG CGGGGACTGC CGAGACCATT CGCTCGCGGC TGTCCTGA
 
Protein sequence
MDATGVREAA DVLPREPGVY HFVADRVLYV GKAVDLRDRV RSYADPRSAR IAQMVERAES 
IEFSVTDTET QALLLEANLI KRHQPPYNVR LKDDKSYPLV QLTDHPVPRI EVTRDPEEGA
TVYGPFTDKG RVETVVKALR ETYGLRGCSD HKYEGRDRPC LDYEMGLCTA PCTGEISAVD
YAEDVESVER FFGGETGVLA DPLRREMAAA SEAQEFERAA NCRDKLEAVE AFHGDADDAV
QTTRDERAVD VLGAVREGER ATVARLHAAD GQLIDRERHG LDAPDGDGVG EVLSAFITQY
YAEREFPEAV LCSERPGEDV VEWLAGEGVD VRVPGAGREA TLVDLALKNA RRGGPARDDT
AALADALALD SADRIEGFDV SHAQGRAVVG SNVAFVDGDA AKRDYRRKKL TERNDDYANM
RELVRWRAKR AIEDRDDRPD PDLLLIDGGD GQLGAARDAL ADTGWDVPAV ALAKDEELVV
TPTGTHDWDD DDPKLHLCQR VRDEAHRFAV QYHQTVRDEV TTALDEVPGV GPETRKRLLR
RFGSVDNVRA ASTEELLAVE GVGAGTAETI RSRLS
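
As a rough illustration of how the protein sequence and GC content relate to the gene sequence, the following sketch translates a nucleotide string and computes its GC fraction. It uses the standard codon-to-amino-acid assignments, which to my knowledge are shared by translation table 11 (table 11 differs from the standard code in its permitted start codons, which this sketch does not model). The helper names `clean`, `gc_content`, and `translate` are hypothetical, and only the first row of the gene sequence is used as input.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases, and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    s = clean(seq)
    if not s:
        return 0.0
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):          # ignore a trailing partial codon
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence above (60 bases).
first_row = "ATGGACGCGA CCGGGGTCCG CGAGGCCGCC GACGTTCTCC CCCGAGAGCC CGGCGTCTAC"
print(translate(first_row))
print(gc_content(first_row))
```

Translating the first row yields MDATGVREAADVLPREPGVY, which matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same functions would give the whole-gene values to compare against the reported protein and GC content.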