Gene Hmuk_0467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0467 
Symbol 
ID8409966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp447346 
End bp448356 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content67% 
IMG OID645018790 
Productintegrase domain protein SAM domain protein 
Protein accessionYP_003176308 
Protein GI257386535 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.486802 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGATC TCCAACCGCT GACTCCCGAG GACGGCGTCG AGCGGTTCCT ACGGCACCGT 
GAGCCGTCCG TGCGGGAGTC GACGCTACAG AACGCCAACA CCCGGCTCAA CTACTTCCTC
GACTGGTGCG AGGAGCGCGA GATCGAGGAC CTGAACACGC TGTCTGGTCG GGACATGGCC
GACTTCGTGG CGTGGCGACG GGGCGACATC GCTCCCATCA CGCTCCAGAA GCAGCTATCC
ACGATCCGGC AAGCGTGCCG GTGGTGGGCC GACATCGACG CCGTCGAGGA GGGACTGGCC
GAGAAGATCC ACGCGCCGGA GCTGCCCGAC GGTGCCGAGA GTCGAGACGT GCATCTCGAT
CCCGAGCGGG CCGAGGCCGC GCTGGAGTAC TTCGAGCAGT ACCAGTACGC GAGCCGGGAT
CACGCCCTCA TCGCCCTCAT CTGGCGGACG GGGATGCGAC GCGGTGCCGT CCGCTCGCTC
GACGTGGAGG ACCTCCAACC GGACGATAAC GCCGTCCGAG TGGAGCACCG GATCGACGAG
GGTACGAAGC TCAAGAACGG CGAAGCCGGC GAGCGGTGGG TCTATCTCGG CCCGAAGTGG
TTCCAGGTTC TCGACGACTA CGTGAGCAAC CCCGGCCGCC CGCAGGGGAC CGACGAGTAC
GGCAGACGGC CGCTGTTCAC CAAAGAAGAC GGTGGCCGAC CGTCGGCCCA GACGATCTAC
AAGTGGCTCA TGCGGGCGCT GCACCCCTGC ACCTACGGCG AGTGTCCGCA CGATCGGACG
CCGGAGACCT GCGATGCTCG GGGTAGGACG GCCAACGTCG CAGACTGCCC GTCGTCGCGA
TCTCCCCACG CCGTTCGTCG AGGCGCGATC ACCCACCACC TGACCGAAGA CACCCCGCCG
GAGACGGTGA GCGAGCGCAT GGACGTGTCG CTCGATGTGC TGTACCAGCA CTACGACGCC
CGAACCGAGC GCGAGAAAAT GGACGTGCGA ACCGACCACC TACCGGAATG A
 
Protein sequence
MSDLQPLTPE DGVERFLRHR EPSVRESTLQ NANTRLNYFL DWCEEREIED LNTLSGRDMA 
DFVAWRRGDI APITLQKQLS TIRQACRWWA DIDAVEEGLA EKIHAPELPD GAESRDVHLD
PERAEAALEY FEQYQYASRD HALIALIWRT GMRRGAVRSL DVEDLQPDDN AVRVEHRIDE
GTKLKNGEAG ERWVYLGPKW FQVLDDYVSN PGRPQGTDEY GRRPLFTKED GGRPSAQTIY
KWLMRALHPC TYGECPHDRT PETCDARGRT ANVADCPSSR SPHAVRRGAI THHLTEDTPP
ETVSERMDVS LDVLYQHYDA RTEREKMDVR TDHLPE