Gene Hmuk_2779 details


Gene Information

Locus tag: Hmuk_2779
Symbol:
ID: 8412330
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 2670695
End bp: 2671675
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 67%
IMG OID: 645021124
Product: flap endonuclease-1
Protein accession: YP_003178591
Protein GI: 257388818
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
TIGRFAM ID: [TIGR03674] flap structure-specific endonuclease
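
The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes 1-based inclusive coordinates and a protein length that excludes one terminal stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 2670695
END_BP = 2671675


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 981
print(protein_length(length_bp))  # 326
```

Both results match the Gene Length (981 bp) and Protein Length (326 aa) reported in the record.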


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAAACG CAGACCTACG ATCGCTGGCG TCGCTCGAAG ACGTTCCCTT CGAGGAGCTG 
AGTGACAGCG TCGTCGCCGT CGACGCCCAC AACTGGCTCT ACCGGTATCT CACGACCACG
GTTCGGTTCA CCAGCGACGA GAAGTACACC ACCAGCGACG GGACGGAGGT GGCGAACCTG
ATCGGCGTCG TCCAGGGGCT CCCGAAGTTC TTCGAACACG ACCTGACGCC GGTCTTCGTC
TTCGACGGCG GCGTCACGGA ACTCAAAGAC GACGAGGTCG AGCAGCGCCG CGAGGCCCGC
GAGGCCCGCG AGGAGAAACT CGAAGCCGCC CGCGAGCGCG GGGACTCGAA AGCTGTCGCT
CGGCTGGACT CCCAGACCCA GCGCCTGACC GACACGATCC TCACGACGAC TCGCGAGGTG
CTGAGGCTGC TGGACGTGCC CGTCGTCGAC GCGCCCGCAG AGGGCGAGGC CCAGGCTGCC
CACATGGCAC GCCAGAACGT CGTCGACTAC GTCGGGACCG AAGACTACGA CGCGCTCCTG
CTCGGCGCAC CGCTGACGCT GCGCCAACTC ACCAGCAGCG GCGACCCCGA ACTGATGGAC
TTCCAGGCGA CGCTGGACCA CCACGGCATC ACCTGGGAGC AACTGGTCGA CGCCGCGATC
CTGATGGGGA CGGACTTCAA TCCCGGCATC GACGGTGTCG GGCCGAAGAC CGCGATCAAG
CTGGTGAAAG AGCACGGCGA CCTCTGGGGC GCGCTCGACG CCCGCGACGC CCACGTCGAA
CACGGCGACC GCATCCGAGA GCTGTTCCTC GATCCGGCGG TCACGGACGA CTACGACCTC
GATCTGGCGG TGAACCCGGA CCTTGACGCC GCCCGCGAGT ACGTCACCGG CGAGTGGGAG
GTCGACGAAG GCGAGGTCGC GCGCGCCTTC GAGCACATCG AGGCCAGCGT CGTCCAGACC
GGACTGGACG ACTGGGCCTG A
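
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any calculation. The sketch below shows one way to do that and to compute a GC percentage that can be compared with the reported GC content field. The helper names `clean_sequence` and `gc_percent` are illustrative, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above; paste the full listing here
# to compare the result with the GC content reported in the record.
sample = "ATGGGAAACG CAGACCTACG ATCGCTGGCG TCGCTCGAAG ACGTTCCCTT CGAGGAGCTG"
cleaned = clean_sequence(sample)
print(len(cleaned), round(gc_percent(cleaned), 1))
```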
 
Protein sequence
MGNADLRSLA SLEDVPFEEL SDSVVAVDAH NWLYRYLTTT VRFTSDEKYT TSDGTEVANL 
IGVVQGLPKF FEHDLTPVFV FDGGVTELKD DEVEQRREAR EAREEKLEAA RERGDSKAVA
RLDSQTQRLT DTILTTTREV LRLLDVPVVD APAEGEAQAA HMARQNVVDY VGTEDYDALL
LGAPLTLRQL TSSGDPELMD FQATLDHHGI TWEQLVDAAI LMGTDFNPGI DGVGPKTAIK
LVKEHGDLWG ALDARDAHVE HGDRIRELFL DPAVTDDYDL DLAVNPDLDA AREYVTGEWE
VDEGEVARAF EHIEASVVQT GLDDWA
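
The protein sequence above can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. It does not model alternative start codons. `CODON_TABLE` and `translate` are illustrative names, not part of the record.

```python
# Amino acid assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored; a codon containing anything other than
    A, C, G, or T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "ATGGGAAACG CAGACCTACG ATCGCTGGCG TCGCTCGAAG ACGTTCCCTT CGAGGAGCTG"
print(translate(first_line))  # MGNADLRSLASLEDVPFEEL
```

The output matches the first twenty residues of the listed protein sequence. Passing the full gene sequence to `translate` should yield the complete protein if the listing is in frame.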