Gene Huta_2512 details

Gene Information

Locus tag: Huta_2512
Symbol:
ID: 8384814
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 2582527
End bp: 2583507
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 64%
IMG OID: 644973586
Product: flap endonuclease-1
Protein accession: YP_003131409
Protein GI: 257053576
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
TIGRFAM ID: [TIGR03674] flap structure-specific endonuclease
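
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the Start bp and End bp fields of this record.
START_BP = 2582527
END_BP = 2583507


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 981 326
```

Under those assumptions the result agrees with the listed 981 bp and 326 aa.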


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAAACG CCGATCTCCG GGATATCGCC GTCATCGAGG ACGTCAGCTT TGACGTGTTG 
GAGGGCTCGG TCGTCGCCGT CGACGCCCAC AACTGGCTGT ATCGGTATCT CACGACCACG
GTCAAGTGGA CGAACGACGA CATCTATACG ACTGCGGACG GAACGGAAGT CGCCAATCTT
GTCGGCGTCG TCCAGGGCCT GCCGAAGTTT TTCGAGGCCG ACGTCACGCC CGTGTTCGTC
TTCGACGGCG CAGTCACGGA CCTCAAGGAC GACGAGGTAC AGCGTCGGCG CGAGCAGCGC
GAACAGTACG AGGACCAACT CGAAGACGCT CGCGAAGCGG GTGACGCGGT CCGGGTAGCC
CGTCTCGAAT CCCGGACCCA GCGACTCACC GACGTCATTC TGGAGACGAC TCGTGAACTC
CTCGCGCTGC TCGACGTGCC GACCGTCGAC GCCCCAGCCG AGGGGGAAGC GCAAGCCGCC
CACATGGCCC GGCGGGGCGA TGTTGACTAC GTTGGCACCG AAGACTACGA CGCCCTCCTC
TTTGGCGCAC CCTTCACGCT CCGGCAACTC ACCAGTTCTG GTGACCCCGA GCTGATGGAC
TTCGAGGCGA CGCTTGCGGA ACACGACCTC TCCTGGGAGC AACTCGTCGA CGTCGCCCTG
CTCTGTGGGA CGGACTTCAA CGATGGTGTC CGGGGTTACG GCCCCAAGAC AGCGGTCAAA
GCCGTTCGCG AGCACGGCGA TCTCTGGGGC GTCAGCGAGA ACGAGGACGT CTACGTCGAG
AACGCCGATC GGATCCGCGA GCTGTTTCTC GATCCCGCCG TCACCGAGGA GTATACCATC
GAGACGAGTA TCGACCCTGA TCTGGCAGCC GCCCGCGAGT TCGTCACCGA CCAGTGGGCG
GTCGACGCCG AGGAAGTCGC TCGCGGGTTC GAACGGATCG AATCGTCGGT CGTCCAGACG
GGCCTGGAAG ACTGGACCTG A
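
A GC content figure such as the one listed for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted in as a string with the 10-base spacing used here; `gc_content` is a hypothetical helper name, and how the database rounds its reported percentage is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are ignored, so the
    space-separated blocks used in this record can be pasted directly.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# First two 10-base blocks of the gene sequence in this record.
print(gc_content("ATGGGAAACG CCGATCTCCG"))  # 0.6
```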
 
Protein sequence
MGNADLRDIA VIEDVSFDVL EGSVVAVDAH NWLYRYLTTT VKWTNDDIYT TADGTEVANL 
VGVVQGLPKF FEADVTPVFV FDGAVTDLKD DEVQRRREQR EQYEDQLEDA REAGDAVRVA
RLESRTQRLT DVILETTREL LALLDVPTVD APAEGEAQAA HMARRGDVDY VGTEDYDALL
FGAPFTLRQL TSSGDPELMD FEATLAEHDL SWEQLVDVAL LCGTDFNDGV RGYGPKTAVK
AVREHGDLWG VSENEDVYVE NADRIRELFL DPAVTEEYTI ETSIDPDLAA AREFVTDQWA
VDAEEVARGF ERIESSVVQT GLEDWT
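
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). `translate` is a hypothetical helper, not a database or library function.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position),
# paired with the matching amino-acid string; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence up to, and excluding, the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGGGAAACG CCGATCTCCG GGATATCGCC"))  # MGNADLRDIA
```

The printed residues match the first block of the protein sequence listed above.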