Gene Htur_1081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_1081 
Symbol 
ID8741669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp1130981 
End bp1131958 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content67% 
IMG OID646511660 
Productflap structure-specific endonuclease 
Protein accessionYP_003402646 
Protein GI284164367 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) 
TIGRFAM ID[TIGR03674] flap structure-specific endonuclease 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAACG CTGCACTCCG CGACATCGCC GTCATCGAGG AGCTCCCCTT CTCCGAGATC 
GAGGGCGTCG TCGCCGTCGA CGCCCACAAC TGGCTCTACC GGTACCTAAC GACGACGGTC
AAGTGGACGG ACAGCGACGT GTACACGACC GCCGACGGAA CCGAGGTCGC CAACCTCGTC
GGCATCGTCC AGGGGCTGCC CAAGTTCTTC GAGAACGACA TCACGCCGGT GATGGTCTTC
GACGGCGGAC CCTCCGAACT CAAGGAAGAC GAGATCGAGT CCCGCCGCGA TCAGCGTCGC
ACCTACGAGG AGCAACTCGA GGTCGCCCGC GAGGAGGGCG ATCAGGTCGC CATCGCGCAA
CTCGAGTCCC GGACCCAGCG GCTGACGCCG ACGATTCAGG AGACCAGCCG CGAGCTGCTC
CGACGGCTCG ACGTCCCGAT CGTCGAGGCG CCCGCGGAGG GCGAGGCCCA GGCCGCGCAC
ATGGTCCGGC GCGGCGACGC CGACTACGTC GGCTCGGAGG ACTACGACGC CTTGCTGTTC
GGCTCTCCGC TCACGCTGCG CCAACTGACG AGCAAGGGCG ATCCCGAACT GATGGACCTC
GAGGCGACCC TCGATCACCA CGACCTCACG TTAGAGCAGC TGATCGACGC GGCGATCCTC
ATCGGGACGG ACTTCAACGA GGGCGTCTCG GGGATCGGGC CGAAGACCGC TATCAAAGCC
ATCACCGAAC ACGGCGACCT CTGGAGCGTC CTCGAGGACC GAGGCGCGCA CATCGAGTAC
GGCGACCGGG TCAGACAGCT GTTCCGCGAC CCCAACGTGA CCGACGACTA CGAGTTCGAC
ACGGACCTCG ATCCGGACCT CGAGGCCGCC AGGGAGTACG TCTGCGAGGA GTGGCGCGTC
GACGAAGGCG AAGTCGACCG CGGCTTCGAG CGCATCGAGG AGAGCGTCAC GCAGACGGGG
CTGGACCGCT GGACCTGA
 
Protein sequence
MGNAALRDIA VIEELPFSEI EGVVAVDAHN WLYRYLTTTV KWTDSDVYTT ADGTEVANLV 
GIVQGLPKFF ENDITPVMVF DGGPSELKED EIESRRDQRR TYEEQLEVAR EEGDQVAIAQ
LESRTQRLTP TIQETSRELL RRLDVPIVEA PAEGEAQAAH MVRRGDADYV GSEDYDALLF
GSPLTLRQLT SKGDPELMDL EATLDHHDLT LEQLIDAAIL IGTDFNEGVS GIGPKTAIKA
ITEHGDLWSV LEDRGAHIEY GDRVRQLFRD PNVTDDYEFD TDLDPDLEAA REYVCEEWRV
DEGEVDRGFE RIEESVTQTG LDRWT