Gene Htur_5039 details


Gene Information

Locus tag: Htur_5039
Symbol:
ID: 8745845
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013748
Strand:
Start bp: 28084
End bp: 29574
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 64%
IMG OID: 646515653
Product: hypothetical protein
Protein accession: YP_003406600
Protein GI: 284176324
COG category:
COG ID:
TIGRFAM ID:
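
The coordinates and lengths listed above are mutually consistent if the start and end positions are read as 1-based and inclusive, and if the protein length excludes a single terminal stop codon. Both readings are assumptions rather than anything the record states. A minimal Python sketch of that arithmetic, with hypothetical function names chosen for illustration:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(28084, 29574)
print(length, protein_length(length))  # prints: 1491 496
```

The result matches the Gene Length (1491 bp) and Protein Length (496 aa) fields of this record.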


Plasmid Coverage information

Num covering plasmid clones: 59
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCACCCG ACGACCACGA TCGACCAGCA CAGTCTCCCC GCGAACAGAT CGACGCGCTC 
GCCGAGGAGT ACGGCGTCGA TGTCGACGAC CTGTTGATCC AGAGTCGCCG CCGCGACCCG
ATGTACAAGG GCACAGACGC CGACCACGCG AAAGCCGAGT GGTTCGCCCG CCTCTGGCAG
CAAGCCGTCG AGCAACGAGA GAGCGACCGT ATCCACGTCC GCGGCGTCCA CTACACAGTC
TACATGTCCG ATATGGACGT CGAGCCACCG ACGAACTGTT CGTGGGAGAG CTACGACAAC
ACCCAGCGCT GTTATGACTA CCTCGAAGAG TGTGCTGTCC TCGCGCGAAT CCTCGGCTAC
ATCCCGCTGG ATGGGATTAT CGACAAGCGC GCGGATACGC GAACCGTCAC GGAGTATGGA
ACCCACACCC TCGAGCCCGA CCCCGAGGGT GTGAGTGCCC CGACCGGGGT TGCAACGCCG
ACGATTCCCC ATCCAGAGGC TCGCGCCGGC CTTGTCTTCG ATCCTGCGGA AATCGACTAC
TCTCAGTGGG TCGGCGGCCG CGTGGCCTCG AGCGCCCGCG AGCAACTGTC GTTTGACGAG
GCCCGCCAGT CGCCGTATCA CATCGAACTG TGGTCTGAAA AGACGCTTCC CGATTACATC
CGCGGTCCCG GTGGGCTGGC CGCCGAGTAC GGCTGCAACG TCATCGTCGA AGGCGAGGGC
GACCTATCGT TGACCGTCGC GAACGAGCTG GCCCAGCGAA TCGAGGCCGC CGGGAAGCCC
GCGGTGATTC TCTATCTTGC GGACTTCGAT CCGAAAGGCT ACGATATGCC GGCGAACATG
GCGGGCAAGC TGGCGTGGCT TCACCAGCGC GGCGATCTCG AGCAACGCGT CGCCATCGAG
CGGCTGGCCG TGACGAAAGA CCAGATCGAA CAGCTGGAAC TCCCGCGAAA ACCCATCGAG
GAGAGTACGG CGACCGGCAC CGGCGGCGTC GCGTACAACC GCCGCGTGAC CGAGTGGGAA
GAACAACACG GCGCCGGGGC GACCGAGTTG AACGCTCTCG AGCAACAGCC CGAGGAGTTC
CGCCGAATCG TTCGGTCGGC GTTGGAGCGA TACACGGACC CCGACCTCGA GTCCAAGAAC
GAACGCCGCG GCGACGAGTG GGAGGACGAC GTCGAATCAC GGATCGAGGC GCGGCTTCGC
GAGGCTGGCG CCAATGACGA TCTCGATGAC CTGGAGGCGT GGATCGACGA TTTCAACGAC
GCCTATGCGG AGGTCGCGGA CGTATTCGGG CGCTTACGCG GGATGATGGA CGACGAGTCG
GCGCTCGGGG CGTGGGAATC GATGGTCGAC GAACTGCTCG CAGACACCGA GTTTCCCGTC
GCGACCGTTC CCAAGGGCGA CGCGGCGTTG CCCGATGATC CGATCTACGA CTCGGGGCGT
TCCTACGCGG AAAATAAGAT GCGGATCGAT CGGTATCGGG CGTCGGAGTA G
 
Protein sequence
MPPDDHDRPA QSPREQIDAL AEEYGVDVDD LLIQSRRRDP MYKGTDADHA KAEWFARLWQ 
QAVEQRESDR IHVRGVHYTV YMSDMDVEPP TNCSWESYDN TQRCYDYLEE CAVLARILGY
IPLDGIIDKR ADTRTVTEYG THTLEPDPEG VSAPTGVATP TIPHPEARAG LVFDPAEIDY
SQWVGGRVAS SAREQLSFDE ARQSPYHIEL WSEKTLPDYI RGPGGLAAEY GCNVIVEGEG
DLSLTVANEL AQRIEAAGKP AVILYLADFD PKGYDMPANM AGKLAWLHQR GDLEQRVAIE
RLAVTKDQIE QLELPRKPIE ESTATGTGGV AYNRRVTEWE EQHGAGATEL NALEQQPEEF
RRIVRSALER YTDPDLESKN ERRGDEWEDD VESRIEARLR EAGANDDLDD LEAWIDDFND
AYAEVADVFG RLRGMMDDES ALGAWESMVD ELLADTEFPV ATVPKGDAAL PDDPIYDSGR
SYAENKMRID RYRASE
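
The relationship between the gene sequence and the protein sequence above can be checked with a short translation routine. The sketch below is illustrative only: `translate` is a hypothetical helper, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which this sketch does not model). It translates the first 60 bases of the gene sequence.

```python
# Codon table in the conventional TCAG ordering; the amino-acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence, spacing kept as it appears in the record.
first_row = "ATGCCACCCG ACGACCACGA TCGACCAGCA CAGTCTCCCC GCGAACAGAT CGACGCGCTC"
print(translate(first_row))  # prints: MPPDDHDRPAQSPREQIDAL
```

The output corresponds to the first 20 residues of the protein sequence. Applying the same function to the full gene sequence should stop at the terminal TAG codon.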