Gene Htur_5043 details


Gene Information

Locus tag: Htur_5043
Symbol:
ID: 8745849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013748
Strand:
Start bp: 31879
End bp: 32928
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 47%
IMG OID: 646515657
Product: restriction endonuclease
Protein accession: YP_003406604
Protein GI: 284176328
COG category: [V] Defense mechanisms
COG ID: [COG1715] Restriction endonuclease
TIGRFAM ID:
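
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the stop codon is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 31879
end_bp = 32928
gene_length_bp = 1050
protein_length_aa = 349

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: one codon per residue plus one stop codon that is not counted as a residue.
expected_aa = span // 3 - 1

print(span == gene_length_bp)            # True
print(span % 3 == 0)                     # True
print(expected_aa == protein_length_aa)  # True
```

Under these assumptions the listed gene length, coordinates, and protein length agree with one another.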


Plasmid Coverage information

Num covering plasmid clones: 69
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATATGTA TGTCTGTACA AGAGGAAGAA CGGAGTGAAC TACTCCCACG GTTACAAAAT 
ATTGATCCGA TCGAATTCGA ACATTTTGTA GCTGATCTCT GGAGTCGACA AGGATGGGAA
ACAGAAGTAT CAACAGCATC TAATGACGAG GGTGTTGATA TTGTTGCCGA TAAACAAGTC
GGAGGAGTCG ATCATCGCCA AGTGATCCAG GTAAAACGGT ATAGCAATGG GAATAAAATT
GGACGTCCAG ATGTTCAACA GTATTACGCG CTCAAAGTAC AGGATGCAAA AGCGGATGCA
GCCGTTATCG TAACAACGTC GACGTTCACA TCAACCGCTA AAGAATGGGC AAGTGAACAT
AATGTCAAGC TTATTGACGG GGACGATTTG GTTGAGTTGA TTCAAGAGCA GCGTGCCTAT
GATCTTGTCG AAGAATATGC CCCATCGTTA TCGACGTCGT CAACTGACCC TGTCGAGCGA
TCGCAGATAA CCGAAACACA GACCGAATTG CCAGATCCAC TTGATGATGC AGAAGTGCGG
AAAAAAGCAG GTATTGGTCT GGGTGCCATC GGTCTCTATC TCATTCTGAA CCCGACTGGT
ATTGGCTATT CTATCGAGGC TGTCGGAATG CTATTTCTTC TCGGAGCAAT TGCTGTTGTG
AAGTTCCCCG AGCAGGTTTG GGCAGCTATC ACTCCAGATA AGGAAGTGAT CCGGGAATTC
TCGGATGGTG CAACGGTTAT TGAACAGAGT GAGACGGTTG AGTACGTTCC TGCAGATGAT
CGAGATCCAG TCGCATTCAA CGACTTTGAG GATATCCCAG AACGACGCCA ACAGGCGAAT
GTATACGGTT CTCTTGATCA GACATGGGGC CCTCTACAAG AACTCCCTCC AGGTAGTGTT
CCAACAGACA TTGCGGCACA AGGTCAGGGT ACTATCGTCG CGTACCGGTA TGCTGTACAC
TCAGAATCAC CAGCTTCAAT CGCACAAGAT ATGAAGATGA CCCAGCAGGA GGTCATTGAT
CATCTGACTA ATATTGCAAA ACCAGACTGA
 
Protein sequence
MICMSVQEEE RSELLPRLQN IDPIEFEHFV ADLWSRQGWE TEVSTASNDE GVDIVADKQV 
GGVDHRQVIQ VKRYSNGNKI GRPDVQQYYA LKVQDAKADA AVIVTTSTFT STAKEWASEH
NVKLIDGDDL VELIQEQRAY DLVEEYAPSL STSSTDPVER SQITETQTEL PDPLDDAEVR
KKAGIGLGAI GLYLILNPTG IGYSIEAVGM LFLLGAIAVV KFPEQVWAAI TPDKEVIREF
SDGATVIEQS ETVEYVPADD RDPVAFNDFE DIPERRQQAN VYGSLDQTWG PLQELPPGSV
PTDIAAQGQG TIVAYRYAVH SESPASIAQD MKMTQQEVID HLTNIAKPD