Gene Htur_5046 details

Gene Information

Locus tag: Htur_5046
Symbol:
ID: 8745851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013748
Strand:
Start bp: 37066
End bp: 38175
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 60%
IMG OID: 646515659
Product: integrase family protein
Protein accession: YP_003406606
Protein GI: 284176330
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
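
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete, unspliced and ends in a single stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 37066
END_BP = 38175
GENE_LENGTH_BP = 1110
PROTEIN_LENGTH_AA = 369


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of three.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold for this record: 38175 - 37066 + 1 = 1110 bp, and 1110 / 3 - 1 = 369 aa.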


Plasmid Coverage information

Num covering plasmid clones: 76
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCGGG AAAAAGCTAC GTCTCGAGAC CGCAACCCCA AGGGAAAGAC CGTCGAAGAG 
ACGGTGAACC GCTACCTCGA GAAGAAACTC GAGGCCGGCG GCAGCCGAGC TACGATGAAA
CCCGTCCTCG ACGACTTCGC TGACTTCTGC AAGAGCGAGG GAATCGAGTA CGTCGGCCAG
ATCGACTCGG GCGACTGTCG CGAGTACGGA CTACGTCTCA AGACGAAGAA AGCCAACGGG
GAAATCGCTG GGTCGACGGC GAATACGTAC TTCGCCTACG TCCGGGCGTT CCTCTCGTTC
TGTGTTCGCG ACGAGCTGCT CGATACGAAT CCCGCACAGA CCGAACGGGC GGAGGAGTTC
CTTCCCGAAG ACAAATCGAC CGGCGAGACC CAGTTCTGGG AACCTGAGCA GCGGAAGCGA
CTCCTCGAGT ACGCCGACGA ACGCGTCCGG ATGGCTCGCG AGGAGACCAT CGATGTCCCG
CTTGAACGGG CCTACCGTGA CCGAACCATC GTCATTCTGC TCGCGGAACT CGGCCTCCGT
GGGGCCGAAC TCTTTCGCGA CAGAAACGAC GATGCACGGA AAGGCCTCCG GTGGGATGAC
GTCGACCTCG AGCGCGGCCG GATCGAAGTG TACGGCAAGT CACGCGAGTA CGAACCTGTT
GGACTGACCG AGGCCGCACA CGACGCCCTG TCGCGGTTCG AGCGCGTTCA AGACCCACCG
ACCGACGAGT GGCCGTTGTT CCCGACGGAT CACGCTGCGA GCAAGTACAA AGCAGTCGAG
AACGCCACGG GCGATCGGCC GGAACCAGGT AGCGATATTG ATTCAATTCT TCGCGAGCGG
GAGATCATCC CACCGTCGAT CACCAAGGAG GCCGGTCGGC AGATTCTCAA GCAGCTCACC
GACGAGGCTG GTATCGAGGT CGATGGCGAC ACGAACTATC TGCAACCTCA CGGTGCGAGA
CGGGCGCTCG GTGCTGAACT GTACGAAAAA GGCCACTCCG AGTTAGCACA AAAGGCGCTC
CGACACGAAT CGATCGAAAC CACACATAAG GCGTATTCGG ACATCCAGGC TGAGAACGTA
GCTGACTCGA TTGATGAGGT ACGGGATTGA
 
Protein sequence
MSREKATSRD RNPKGKTVEE TVNRYLEKKL EAGGSRATMK PVLDDFADFC KSEGIEYVGQ 
IDSGDCREYG LRLKTKKANG EIAGSTANTY FAYVRAFLSF CVRDELLDTN PAQTERAEEF
LPEDKSTGET QFWEPEQRKR LLEYADERVR MAREETIDVP LERAYRDRTI VILLAELGLR
GAELFRDRND DARKGLRWDD VDLERGRIEV YGKSREYEPV GLTEAAHDAL SRFERVQDPP
TDEWPLFPTD HAASKYKAVE NATGDRPEPG SDIDSILRER EIIPPSITKE AGRQILKQLT
DEAGIEVDGD TNYLQPHGAR RALGAELYEK GHSELAQKAL RHESIETTHK AYSDIQAENV
ADSIDEVRD
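
The two sequence blocks above are printed in groups of ten characters, so they need to be joined before use. The following is a minimal sketch, not part of the record, of how one might strip the spacing, compute GC content and translate the gene sequence for comparison with the listed protein. The helper names are hypothetical. The codon table is the standard one; translation table 11 is assumed here to use the same codon-to-amino-acid assignments and to differ only in permitted start codons.

```python
# Standard codon table, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 21 bases of the gene sequence, as printed in the record.
    fragment = clean_sequence("ATGAGCCGGG AAAAAGCTAC G")
    print(translate(fragment))
```

To check the whole record, pass the full gene sequence block through `clean_sequence`, compare `translate(...)` with the cleaned protein sequence, and compare `gc_fraction(...)` with the reported GC content.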