Gene Htur_0049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_0049 
Symbol 
ID8740612 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp54517 
End bp55920 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content62% 
IMG OID646510612 
Productprotein of unknown function DUF21 
Protein accessionYP_003401623 
Protein GI284163344 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGTTAT CTCTGTCGCC CGCGGTCGTA GCCGCCGCAT ACGAGGTCCC AGTCGTGGGT 
CTCGAGTTCG ACGAGTCGAT GGTGACGATT CTCGGCAGCG TCGCCATACT CCTGCTTATC
GCGCTCTCTG CGTTCTTCTC CTCATCGGAG ATCGCGATGT TCAACCTGCC GAAACACCGC
CTCGAGGGGA TGGTCGAGGA CGGCGTCCCA GGCGCCAAAC TGGTCAAGTC CCTCAAGGAC
GATCCCCACC GGCTTCTCGT GACGATCCTC GTCGGTAACA ACATCGTCAA CATCGCGATG
TCCTCGATCG CGACGGCGAT CCTGTCGCTG CACTTCGGCG GACTGGTCGG CGTGTTTCTG
GCGACGTTCG GGATCACCGC GCTCGTCCTC CTGTTCGGCG AGAGCGTTCC CAAGTCCTAC
GCCGTCGAGA ACGCCGCACC GTGGTCGATC CGAATCGCCA GACCGCTGAA GGCGACGGAG
TACTTCCTGT TCCCGCTGAT CGTCCTCTTC GACTATCTCA CTCGACAGGT CAACAAGCTC
ATCGGCTCGA CCGGTGCGAT CGAGTCGCCC TACGTCACCC GCGACGAGAT CCAGGAGATG
ATCGAATCCG GCGAGCGCGA GGGCGTCTTG GAGGAGGAAG AACACGAGAT GCTCCAGCGG
ATATTCCGCT TTAACAACAC TATCGTCAAG GAGGTCATGA CCCCCCGCCT CGACATGACC
GCGGTCCCCA AGGACGCCGG CATCGACGAG GCCATCGAAA CCTGTATCCA GAGCGGCCAC
GCCCGCGTGC CGGTCTACGA GGGCAGCCTC GACAACGTCC TCGGCGTCGT CCACATTCGC
GACCTCGTCC GCGATCTCAA CTACGGCGAG ACGGAGGCCG ACGACCTCGA ACTCGAGGAC
CTCATCCAGC CGACGTTACA CGTCCCCGAG TCGAAGAACG TCGACGAACT GCTGACCGAG
ATGCGGGAAA ACCGGATGCA CATGGCAATC GTTATCGACG AGTTCGGCAC CACCGAGGGG
CTGGTCACCG TCGAGGACAT GATCGAGGAA ATCGTCGGCG AGATCTTGAA ATCCGGCGAG
GACGAACCGA TCGAACAGCT CGACGACCGC ACCGTCATCG TCCGCGGCGA GGTCAACATC
GAGGACGTCA ACGAGGCCTT AGAGATCGAC CTCCCCGAGG GCGAGGAGTT CGAGACCATC
GCCGGTTTCA TCTTCAACCG CGCGGGCCGG CTCGTCGAGG AGGGCGAGGA GATCACCTAC
GACGGCGTCC GTATCACCGT CGAGACCGTC GAGAACACCC GCATCATGAA AGCCAGACTG
CGAAAACTCG AGCAGCCGAC CGAATCCCTC GAGGAGGCGC CGGAGGAAGC CGAGTCCGAC
GAGGAGTCGG TCCCCTCGGA GTAG
 
Protein sequence
MALSLSPAVV AAAYEVPVVG LEFDESMVTI LGSVAILLLI ALSAFFSSSE IAMFNLPKHR 
LEGMVEDGVP GAKLVKSLKD DPHRLLVTIL VGNNIVNIAM SSIATAILSL HFGGLVGVFL
ATFGITALVL LFGESVPKSY AVENAAPWSI RIARPLKATE YFLFPLIVLF DYLTRQVNKL
IGSTGAIESP YVTRDEIQEM IESGEREGVL EEEEHEMLQR IFRFNNTIVK EVMTPRLDMT
AVPKDAGIDE AIETCIQSGH ARVPVYEGSL DNVLGVVHIR DLVRDLNYGE TEADDLELED
LIQPTLHVPE SKNVDELLTE MRENRMHMAI VIDEFGTTEG LVTVEDMIEE IVGEILKSGE
DEPIEQLDDR TVIVRGEVNI EDVNEALEID LPEGEEFETI AGFIFNRAGR LVEEGEEITY
DGVRITVETV ENTRIMKARL RKLEQPTESL EEAPEEAESD EESVPSE