Gene Htur_3892 details


Gene Information

Locus tag: Htur_3892
Symbol:
ID: 8744520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013744
Strand:
Start bp: 128183
End bp: 130192
Gene Length: 2010 bp
Protein Length: 669 aa
Translation table: 11
GC content: 68%
IMG OID: 646514476
Product: Beta-galactosidase
Protein accession: YP_003405423
Protein GI: 284167145
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1874] Beta-galactosidase
TIGRFAM ID:
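
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 128183
end_bp = 130192

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2010
print(protein_length)  # 669
```

Under these assumptions the computed values agree with the reported Gene Length (2010 bp) and Protein Length (669 aa).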


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.113241
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAATCG GAGTCTGTTA CTTCCCGGAG CACTGGCCGC GCGAACAGTG GGAGACGGAT 
GTTCGGCAGA TGGCCGACGC TGGAATCGAG TACGTCCGGA TGGCGGAGTT CTCGTGGCGG
GTCCTCGAGC CCGAACGCGG CGCGTTCGAC TTCGAGTGGC TCGACGAGAT CGTCGAGCTG
ATTGGCGAGT ACGGGATGCA GGCCGTCCTG TGTACGCCGA CGGCGGCACC GCCGCGGTGG
CTCGTCGAGG AACGCCCCGA GATCCGCCAG CGAGACCGTG ACGGAACCGT CAGGGACGTC
GGAAGCCGCC GGCACTACTG TTTCAACTCC GCGGCCTATC GCGAGGAGAC CGAACGCGTC
GTCCGAGCGA TGGCGGAGCG CTACGCCGAC GATCCGCGCG TCGTCGGCTG GCAGACCGAC
AACGAGTACG GCTGTCACGG CACGACACGG TGTTACTGCG ACGACTGCGC TGACGCGTTC
CGGGACTGGG TCCGCGAGGA GTACGAGACC GTCGACGAAC TCAACGAAGC GTGGGGAACG
ACGTTCTGGA GCCAACAGTA CGACGACTTC GAACAGGTCG ACCTCCCGCG GCCGACGCCC
GCGCAGGACC ATCCGGCGAT GCTGCTCGAT TTCGCCCGAT TCTCGAGCGA CAGCGTCGTC
AAGTACAATC GGCTGCAGGC GGACCTGCTT CGCGAGGCGA ACGACGAGTG GTTCGTCACG
CACAACTTCA TGAACCTGTT CGAGTCGGTG GACACCTACG ACTTCGACGA GGACCTCGAT
CTGATTTCCT GGGACTCGTA CCCGACAGGC CACGTCCAGC AGGCCGGCGG CGAGACGACG
ACCGACGAGC TCCGCGCGGG GAACCCCGAT CTGCTCTCGT TCAACCACGA CCTCTACCGG
AGCGTACTCG ACCGACCGTT CTGGGTGATG GAACAGCAAC CGGGCGACGT TAACTGGCCG
CCCCACTCGA CCCAGCCCGC GGAGGGGGCG ATGCGCCTCT GGGCCCACCA CGCGACCGCC
CACGGCGCCG ACGCCGTCGT CTACTTCCGC TGGCGGCGCT GCCTCGAGGG CCAGGAGCAG
TACCACGCCG GCCTCCGCAA GCAGGACGGG TCGGCGGATC GGGGCTACGA CGACGCGACG
CGGGCCGCCG AGGAACTGTT CGACCTCGAC CACGTCGACG CGCCCGTCGC CCTGCTTCAC
GACTACGAGA ACGCGTGGGC GCTCGGCGAA CAGCCCCACG CGCCCGACTT CGACTACTGG
CAGCTGTTGC AGTCGTTCTA CGCGTCGCTG CGAGCGCACG GCGTGCAGGT CGATATCGTC
CATCCCGAGA GCGACCTCGA GTCCTACGAC GCGGTCGTCG CGCCGACGCT CCACCTGGCG
ACCGAGTCGC TGGCCGACCA CCTGACCGCG TACGTCGAAT CCGGCGGCGA ACTGCTGCTC
GGCCCGCGGA CGGGGGTCAA AGACGCGCAC AATAAGCTCC GTCCCGATCT CCAGCCCGGT
CCGCTGTCGG AGCTCGTCGG CGCGAGCGTC GACCAACACG AGTCGCTCCC GACGCAGTTC
GAGCCGACCG TCGCCGGAAC CGACGGGACG AACGCCGAGT ACGCGTTCCG AACGTGGGCC
GAGTGGCTCG AGGCCGACGC GGCCGAGCCG CTACTCGAGT ACGCGGGCGA CGATATCGAA
GGCGGACGGA CGGCGGCCGT CCGAAACGCG GTCGGTGAGG GAAGCGTCGT CTACTGCGGT
GTCTGGCCCG AGACCGACCT CGCGAACGAC CTCGTCGGGT CGCTCCTCGA CCGCGCCGGC
GTCCGCCGGA TGGACGTGCT CCCCGACGGC GTTCGCGTCG CCCGACGCGA CGGCCACACC
TGGGTGCTGA ACTTCGGGAG CGACCCGATC GCGGTGACCC TCGAGGGGGA CGCGTCGTGG
CGACTCGGCG GTCCGGAAAT CGGTCCGTTC GATCTCGCGA TCGCCGAGAC CAACGCGGTC
GACGACCTCT CGGTACGGAT CCGAGACTAG
 
Protein sequence
MSIGVCYFPE HWPREQWETD VRQMADAGIE YVRMAEFSWR VLEPERGAFD FEWLDEIVEL 
IGEYGMQAVL CTPTAAPPRW LVEERPEIRQ RDRDGTVRDV GSRRHYCFNS AAYREETERV
VRAMAERYAD DPRVVGWQTD NEYGCHGTTR CYCDDCADAF RDWVREEYET VDELNEAWGT
TFWSQQYDDF EQVDLPRPTP AQDHPAMLLD FARFSSDSVV KYNRLQADLL REANDEWFVT
HNFMNLFESV DTYDFDEDLD LISWDSYPTG HVQQAGGETT TDELRAGNPD LLSFNHDLYR
SVLDRPFWVM EQQPGDVNWP PHSTQPAEGA MRLWAHHATA HGADAVVYFR WRRCLEGQEQ
YHAGLRKQDG SADRGYDDAT RAAEELFDLD HVDAPVALLH DYENAWALGE QPHAPDFDYW
QLLQSFYASL RAHGVQVDIV HPESDLESYD AVVAPTLHLA TESLADHLTA YVESGGELLL
GPRTGVKDAH NKLRPDLQPG PLSELVGASV DQHESLPTQF EPTVAGTDGT NAEYAFRTWA
EWLEADAAEP LLEYAGDDIE GGRTAAVRNA VGEGSVVYCG VWPETDLAND LVGSLLDRAG
VRRMDVLPDG VRVARRDGHT WVLNFGSDPI AVTLEGDASW RLGGPEIGPF DLAIAETNAV
DDLSVRIRD
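
The protein sequence corresponds to the gene sequence translated codon by codon. The following is a minimal sketch of that translation, not the tool that produced this record. It assumes the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in which start codons are permitted, which this sketch does not model). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above, in the same 10-base grouping.
first_line = "ATGTCAATCG GAGTCTGTTA CTTCCCGGAG CACTGGCCGC GCGAACAGTG GGAGACGGAT"
last_line = "GACGACCTCT CGGTACGGAT CCGAGACTAG"

print(translate(first_line))  # MSIGVCYFPEHWPREQWETD
print(translate(last_line))   # DDLSVRIRD
```

The two outputs match the first 20 and the last 9 residues of the protein sequence. Passing the whole gene sequence to `translate` works the same way, since whitespace and line breaks are stripped before translation.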