Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_1305 |
Symbol | |
ID | 7399400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 1316216 |
End bp | 1318012 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643708369 |
Product | Bacterio-opsin activator HTH domain protein |
Protein accession | YP_002565967 |
Protein GI | 222479730 |
COG category | [R] General function prediction only |
COG ID | [COG3413] Predicted DNA binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGAGG GACTTGACGC CGAGGAGTAT GAGGCGCTCG TGACCGGGGC GGAGACGTAC CGCGCCGCGC TGGTCGTCCG TCTCGGCGGC GAGGCGGGGC TCCGAACTGG CGAGATTACG TGCGTCACCC CGGGACACCT CCGTGAGACG GAAGGCGACG CGGATCTCTC CCTCCTCGCG GTTCCATCGG ACGACACGGA AGGGGAAACC AAAGCGGGCG ACGATCCCGA GACGGAGGGC GCACCACGGG ACCACAGCGG GATCGACCGC GAAACGGCGA TTCCGGCGTC ACTGGCAGCC GAGCTGCGGC GGTACGCAGA GAGCGCCGAC CTCCGCGAGT CGGAGCCATT CGTCGACGTG TCGCCCCGCC GCGTCCAGAT GATCGTGAGC GAGACCGCAG AGCACGCGGC CGCGCGGACC GACGGCCTCG TCGACCCCGA CGTGACCCCG CGAGATCTCC GACGGACCTT CGCGCGACGA CTCCTCGTCG ACCGCGGCGT CGACCCCCAC GCGGTTCGCG AGGCCGGCGG CTGGGAGACG ATGGCGACGC TCGACGGTTA CCTCGGGGCG CTCGACGGGG ACGCAATCGC CGAGGCGATC GCCGGCGATC GGGCCGGGTC CTCGGACGGA CCGGCGGCAG AATCTGCGCC CACCACCCTT GGCGGGTTCG AGGCGCTCGC CGACGGAGAC GACCGGAAGA CGCCCCTCGC GACGGTTCCC GGCGGCGTAG TTGAGGCCGA TCGCTGGGCC GAGGCGTGGG TCGCCCGCGG GATGGGGGAC CGAGACCGCG TCGAGATCGC GGGCGCGGCC GGGGCCGACC GAGAGACCCT CGTCGATCGC GGTGCGACCG CCGACGGTCC GTGTCGTGAC GCGGTCGAGG CGGGAGAGCC GGTCGCGACC GAGGGATCGC CCGCGACCGC GGGTCGACCG GCGATCGCGG TTCCCGTTCG GTACCGCGAC GTGACGCACG GCGCGCTGTG CGTCGTTGCC GGCGGAGAGC CGCCAGTCTC TCCCGTCTCG CCGGCCGAGC GCCGGGAGAT CGCGGCGCTC GGCCGGTGCC TCGGGTGGGC CGTGACTGCG GGGCGCTGGC GCGACCTGCT CCACTCCGAC GCGGTGACCG AGGTGGAGTT CCACACCGGC GACGAGGGTG CGTTCCTCTC CCGCGCGAGC GCGGCGCTCG GCTGTCGGAT CGATCTGGCC TCGACGGTGG CGGTCGACGA CGACGCCTCT CGCTTCTACC TCTCCGTCGA AGGGGCGCGC CCGCAGGCGC TGGCCGACGC AGTCGCGGGT GCGTCCGGCG TCTCGGATCT CCGCGTGATC GAGACCCGAG AAGACGGTTG CGACGTGTCC GTGCGCGTGG AGGGCGGGTC GGCGGTCCGA GCGCTCACCG AGCACGGCGC GACCGTTCGC GACGCGACCG CGGAGGACGG GCGGGTCAGG GTCGTCGCAG ACCTTCCGGA AGGCGCCGAT GTCCGCCCCG TCGCCGACGG GTTCCGGGCT ACCTTCGCCG ACGCGCGACT CGCGAGCAAG GAGTCTGTCG CGCGGCCGGC CCGTAGCGAG GACTCGTTGC GAGACGGAGT CGCGGAACGG TTCACCGACC GCCAGTGGGC CGCGCTGTCG GCTGCGTACC ACGGCGGATA TTTCGACTGG CCGCGCGGGA GCACCGCCGA GGAGGTCGCC GACGCCATGG ACGTCTCCTC GCCGACGTTT CACAATCACC TCCGGAAGGC CCAACGCAGA CTGCTCGACG AGCTCTTCGA GGACGGCCGG CGGGCCCGTC GCCTCGATCA GGGGTGA
|
Protein sequence | MVEGLDAEEY EALVTGAETY RAALVVRLGG EAGLRTGEIT CVTPGHLRET EGDADLSLLA VPSDDTEGET KAGDDPETEG APRDHSGIDR ETAIPASLAA ELRRYAESAD LRESEPFVDV SPRRVQMIVS ETAEHAAART DGLVDPDVTP RDLRRTFARR LLVDRGVDPH AVREAGGWET MATLDGYLGA LDGDAIAEAI AGDRAGSSDG PAAESAPTTL GGFEALADGD DRKTPLATVP GGVVEADRWA EAWVARGMGD RDRVEIAGAA GADRETLVDR GATADGPCRD AVEAGEPVAT EGSPATAGRP AIAVPVRYRD VTHGALCVVA GGEPPVSPVS PAERREIAAL GRCLGWAVTA GRWRDLLHSD AVTEVEFHTG DEGAFLSRAS AALGCRIDLA STVAVDDDAS RFYLSVEGAR PQALADAVAG ASGVSDLRVI ETREDGCDVS VRVEGGSAVR ALTEHGATVR DATAEDGRVR VVADLPEGAD VRPVADGFRA TFADARLASK ESVARPARSE DSLRDGVAER FTDRQWAALS AAYHGGYFDW PRGSTAEEVA DAMDVSSPTF HNHLRKAQRR LLDELFEDGR RARRLDQG
|
| |