Gene Information |
Locus tag | Hlac_3482 |
Symbol | |
ID | 7402328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012030 |
Strand | + |
Start bp | 226640 |
End bp | 229624 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643710023 |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_002567589 |
Protein GI | 222481353 |
COG category | [R] General function prediction only |
COG ID | [COG3413] Predicted DNA binding protein |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
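The coordinate and length fields above are mutually consistent, which a short Python sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record does not state either convention explicitly, and the function names are illustrative only.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(226640, 229624)
print(length)                     # 2985
print(protein_length_aa(length))  # 994
```

Both values match the Gene Length (2985 bp) and Protein Length (994 aa) rows of this record.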
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGGACT TGACGGCTGC ACTCCGCGAG ACGCTCGACA CGTTCGCGCC CGACGGGACG CCGCTGACGA CGAGCGAGGT GGCTGAGGCA CTCGACCTCG GTCGACGGAG CACGTACGAC AGACTGGACC GACTCGTGGA CGCCGATGAA CTACGCACGA AGAAGGTCGG CGCGAGCGCG CGCGTCTGGT GGCGGAACGA CACAGGTGAC GCCGCTCTAC CTGACGCTAA AATCGCTGAC GCCTCGACGA ACGGAGCCAG TACCGATGCG GGAGCGGGTG CGGTCGAGAA TCCCCCGCTC GTGGACGTGC TCGAGTCCTC GCCGACCGGC GTGGCGGCGT TCGCGGACGA CGGCACGTGC GCGACCGCCA ACGAGCAGTT CCGGACGCTG TTCGGGCTCG ACGGCGACGT CTCGCTCGAG GCTCTCGAAG CGGTGCCGTT GACCGACCAC GACGGTGACG TGATCGGCGC CGCCGACCGA CCGGTTCGTC GAGTTCGTAG GACGAACCGT CCGGTCGTCG ACGAGCACGT ACGCGTCGAC ACCGACGACG GGCATCGATG GGTGTCGATG ACTGTACAGG AGTCCGACGA CGGAGTCGTC GTCACGGCGT CGGACGTGAC GGGCGTCGTC GAACGGTCGC GTCGACTGCG ACGGGAGCGT GACGCTGTCG CGGCCGAGCT ATCGGAGTTC GCGTCCCACG CGGTCGACAG CCGACTGAAG CTGGACGAAG ACGGGACGGT GCTGTCCGTC GACGACCGTG CCGCGGCACT CCTCGAACCG GACGTGGTAG AACTCGTCGG TGCGTCGACG AGCGAGGCGT TCGATGCACT CAGGGGCGCA AGCGCGATCG TCGACGCGGC GCTCGAGTCG GAGTCGAAGC AGACGGGCGA CTGTCGGCAC GGCGACCGCG ACGCGTGGTT CGAGGTCGAG GCCGTCCCGA CGACCTCGGG CGCGTCCGTG TTGCTCCGGG AGGTCACCGA ACAGGTGGAG CACGAACGCG AACTCCAGCG GTACGTCGGC GTGGTCGACG CACTCGGCGA ACCGGTGTAC GAGCTCGATA GCGAAGGCCG GTTCGCGTTC GTCAACGACG CCATCACGGA GCTCTCGGGG TACTCGCGCG AGGAACTGCT CGGCGAGCAC GTGTCGCTCG TGATACCGGA CGATGCCGTG GACCGCATCG AGCCGCAGAT CAGCGAACTC TTGACGGAGG ACGCCCCGGA CCGAGTGCGC TCGGAGTACC ACGTGACCAC GAAGCGCGGG CACGCAGTGC CCGTCGAGAA CCGGTTGACG GTGCTCACGG ACGAAGCGGG GAACGTCCGC GGGAACGCCG GCTTCGTCAG CGACATCACT GAACGCAAGG AACGAGAGCG GGAACTGGAG CGCTACGAGC GCATCGTCGA GACGGTCGAG GACGGCATCT ACGTGCTCGA CCAGGAGGAC CGTTTCGTCG TCGTGAACGA CGCGTTCGCG TCGATGGCTG GCGTCGACGG CGAGGATATC ATCGGCCAGA AGGCGTCCAT CGTCTTCGAC GAGACGTTCG CGGAGCGAGT GAACGACAGG AATGCGGCGC TGTCAGCGGA CACGCTCGAG AGCGCGAAGT TCGAAGAGAC GTTCGCGCCG GTCGACGGCG ACCCGCTCGT GGTGGAGACG CGGTTCACGA CGTTCGCGTC CAAGGACGGG AACACCGGAC GCGTCGGTGT GGTGCGGGAC GTCGGGGAGC GCGTCGAGCG GGAACGACGC ATCGAACGCC AGCGGGCGCG TCTCGAGGCG CTCAACGAGG TGAACGCGGT GGTTCGCGAC 
GTCGCGACGG GAGCCATCGA CGGGTCAACT CGCGAGGAGA TCGAGACGAT GGTCTGCAAG CGTCTCGCCG CGTCGAACGC CTACGAGTTC GCGTGGATCG GGGAGATGAC TGGCGTCGAC GGGTCGCTCG CCGTCCGAAC CGCGGCGGGT CTCGACGACC CGGACCGCGC TTCGCTGTCG GGGATGCTCG AGGGAGTCCG TGGTCGCGGT GCGGTATCTC GTGCGGTCCG CGACCGGACG GCACAGACGG TCCAGGACGC GTCGTTGTTG CCGACGTCGG ACCCATGGCG GGCGCTCGCC GCCAGGTTCG GGTTCCGGTC GGCGATGGCG ATCCCGATAA CGCACGACGG CCGGATGTTC GGCGTCCTGA ACGTGCATAC CGACCGGGAG TCCGCGTTCG CTGACGAGGA ACGACGCCTC GTGGAGCACA TCGGTGAGGT CGTCGGGCAC GCGATCGCGG CCGTGGAGCG CAAGCGTGCG CTCGCGAGCG AAGCCGTCCT GGAACTGGAC TATCGGGTGC CGAGGGCGTT CTCGTCGCTC GACGTCTCCG AGTCGCTGTC GGGCACGCTC ACGTTCGACG AGACGGTGTC GACGAAGGGC GATGAGGTGC TCGTGTACGG AACGGCGTCG CCGACGGCGA TGGAGTCGCT GGCGTCCCTC GTCGACGAGG TACCGTACTG GGAGTCGGTG TCGGTCGTCG ACACCGACGA GGCCGGTAAC TCGAAGTTCG AGTTGCACGC GAAGGAACCA CCCGTGTTCT CGATGGTGAC GGCACGCGGC GGGTACGTGG ACGAGGTCGT GACCGTGGAC GGCAACTCCA GGTTCGTGTT GCACGTGCCG CCCGGACAGG ACGTCCGTGC GATCACTGAG GGGGTGTCGG CGGCGTATCC GTCGGCGGAA CTCGTCGCTC AGCGGCAGAT ATCTCCATCG CAGCCGTCGA TGGCGAGACT CCAGGACCGT ATCGCGGAGG ATCTCACGGA CCGTCAGCGT GCCGCTCTGT ATGCGGCGTA CCACTCGGGG TTCTTCGAGT GGCCGCGAGC GGCGACCGGC GAGGACGTCG CAGAGTCGCT CGGGGTGGCA CCGCCAACGT TCAACCAACA CATCAGGAAG GCGGAGCGGA AGGTGTTCGA GGCGCTACTG GGGGAGGGGG GCGAGGAGTC CTCGGGTTGG ACTGACACCG AGTAG
|
Protein sequence | MGDLTAALRE TLDTFAPDGT PLTTSEVAEA LDLGRRSTYD RLDRLVDADE LRTKKVGASA RVWWRNDTGD AALPDAKIAD ASTNGASTDA GAGAVENPPL VDVLESSPTG VAAFADDGTC ATANEQFRTL FGLDGDVSLE ALEAVPLTDH DGDVIGAADR PVRRVRRTNR PVVDEHVRVD TDDGHRWVSM TVQESDDGVV VTASDVTGVV ERSRRLRRER DAVAAELSEF ASHAVDSRLK LDEDGTVLSV DDRAAALLEP DVVELVGAST SEAFDALRGA SAIVDAALES ESKQTGDCRH GDRDAWFEVE AVPTTSGASV LLREVTEQVE HERELQRYVG VVDALGEPVY ELDSEGRFAF VNDAITELSG YSREELLGEH VSLVIPDDAV DRIEPQISEL LTEDAPDRVR SEYHVTTKRG HAVPVENRLT VLTDEAGNVR GNAGFVSDIT ERKERERELE RYERIVETVE DGIYVLDQED RFVVVNDAFA SMAGVDGEDI IGQKASIVFD ETFAERVNDR NAALSADTLE SAKFEETFAP VDGDPLVVET RFTTFASKDG NTGRVGVVRD VGERVERERR IERQRARLEA LNEVNAVVRD VATGAIDGST REEIETMVCK RLAASNAYEF AWIGEMTGVD GSLAVRTAAG LDDPDRASLS GMLEGVRGRG AVSRAVRDRT AQTVQDASLL PTSDPWRALA ARFGFRSAMA IPITHDGRMF GVLNVHTDRE SAFADEERRL VEHIGEVVGH AIAAVERKRA LASEAVLELD YRVPRAFSSL DVSESLSGTL TFDETVSTKG DEVLVYGTAS PTAMESLASL VDEVPYWESV SVVDTDEAGN SKFELHAKEP PVFSMVTARG GYVDEVVTVD GNSRFVLHVP PGQDVRAITE GVSAAYPSAE LVAQRQISPS QPSMARLQDR IAEDLTDRQR AALYAAYHSG FFEWPRAATG EDVAESLGVA PPTFNQHIRK AERKVFEALL GEGGEESSGW TDTE
|
| |
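The sequences above are displayed in space-separated blocks of ten characters. A minimal Python sketch for joining such blocks into one string and computing a GC fraction follows; the helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence, so its result is not the whole-gene GC content reported in the record.

```python
def join_blocks(blocked: str) -> str:
    """Remove all whitespace between display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base blocks of the gene sequence shown in this record.
prefix = join_blocks("ATGGGGGACT TGACGGCTGC")
print(len(prefix))                       # 20
print(round(gc_fraction(prefix) * 100))  # 65
```

Applying `join_blocks` to the full Gene sequence field before calling `gc_fraction` would give a figure to compare against the GC content row.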