Gene Hlac_3379 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3379 
Symbol 
ID7402231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012030 
Strand
Start bp131107 
End bp132456 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content58% 
IMG OID643709927 
Productputative PAS/PAC sensor protein 
Protein accessionYP_002567493 
Protein GI222481257 
COG category[R] General function prediction only 
COG ID[COG3413] Predicted DNA binding protein 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATCAT CGGGTTCTTT GGGGGATGTC TATGCCGAAA CGCTGGCCGT CTTCGACCGG 
CGCGACGACC CCTCTGAGCC ACTCACAACT CCAGAGGTGG CAAACTCGCT CGACGCGGCT
CGACGTACCG TCTACAAACG TCTAGAGAAG CTGGCGAGCA GTGGTGAGTT GAAAACCAAG
AAGGTAGGTG CCAACGCCCG GGTCTGGTGG CGATCGCATC CGGATGAAGG ATCGTCCGTA
AACGCGACCA ATACCCACGC GACGCCGACG CTGAGCAACG CCGAGCGCAC GGTCATCGAG
AAAATCCTTG AAGCCAGTCC GATCAGCATC GTGGTGGTCG AGCCGTCCGG GCAGATCTCC
CTCGCGAACG AACGGGCTGA AGAAATGCTT GAACTGGAGC GTGACGAGAT TACCTCTCGA
ACCTACCGCC AACCGGAGTG GAAGATCTAT TACGACGACG GCACGCCTGT CAGCGAGGAC
GAGCACCCCG TGACTCGCGT CCTGGAGACA AAAGAACCCG ATTACGGCTT CGAACACTGG
ATCGACCTCC CGAACGGAAC CGAACGCTGG CTGTCGAGCA ATTCGGCACC TGTATTGAGT
GAAGATGAAG AAGTTGAATA TGTTGTCGTG GGATTCGAAG ATACAACCCG GTTGAAAGAG
CGCGAGGACA AGCTGACGAG CGATAAACGC CGGGTGCTCG AACTCTATTC CAAGCAGTTA
TTCAGCCCGC TGCTCGACGT AGTTGACGGT GACATGCGCA TCGACGTTGA CGAAGTCGTT
CGCCTCCAAG ACGGGTCGGT CCTCCAGTAC ATCACCGGGA GGGGCATTTC GGCAAAAGAG
TTGATCGACG TGTTCGACCA GGCGTACGGT GTTGACGATA CCCGGCTGCT TCAGTCGAGC
GCCGATAAGT GCCGGGTTGA GGTCCACGTC GAGGCGCCGA CCGTGTCGCT AGTCTTTGCA
GAGTTGGGGG GACAGGTGAA ATCCTTGTTT CAGACCAACG GTGACGCAGG CCCTCTCCTC
ACGGCTGAAG TGCCAGGAGA TGTGGAAGCG AGGACGGCCG TACAGGCCGT CCGGAAGGTG
TATCCAGATA TCCGATTAGA GTCACAGGAA CTCCAGTACT CGCCGCGGCT CCTCTACGAC
ATCGTCGAAG ACGTGCTTAC CGAACGGCAG TTCACGTCAT TGCAGACGGC ATATTATGGC
GGGTATTTCG AGACGCCCCG GAAGAGCATC GGTGACGAAC TCGCCGAGCG GCTGGGGATC
ACCCGTCAAA CCTTCAATCG ACACCTTCGA CTGGCCGAGA ATACCGTCTT AGAGCAGTTG
TTCGAGGGGT CGGGAAAGGC CGTACGCTGA
 
Protein sequence
MSSSGSLGDV YAETLAVFDR RDDPSEPLTT PEVANSLDAA RRTVYKRLEK LASSGELKTK 
KVGANARVWW RSHPDEGSSV NATNTHATPT LSNAERTVIE KILEASPISI VVVEPSGQIS
LANERAEEML ELERDEITSR TYRQPEWKIY YDDGTPVSED EHPVTRVLET KEPDYGFEHW
IDLPNGTERW LSSNSAPVLS EDEEVEYVVV GFEDTTRLKE REDKLTSDKR RVLELYSKQL
FSPLLDVVDG DMRIDVDEVV RLQDGSVLQY ITGRGISAKE LIDVFDQAYG VDDTRLLQSS
ADKCRVEVHV EAPTVSLVFA ELGGQVKSLF QTNGDAGPLL TAEVPGDVEA RTAVQAVRKV
YPDIRLESQE LQYSPRLLYD IVEDVLTERQ FTSLQTAYYG GYFETPRKSI GDELAERLGI
TRQTFNRHLR LAENTVLEQL FEGSGKAVR