Gene Hlac_3570 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3570 
Symbol 
ID7402485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012030 
Strand
Start bp318350 
End bp321418 
Gene Length3069 bp 
Protein Length1022 aa 
Translation table11 
GC content53% 
IMG OID643710108 
Producthypothetical protein 
Protein accessionYP_002567674 
Protein GI222481438 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCACTG ACTTTCTCGA ATCTCGCTCT GGTCGTCCCT GGAACTACGA TCGGGACGGA 
GACAATTTTG AATTTTACCA AGGAAATGGG TCGAGTGCAT TGGAGGTTGT CGTCGTTGAT
CACGATGAAC GACCGACCAA AGGTTTTCTC CAGAAGACGT ACACAGATCG GCGTGGTGGG
CGAGTAAACC CCGTTCTCGT TGTTGCGCTT TATGACAATT ATGCCGGGCT ATGCGGTCCC
AGCGGGGAAG AGCCGCCCGT CTATCGGGAT GTCGACCGGG GACAAGCAGA TCGTGTCTGT
GATACCGCTC TTGACGAATC AAACCGACAC GCGGCCCAAC GGTTCCTCAC TGAAATGCTT
CCCCAGCTCG ACGAGGAACT GACTGGACTT CGGAATCAAG GTCTACTCTC AACGCACGAG
CTCAAAGTCG GTGTCCCCGA ACGCGACGAC TGGGAGGACG CAACCAACCG CGCCCAACAA
GCAATTGACG ATGACCCGCG TGAGATGATC AAAGGGCTCA ACTACGAGAT CGAGCACCTC
ACCGATCAGA GCTCTGTCCT GAAGGACACA AGCGATGGCC ACGAGCGAGC TGTGGCGATG
TTCCTTCAAG AAGACGAGTC GTTTGACCAC ACGCAAGAAC GCTTTGTAGG TCAGTCACCA
GTGGCCTATG CGCTCAATGA AGCCGACAAA CGCAATCTCG AATACGTCAT CGGAAGCAGT
GGCGATACGT TACGACTGTA CACAACAAAT CCCGATGCAG GGTTTGGCTC ACGCGGTCGA
ACAGACACCT ACGTCGAGGT GAACACGAGT CTCCTTGCAG ACGAGAAGGC TGCATATCTC
TGGCTGCTAT TCTCCGCCAA TGCACTCCGA GAAGACGGCA CGCTCCATGA CATTATGGAG
CGATCGAAAG ACTACGCAGC GGCTCTTGGA GAACGACTCC GTGAACGGAT TTACGACGAT
GTTGTGCCAG ATCTGGCGGA AGCGATCGCC CGTGCACGTG ATATCGACGA TCCGACGAAG
GAACAGTTGG ACGAGACCTA TGAGATGACG TTGGTGCTCC TCTACCGGTT GCTGTTCATC
GCTTACGCTG AAGACGAAGA GTTCCTCCCG CGACGGCGTA ACGAGCGGTA CGACCGGAAT
TCTCTCAAAC AGAAGGCGCA CGACCTCCAC GACTTCATTG AGGACGACGG TGACTTCGAC
GCCGGATTCT ACGACCACTG GGACGACGTG ATGCATCTTT CGCGGGCCGT CCACCGCGGA
CACGACGAGT TAGGACTTCC CGCTTACGAG GGGACCTTAC TCTCTGAAGA TCCGGATATC
TCTCAAGCCG GTGCGAAGCT GGCGGATATT CGACTTGATA ATGCCGATTT CGGACCTGTA
TTGGCGAACC TCCTGATCGA CGAAACAGGG GACGGCTATC AGGGGCCTGT CGACTTCAGG
AATATCGGTG TCCGAGATTT CGGGGTCGTA TACGAGGGTC TGCTGGAGTC TGAGTTGTCG
CTGGCCGAAC AGCCGCTTAC AATTGACAAC GAAGGACACT ATGTCCCAGT CGATATAGAT
GGTCAACAGA CACTCGGTGA TGATCACGAG GACATAGTTG TTGAAGAAGG AGAGGTCTAT
CTTCATGGTC AGTCCGGAGA GCGAAAAGCG ACGGGGACGT ACTATACGAA ATCTCGGTTC
GTTGAGCATC TTCTTGATCA CTCACTGGAA CCCGCACTGG ATGACCATCT CGAACGTATC
GACTGGCTCC GTGAGGAAGA GGGCGAACAC GCCGCTGCGG ATGCGTTCTT CGACCTTCGG
GTGTCGGATA TTGCGATGGG ATCTGGTCAC TTCCTTGTCG GGGCTGTCGA CCGCATCGAA
TCTCGACTCT ATGCGTATCT GACCGAGAAA CCGCTGTCGC CCGTCGAGGA CGAACTCGAC
AACCTCGAAG ACGCGGCATT AGATGCCTTC GAAGACGAAG AGTACGCACC ACCGGTCGAA
CGTGGACAGC TCCTGCGTCG TCAAGTTGCC CGTCGCTGTA TCTACGGAGT TGACATAAAC
CCACTTTCGA CCGAACTTGC ACGGCTATCG ATTTGGGTAC ATACGTTTGT TCCCGGTCTC
CCGTTAACGT TCCTTGATTA CAATCTCGTG ACTGGCGACT CGCTGGCTGG AATCGGAACG
CTCGACGAAG TGACGGATAT ACTTGATATC GAGCAGTCAT CATTAGGGAT GTTTGCTGGC
GGTCAGAGCA TAATGAACGA TATTCGAGAT GATATTGAAC AGTTGGGGAG CTTCGCCGAT
GCCAGTGCTG AACAGGTACA AGAAGCACGC AAAACTCGTG TAGAAATAGA AAAGAACCTT
GGCCAAGTTC GAGCGAGATT TGATATCTTA GCTGCGTCCC GTATAGATGA CGAGATAAAT
ACCGACCCAG TGTCTGACAC TGGCATCGAC GTGAGAGATC ACGAGAGCTA TGAACGAGCA
AAGGACGTTC TGGAGTCTAC GAATCCTCTC CATTTCCCCG CATCATTTCC AGAAGTCTTC
GATGGTGATG ACAGCGGATT CGATGTGATC GTTGGAAATC CGCCTTGGGA ACAAGCAAAG
ATCGAACGTG ACGAGTTTTG GCGAAGACAC TACCCAGGTC TGAGTGGACT GGACAAAAAC
GAACGTGAAG AGAAGATAAC AGAGCTTGAG ACTAAAAGAC CAGATCTGGC GCTACAACTG
GAGGAAGAAC GACGAGCACA ACGGCAACGA AGCCAGATTT TAACCAACGG TCCGTATCCT
GATATGGGTC GTGGGGATCC AGATCTATAT CAGGGCTTCA GTTGGCGCTT TTGGAGACTT
ATCAGTGATT CTGGTTACCT TGGTGTTGTT CTCCCCCGAG CCGCGTTTAT CAGCCCTGGT
GCGGAGACAT TGAGATATGA GATTCTTGAA AAAGGGAATG TCACAGACCT GACTTTCTTG
AAAAACGAAC GTGAGTGGGT TTTTGACAAC GTTGAACCAC GATATACGGT AGCTCTGTTT
ACTCTTCAAA AAGAACAGAT TACTGATGGT GAAGTTCCGA TACGTGGTCC CTCATCCGTC
TTTAGTTAG
 
Protein sequence
MITDFLESRS GRPWNYDRDG DNFEFYQGNG SSALEVVVVD HDERPTKGFL QKTYTDRRGG 
RVNPVLVVAL YDNYAGLCGP SGEEPPVYRD VDRGQADRVC DTALDESNRH AAQRFLTEML
PQLDEELTGL RNQGLLSTHE LKVGVPERDD WEDATNRAQQ AIDDDPREMI KGLNYEIEHL
TDQSSVLKDT SDGHERAVAM FLQEDESFDH TQERFVGQSP VAYALNEADK RNLEYVIGSS
GDTLRLYTTN PDAGFGSRGR TDTYVEVNTS LLADEKAAYL WLLFSANALR EDGTLHDIME
RSKDYAAALG ERLRERIYDD VVPDLAEAIA RARDIDDPTK EQLDETYEMT LVLLYRLLFI
AYAEDEEFLP RRRNERYDRN SLKQKAHDLH DFIEDDGDFD AGFYDHWDDV MHLSRAVHRG
HDELGLPAYE GTLLSEDPDI SQAGAKLADI RLDNADFGPV LANLLIDETG DGYQGPVDFR
NIGVRDFGVV YEGLLESELS LAEQPLTIDN EGHYVPVDID GQQTLGDDHE DIVVEEGEVY
LHGQSGERKA TGTYYTKSRF VEHLLDHSLE PALDDHLERI DWLREEEGEH AAADAFFDLR
VSDIAMGSGH FLVGAVDRIE SRLYAYLTEK PLSPVEDELD NLEDAALDAF EDEEYAPPVE
RGQLLRRQVA RRCIYGVDIN PLSTELARLS IWVHTFVPGL PLTFLDYNLV TGDSLAGIGT
LDEVTDILDI EQSSLGMFAG GQSIMNDIRD DIEQLGSFAD ASAEQVQEAR KTRVEIEKNL
GQVRARFDIL AASRIDDEIN TDPVSDTGID VRDHESYERA KDVLESTNPL HFPASFPEVF
DGDDSGFDVI VGNPPWEQAK IERDEFWRRH YPGLSGLDKN EREEKITELE TKRPDLALQL
EEERRAQRQR SQILTNGPYP DMGRGDPDLY QGFSWRFWRL ISDSGYLGVV LPRAAFISPG
AETLRYEILE KGNVTDLTFL KNEREWVFDN VEPRYTVALF TLQKEQITDG EVPIRGPSSV
FS