Gene Hlac_0820 details


Gene Information

Locus tag: Hlac_0820
Symbol:
ID: 7400786
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 817935
End bp: 819551
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 64%
IMG OID: 643707886
Product: AMP-dependent synthetase and ligase
Protein accession: YP_002565489
Protein GI: 222479252
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
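
The start, end, gene length and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (both are assumptions, though they agree with the listed values). The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(817935, 819551)
print(length, protein_length_aa(length))  # 1617 538
```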


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.594452
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAAAAC CGCTACTAAC GACGGACTTC TTGGACCGAG CACGGCGCCA CTACGCCGAC 
GAGGAGGCCG TCCTCGCCAC TGACGGGACG CGGTACACCT ACGCCGAGCT GGGCGAGCGC
GCGGACCGCT TCTCCGCCGT GCTTCAAGAG TGCGGGATCG AGAAGGGCGA CCGGGTTGCG
GTGTTGGACC CGAACACTCA CTACCACCTA GAAGCCGCCT ACGGCGCCAT GCAGATCGGC
GCAGTTCACA CTCCACTAAA CTACCGGCTC ACGCCCGACG ACTTCTCGTA CATGCTCTCC
GACGCTGGCG TCGACGCTAT CTACGCCGAC GCCGAATACG CCGCGAACGT CGAGGCGATT
CGCGAGGAGG TGCCAACCGA GACGTTCCTC ACGAACGACG CCGACGCGAT CGAGGGTGAT
TGGGAGTCGT TCGACGAGGC GCTCGCCGAC GCGAATCCCG ACGCCTACGA GCGCCCGGAG
ATGGATGAAG ACGACGTGAT CACCATCAAC TACACCTCCG GGACCACGGG CGATCCGAAA
GGGGTCTGTC GCACGCACCG CGCGGAGACG CTCCACGCCT ACCTGATCAC CATCCACCAG
GAGATCACCG ACGACGACGT GTACCTCTGG ACGCTGCCGA TGTTTCACGT CAACGGCTGG
GGACACATCT ACGCGATCAC GGGGATGGGC GCCCGTCACA TCTGTACCCG CGGCGTCGAC
GTCGAGGCCG TGTTCGACCG GATCCGCGCC GAGGACGTGT CGTACTTCTG TGCGGCGCCG
ACCGTGCTCA ACATGCTCGG CGACCACTAC GCCGACCACG GCGGCGCGAC GACCGGCGAC
AACGACGTGC GGGCAGCCAC CGCGGGCGCG GCGCCGCCGG AGGCAACGAT CCGCACCGTT
GAGGAAGAGT TCGGCTGGGA TCTCAAACAC GTGTACGGCG CGACCGAGAC GGGGCCGCTC
GTGACGACAT CGGATGCCAA GCGTCACTTT GACGCCGACT CGGACGACCG GTTCGCGGTC
AAGAAGACAC AGGGGATCGG CTACCTCGGT ACCGACGTGC GCGTCGTCGA CGAAAACGGC
GAGGACGTGG CTCCCGACGG CGAGACGATC GGCGAAATCG TTGTTCGGGG CAATCAGGTA
ATGGACCGCT ACTGGAACAA GCCCGATGCC ACCGAAGAGG CGTTCTCAGA GCGGCTGGAG
GGATACTACC ATATGGGAGA TCTGGCCGTC GTCGACGAGG ACGGCTTCGT CTCGATCCAA
GATCGAAAAA AGGACATTAT CATCTCTGGC GGGGAGAACA TCTCCTCGAT CGAGTTAGAG
GACACCCTCT TCGAGCACGA TGTCGTCTCA GACGTGGCCG TTATCCCCGC TCCCGACGAG
CGGTGGGGCG AGACCCCGAA GGCGTTCGTG GTCCCGGAGA GCGGCGACCC GGACGACGCG
GGTGCGACAC CGGAGGAGCT CAAGGCGTTC GTTCGAGAGC GCGTCGCTGA CTACAAGACT
CCGGGCGAGG TGGAGTTCGT CGCTGAACTT CCGACGACGG CAACCGGGAA GATCCAGAAG
TACGAGCTAC GCGAGCGCGA GTGGGACGAG GAGGACCGGA TGGTCGGGGA AGGGTAG
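
The GC content reported for this gene is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows. It assumes the sequence is pasted as shown here, in space-separated blocks of ten bases, and the function name is hypothetical. For brevity it is run only on the first 60 bases, so its result need not equal the whole-gene figure.

```python
def gc_content_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence only (not the full 1617 bp).
first_row = "ATGCGAAAAC CGCTACTAAC GACGGACTTC TTGGACCGAG CACGGCGCCA CTACGCCGAC"
print(f"{gc_content_percent(first_row):.1f}% GC in the first 60 bases")
```

Passing the complete gene sequence instead of `first_row` gives the whole-gene value to compare against the listed GC content.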
 
Protein sequence
MRKPLLTTDF LDRARRHYAD EEAVLATDGT RYTYAELGER ADRFSAVLQE CGIEKGDRVA 
VLDPNTHYHL EAAYGAMQIG AVHTPLNYRL TPDDFSYMLS DAGVDAIYAD AEYAANVEAI
REEVPTETFL TNDADAIEGD WESFDEALAD ANPDAYERPE MDEDDVITIN YTSGTTGDPK
GVCRTHRAET LHAYLITIHQ EITDDDVYLW TLPMFHVNGW GHIYAITGMG ARHICTRGVD
VEAVFDRIRA EDVSYFCAAP TVLNMLGDHY ADHGGATTGD NDVRAATAGA APPEATIRTV
EEEFGWDLKH VYGATETGPL VTTSDAKRHF DADSDDRFAV KKTQGIGYLG TDVRVVDENG
EDVAPDGETI GEIVVRGNQV MDRYWNKPDA TEEAFSERLE GYYHMGDLAV VDEDGFVSIQ
DRKKDIIISG GENISSIELE DTLFEHDVVS DVAVIPAPDE RWGETPKAFV VPESGDPDDA
GATPEELKAF VRERVADYKT PGEVEFVAEL PTTATGKIQK YELREREWDE EDRMVGEG
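
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below shows one way to reproduce it, assuming the reading frame starts at the first base. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons it permits. This sketch does not model alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]  # KeyError on non-ACGT symbols
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, matching the first 10 residues.
print(translate("ATGCGAAAAC CGCTACTAAC GACGGACTTC"))  # MRKPLLTTDF
```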