Gene Hlac_2212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2212 
Symbol 
ID7401147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2194713 
End bp2196770 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content68% 
IMG OID643709284 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_002566859 
Protein GI222480622 
COG category[I] Lipid transport and metabolism 
COG ID[COG1022] Long-chain acyl-CoA synthetases (AMP-forming) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.169636 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTGGC AAGAGGCGGA GCGGGAGTTC GCGGACGACG TGGTCGTCCG GGAGACGCTC 
CCGCGCATGT TCGAGCGGTC CGCCGAGCGC AACGCGGACC GGCTTGCCCA GCGGTACAAA
GGCGGTATCT ACGACCGCTC GCTGGTCTCT AGCGGGGTGC TCCCGCCCGC CTCGGAGGGC
GAATACGCCG ACGTGACCTA CGCCGAGATG CGCGCAATCG TCCGGCGGCT CGCCGCCGGG
TTCCGCGAAC TCGGCGTCGA CGACGACACC CGCGTCGCCC TATACGCGCA GACTCGGATG
GAGTGGGCCC AGACCGACTT CGCCGTGCTG GCCGCGGGCG CGACGGTCAC GACCGTGTAC
GCCTCCTCCT CGCCGAACCA GGTCCGCTAC CTGATCGAGG ACCCGGAGGC GACCGTCGTG
GTCGCGGAGA ATCGCGAGTT GCTCGACGAG GTGTTCGCGG TTGTCGACGA CCTCGAACAC
GAACTCGACG CGATCGTGAC GTTCGACGAC GTGGACACGA CCGACCTCGA CGCGGAGATC
GGTACCGATT CAGAGAACCT GTCCACCGAC GACGTGTACA CCCTCGGTGA GGTCCACGCC
CTCGGCGACG AGGCGTTCGA CGAGGACGCC TACCAGTCGT GGATCGACGC GGTCGACGTC
GGAGATCTTG CGAGCCTCAT CTACACCTCC GGGACCACGG GCCAGCCCAA GGGCGTCCGG
CTCACTCACG CCAACTTCAG GGAGAACGTC TCGCAGTGTT ACCGGCGCTT CGCGGATCGC
CCGGACCGCG ACAGCGACGT ACCGGGGATC AGCGCCGAGT CGACGACGCT CTCGTTTCTC
CCCCTCGCGC ACGTCTTCGA GCGGATGGCG GGCCACTACA TGATGTTCGC GGCGGGCGCG
ACCGTCGGCT ACGCCGAGAG CCCCGACACC CTTCGGGAGG ACTTCGGGCT GGTACGCCCG
ACGACGACCA CGAGCGTCCC GCGCGTCTAC GAGAAGCTGT ACGACGCGAT CCGCGAGCAG
GCAGGCGAGT CTCCCGTGAA AGAGCGGATC TTCGAGTGGG CAGTCGGCGT GGGGAAGGCC
CACCACGAGG CCGACGAACC GGGCGCCGTA CTCAACGCCA AGTGCGCGGT CGCCGACCGG
CTCGTCTTCT CGTCGGTTCG CGAGGCGATC GGCGGTAACA TCGACTTCTT CATCTCCGGC
GGCGGGTCGC TGTCGGCGGA GCTGTGCGCG CTGTACCACG CGATGGATCT GCCGATCCTG
GAGGGGTACG GGCTGACGGA GACCTCCCCC GTCATCAGCG TCAACCCGCC CGAGGAGCCG
AAGGTCGGCA CCATCGGTCC GCCGGTGGTC GACACCGAGG TCGCGATCGA CGGCGCGGTC
GTCGGCGAGG AGGTCGCCGA TCTGCCCGGC GACGTCGGCG AACTGCTGGT TCGCGGTCCA
CAGGTGACCG ACGGCTACTG GAACCGCCCG GACGCGACCG CCGAGGCGTT CACCGACCCG
GACCGGCTCC CGGAGGACGC GGTCGTGGCG GGCGACCCGC CCGAGGAGCG CGGCGGCGAC
CCGGACGACC CCTGGTTCCG CACCGGCGAC ATCGTCCAGC TCCGACCGGA CGGGTACATC
GCGTTCCGCG AGCGCGCCAA GCAGCTGCTC GTGCTCTCGA CCGGGAAGAA CGTCGCGCCC
GGTCCGATCG AAGACCGGTT TGCCGCCAAC GAGTTCGTCG AACAGTGCGT CGTGCTCGGC
GACGGCCGGA AGTTCGTCTC CGCGCTGATC GTCCCGAACT TCGAGAAGCT GGCGGCGTGG
GCCGACACCC GCGGGATCGA TATCCCCGAG GACCCCACGG GGATCTGTCG CGACGACCGG
GTCCGCGAGC GGATCCAAGT GGAGGTCGAC CGCGTCAACG AGGAGTTCGA GTCGTACGAG
CAGATCAAGC GGTTCCGGCT CGTCAAGGAG GAGTTCACCG AGGAGAACGA CCTTCTCACC
CCGACGATGA AAAAGAAGCG GCGTAACATC TTGGATCGGT TCGGCGACGA GATCGAGATC
ATTTACGAGG ACGCGTAG
 
Protein sequence
MNWQEAEREF ADDVVVRETL PRMFERSAER NADRLAQRYK GGIYDRSLVS SGVLPPASEG 
EYADVTYAEM RAIVRRLAAG FRELGVDDDT RVALYAQTRM EWAQTDFAVL AAGATVTTVY
ASSSPNQVRY LIEDPEATVV VAENRELLDE VFAVVDDLEH ELDAIVTFDD VDTTDLDAEI
GTDSENLSTD DVYTLGEVHA LGDEAFDEDA YQSWIDAVDV GDLASLIYTS GTTGQPKGVR
LTHANFRENV SQCYRRFADR PDRDSDVPGI SAESTTLSFL PLAHVFERMA GHYMMFAAGA
TVGYAESPDT LREDFGLVRP TTTTSVPRVY EKLYDAIREQ AGESPVKERI FEWAVGVGKA
HHEADEPGAV LNAKCAVADR LVFSSVREAI GGNIDFFISG GGSLSAELCA LYHAMDLPIL
EGYGLTETSP VISVNPPEEP KVGTIGPPVV DTEVAIDGAV VGEEVADLPG DVGELLVRGP
QVTDGYWNRP DATAEAFTDP DRLPEDAVVA GDPPEERGGD PDDPWFRTGD IVQLRPDGYI
AFRERAKQLL VLSTGKNVAP GPIEDRFAAN EFVEQCVVLG DGRKFVSALI VPNFEKLAAW
ADTRGIDIPE DPTGICRDDR VRERIQVEVD RVNEEFESYE QIKRFRLVKE EFTEENDLLT
PTMKKKRRNI LDRFGDEIEI IYEDA