Gene Hlac_0224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0224 
Symbol 
ID7402153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp242228 
End bp243877 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content69% 
IMG OID643707287 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_002564899 
Protein GI222478662 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0391267 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0314439 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGAAG AGCTCACGCT CCGCCCGTTC TTCTGGCGGG CGACCCGACT GTTCCCGGAG 
ACGGAGGTCG TCTCCCGCAC CCACGACGGG GTCGACCGCT ACACGTACGC CGACTTCGGC
GACCGCGTCC GGCGGCTCGC GGCCGCGCTC GCGGAGTTGG GCGTCGAGCC CGGCGACCGC
GTCGGAACGC TCGGGTGGAA CACCCACCGG CACTTCGAGG CGTACTACGC GGTCCCCCTC
TCGGGCGCCC AGCTCCACAC GGTGAACCTC CTCTTACAGG ACGACCACGT CGAGTACATC
ATCAACGACG CCGCGGACGA CGTGCTGATC GTCGACCGCG ACGCCGTCGC GACGCTCGAC
CGACTGTGGG ACCGGATCGA CGGCGTCCGC GAGGTCGTCG TCATGGGTGA TTCGGTGCCG
GAGACGGAGT CCGACCTGCC GCTCTCGGCG TTCGAGGAGC TGATCGCGGA CGCCGACCCG
GTAGAATCGT GGCCGCCGCT CTCGGAGGAC GACCCGGCCG GGATGTGTTA CACCTCCGGC
ACCACGGGGA AGCCGAAGGG CGTCGAGTAC ACGCACAAGA TGATCTACGC CCACGCGATG
ATGGTGATGA CGCCCGCCGC GCTCGACATC GCCGAGGACG ACGTGGTGAT GCCCGTCGTT
CCCATGTTTC ACGTAAACTC GTGGGAGTTC CCCTACGCCG TGACGATGGC CGGCGCGAAA
CAGGTGTATC CGGGACCGTC CCCGGACCCC GCGGACCTCG TCGAGCTGAT CGAGTCCGAG
GGCGTGACGC TCACGGCGGG CGTGCCCACC GTCTGGATCG ACGTGCTCGA CCACCTCGAC
GAGCACGGCG GCGATCTCTC CTCGCTGGAG CGGATCGTCG TCGGCGGGAG CGCGGCGCCC
CGAGAGGTGA TGCGCCGGTA CGAGGACGAA CACGACGTGA CGATCGAACA CGCGTGGGGG
ATGACCGAGA CGATGAGTAT CGGCTCGGTC TCCCGACCGA CCTCGGCGAT GGCCGGTGCC
GATCGCGAGG CGAAACTCGA CAAACGGGCG AAACAGGGGC TGCTCTCGCC CGGCCTTGAA
ATGCGGGTCG TCGACGACGA CGACAAGCCG GTCGCCTGGG ACGGCGAGGC GTTCGGCGAA
CTGCTCGTCC GGGGACCCTC GGTCGTCGAA GAGTACTACG ACCGCCCGGA GGCCGACGCG
ACGGACTTCG TCGCGGCCGA CGACGGCGGG GCGCGGTGGC TCCGCACCGG CGACATCGCC
ACCGTCGACG AAGACGGGTA CATGGAGGTC GTCGACCGGG TCAAGGACGT GATCAAGTCG
GGCGGCGAGT GGATCTCCAG CATCGAGTTG GAGAACGCCT TGATGGCCCA CGAGGACGTC
GCCGAGGCGG TCGTGATCGC GGCGTCCCAC GAGCGCTGGC AGGAGCGCCC GCTGGCGTTC
GTCGTGCCGA AGGCGGGCCG CGAGCTCGAC GTCGAGGGGA TCCGAACGTT CCTCGCCGAC
GAGTTCCCGC GGTGGTGGCT CCCGGACGAC GTGCGGTTCC GCGAGGAGAT TCCGAAGACC
GCGACCGGGA AGTTCGACAA GAAGACGCTC CGGGAGACCG TCGACGATCC CGCTCTGCCG
TACGCGCCGG GAGAGGAGGG TGGAGAATGA
 
Protein sequence
MPEELTLRPF FWRATRLFPE TEVVSRTHDG VDRYTYADFG DRVRRLAAAL AELGVEPGDR 
VGTLGWNTHR HFEAYYAVPL SGAQLHTVNL LLQDDHVEYI INDAADDVLI VDRDAVATLD
RLWDRIDGVR EVVVMGDSVP ETESDLPLSA FEELIADADP VESWPPLSED DPAGMCYTSG
TTGKPKGVEY THKMIYAHAM MVMTPAALDI AEDDVVMPVV PMFHVNSWEF PYAVTMAGAK
QVYPGPSPDP ADLVELIESE GVTLTAGVPT VWIDVLDHLD EHGGDLSSLE RIVVGGSAAP
REVMRRYEDE HDVTIEHAWG MTETMSIGSV SRPTSAMAGA DREAKLDKRA KQGLLSPGLE
MRVVDDDDKP VAWDGEAFGE LLVRGPSVVE EYYDRPEADA TDFVAADDGG ARWLRTGDIA
TVDEDGYMEV VDRVKDVIKS GGEWISSIEL ENALMAHEDV AEAVVIAASH ERWQERPLAF
VVPKAGRELD VEGIRTFLAD EFPRWWLPDD VRFREEIPKT ATGKFDKKTL RETVDDPALP
YAPGEEGGE