Gene Hlac_1304 details


Gene Information

Locus tag: Hlac_1304
Symbol:
ID: 7399399
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1313996
End bp: 1315954
Gene Length: 1959 bp
Protein Length: 652 aa
Translation table: 11
GC content: 70%
IMG OID: 643708368
Product: AMP-dependent synthetase and ligase
Protein accession: YP_002565966
Protein GI: 222479729
COG category: [I] Lipid transport and metabolism
COG ID: [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases
TIGRFAM ID: [TIGR02188] acetate--CoA ligase
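
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1313996
end_bp = 1315954
gene_length_bp = 1959
protein_length_aa = 652


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the values reported in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

If either check printed False, the likely causes would be a different coordinate convention or a partial CDS rather than an error in the sequence itself.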


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTACG CCGGAGTTGA CGGCCGGCGG ACGAACGGGC TCGAAGACGG CGACGACGCC 
GTCGAGCCGC CGGCCGGGTT CAGCGATCGG GCCGTCGCCA GCGACGCGGG ACTGTACGAG
GCCTTCGCCG AGGAAGGACC TGAGGCGTGG CGCCGCGCCG CCGACCTGCT CGACTGGGAG
CGTCCCTCCG AGACCGTCCT CGACGACAGC GATCCGCCCT TCTACGAGTG GTTTCCGGAC
GGGCGGCTCA ACGCCGCGGC CAACTGCGTC GACCGCCACC TCGACGAGCG CCGGAACCAG
CTCGCGATCC GCTGGTTCGG CAAGCGCGGA GAGCGCCGGT CGTACACCTA TCTCGACCTC
CACCGCGAAG TGAACGCGCT CGCGGCCGGC CTCCGCGATC TGGGGGTCGA AGAGGACGAC
GTGGTGACGC TGTACCTCCC GATGGTTCCC GAACTGCCGA TCGCGATGCT GGCGTGTGCC
CGGATCGGGG CGCCTCATAA CGTCGTCTTC GCGGGGCTGT CGGCGGAGGC GCTCGCGACC
CGGATCGACG CCGCCGACTC GGAGTACCTC GTCACCTGCG ACGGCTACTA TCGCCGTGAG
GACGCGTTCA ACCAGAAGTC GAAGGCGGAC AACGCCCGAC TCCGGGCCGA CGTCGACCTG
TCCGAGACGG TGGTCGTCGA CCGGCTCGGC GACGCGCTCG ACGTGCCGCT GGGCGACGAC
GAACACGAGT ACGAGGCGAT CGTGGACGCA CACGACGGAG AGACCGTCGA GCCGGTCGCG
CGCGACGCCA CCGACCTCCT CTTCGTGATG TACACCTCCG GGACCACGGG TCGTCCGAAA
GGCGTCGAGC ACGCGACCGG GGGGTACCTC TCGCACGTCG CGTGGACGAC CCGGAACGTG
CTCGACGTGC GCCCCGACGA CACCTACTGG TGTGCGGCCG ACATCGGCTG GATCACGGGG
CACTCCTACG GCGTGTACGG TCCCCTCTCC GTCGGGACCA CGACCCTCCT CTACGAGGGG
TCACCCGACT ACCCCGACCG CGACCGCGTG TGGGACCTGA TCGAGCGCAA CGCCGTCAGC
GTCTTTTACA CCTCGCCGAC CGCGATTCGC TCGTTTATGA AGTGGGGCGC GGAGTACCCC
GAGGCCCACG ACCTGTCGTC GATCCGCCTG TTGGGGACCG TCGGCGAGTC GATCACTCCG
AAGGCGTGGC ACTGGTACCG CAAGCACGTC GGCGGAGGCG AGGCGCCGAT CGTCGACACG
TGGTGGCAGA CCGAGACGGG CGCCATCTCC CTCGCGACCC TGCCGGGGAT CACGCCGATG
AAACCCGGCA AGGTCGGCCC GCCGCTGCCG GGGATCGACG CGCGCGTCGT CGACGAGGAC
GGCGATCCGG TCGAGCCCGG CGAGCCGGGG TACCTCACGA TCGCCGCTCC GTGGCCCGGC
ATGCTCCGCG GGCTCCGCGA GGGCGACGAG CGGTACCGCC GGGAGTACTG GCTGGAGGGC
GACGACGGGT GGCGGTACCG CACCGGCGAC GGCGCCACCG TCGACGAGGA CGGCTACGTG
ACGATCCTCG GCCGCGTCGA CGACGTGATC AACGTCCGGA CGCACCGATT CAACACCGCG
GAGCTGGAGT CAGCCATCGT CGGGGCCGAC GGCGTCACCG AGGCCGCGGT CGTCGGCGAC
GACGACGGGA AGATCGTCGC GTACGTGACG ACTCGGGGCG ACATCGACCC CGATGAATCC
CTCCGCGAGA CGATCGGCGA GCGGCTGGCG CAGGCGGTCG GCGACGTGGC GCGCCCCGAC
CGGATCGTGT TCACCCCTGA CCTCCCGAAG ACGCGGTCCG GGAAGATCAT GCGCCGCCTT
CTGGAGGACA TCGCGCGCGG TGACGAGTTT GGCGACGTGA GCGCCCTCCG CAACCCCGAG
GTCGTCGGCG AGATCGAGTC GGCGGTTCGC GAGGAGTAA
 
Protein sequence
MDYAGVDGRR TNGLEDGDDA VEPPAGFSDR AVASDAGLYE AFAEEGPEAW RRAADLLDWE 
RPSETVLDDS DPPFYEWFPD GRLNAAANCV DRHLDERRNQ LAIRWFGKRG ERRSYTYLDL
HREVNALAAG LRDLGVEEDD VVTLYLPMVP ELPIAMLACA RIGAPHNVVF AGLSAEALAT
RIDAADSEYL VTCDGYYRRE DAFNQKSKAD NARLRADVDL SETVVVDRLG DALDVPLGDD
EHEYEAIVDA HDGETVEPVA RDATDLLFVM YTSGTTGRPK GVEHATGGYL SHVAWTTRNV
LDVRPDDTYW CAADIGWITG HSYGVYGPLS VGTTTLLYEG SPDYPDRDRV WDLIERNAVS
VFYTSPTAIR SFMKWGAEYP EAHDLSSIRL LGTVGESITP KAWHWYRKHV GGGEAPIVDT
WWQTETGAIS LATLPGITPM KPGKVGPPLP GIDARVVDED GDPVEPGEPG YLTIAAPWPG
MLRGLREGDE RYRREYWLEG DDGWRYRTGD GATVDEDGYV TILGRVDDVI NVRTHRFNTA
ELESAIVGAD GVTEAAVVGD DDGKIVAYVT TRGDIDPDES LRETIGERLA QAVGDVARPD
RIVFTPDLPK TRSGKIMRRL LEDIARGDEF GDVSALRNPE VVGEIESAVR EE
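
The sequences above are printed in blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch, assuming plain Python with no external libraries, of how the blocks might be joined and a GC fraction computed. The helper names are hypothetical, and only the first line of the gene sequence is used for demonstration, so the result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(blocked):
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C among all bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGGATTACG CCGGAGTTGA CGGCCGGCGG ACGAACGGGC TCGAAGACGG CGACGACGCC "
dna = clean_sequence(first_line)
print(len(dna), round(gc_fraction(dna), 2))
```

Passing the full gene sequence through the same two functions would give a value to compare with the GC content field.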