Gene Hlac_0723 details

Gene Information

Locus tag: Hlac_0723
Symbol:
ID: 7400196
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 738785
End bp: 739933
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 67%
IMG OID: 643707789
Product: 2-methylcitrate synthase/citrate synthase II
Protein accession: YP_002565395
Protein GI: 222479158
COG category: [C] Energy production and conversion
COG ID: [COG0372] Citrate synthase
TIGRFAM ID: [TIGR01800] 2-methylcitrate synthase/citrate synthase II
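
The start, end, gene length, and protein length fields of this record are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon; the function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(738785, 739933)
print(length_bp, protein_length(length_bp))  # 1149 382
```

The 1149 bp span corresponds to 383 codons, one of which is the stop codon, which gives the 382 aa listed for the protein.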


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.517524
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.584309
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGACG AGTTAAAACG CGGGCTCGAA GGCGTCCTCG TCGCGGAGTC GGATCTGAGC 
TACGTCGACG GCGAGGTCGG CAAGCTCGTG TACCGTGGGT ACGACATCGA GGACCTCGCT
CGCGGTGCGA GCTACGAGGA GGTGCTGTAC CTCCTGTGGC GCGGCTCGCT GCCGACGCGT
GAGGAGCTCG ATGCCTTCAC CGCGGATCTC GCCGCCGAGC GCGCCGTCGA CGACGACGCG
CTCGACGCCG TCCGGACGCT CGCCGACGCC GGCGAACGCC CGATGGCGGC GTTGCGGACC
GCAGTCTCTA TGCTGTCGGC GTACGAGCCG GAGTCGGATG CCGACCCCGA GGATCTCGAC
GCGACGCTCC GGCAGGGCCG CCGGATCACG GCGAAGATCC CGACGCTTCT CGCCGCCTTC
GAGCGCGCGC GGCAGGGCGA GGACCCGGTC GCGCCCGACC CCGACCTCTC ACACGCCGCG
AACTTCCTCT ACATGCTCAC CGGGACCGAG CCCGACGACG TGAGCGCCGA GACGTTCGAC
ATGGCGCTGA CGCTCCACGC CGATCACGGA CTCAACGCCT CGACATTCAC CGCGATCGTG
ATCGGCTCGA CGATGGCCGA CGTGTACTCC GGTGTCACCG GCGGGATCGG CGCACTCTCC
GGCCCCCTCC ACGGCGGCGC GAACCAAGAC GTGATGGAGG TGCTTCAGGA GGTCGACGCC
TCCGATAAGG ACCCCGTACA GTGGGTAAAA GACGCCCGCG AAGAGGGTCG GCGCATCCCC
GGCTTCGGCC ACCGCGTCTA CAAGGTCAAA GACCCTCGTG CGAAGATCCT CGAAGAGAAG
CTACGTGACC TCTCGGAGTC GTCCGGCGAC ACGAAGTGGC TCGACTACAC CACCGCAATC
GAGGAGTACC TCACCGAACA GGGATTGCTT GATAAGGGAA TCGCTCCGAA CGTCGACTTC
TACTCCGGAT CCGTCTACGA CTCGCTGGGG ATCCCGGTCG ACATGTACAC CCCTATCTTC
GCGATGAGCC GCGCCGGCGG CTGGATCGCT CACATGGTCG AGTACCAGGA GGACAACCGC
CTCATCCGCC CGCGGGCGCG GTACACCGGT CCCAAAGCGT CCGAGTTCGT TCCCGTCGAC
GAGCGGTGA
 
Protein sequence
MSDELKRGLE GVLVAESDLS YVDGEVGKLV YRGYDIEDLA RGASYEEVLY LLWRGSLPTR 
EELDAFTADL AAERAVDDDA LDAVRTLADA GERPMAALRT AVSMLSAYEP ESDADPEDLD
ATLRQGRRIT AKIPTLLAAF ERARQGEDPV APDPDLSHAA NFLYMLTGTE PDDVSAETFD
MALTLHADHG LNASTFTAIV IGSTMADVYS GVTGGIGALS GPLHGGANQD VMEVLQEVDA
SDKDPVQWVK DAREEGRRIP GFGHRVYKVK DPRAKILEEK LRDLSESSGD TKWLDYTTAI
EEYLTEQGLL DKGIAPNVDF YSGSVYDSLG IPVDMYTPIF AMSRAGGWIA HMVEYQEDNR
LIRPRARYTG PKASEFVPVD ER
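
Both sequences are printed in space-separated blocks of ten, so they need to be joined before any analysis. The sketch below shows one way to strip the block formatting, compute a GC fraction, and translate the coding sequence. It assumes that the amino acid assignments of translation table 11 match the standard genetic code (table 11 differs in its permitted start codons), and all function names are hypothetical. Only the first line of the gene sequence is used in the example.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, listed in TCAG order for each codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_line = clean_sequence("ATGTCAGACG AGTTAAAACG CGGGCTCGAA")
print(translate(first_line))  # MSDELKRGLE
```

The translated fragment matches the first block of the protein sequence. This gene begins with ATG, so the sketch does not handle the alternative start codons that table 11 permits, which would be read as methionine at the first position.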