Gene Hlac_2540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2540 
Symbol 
ID7401593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2514999 
End bp2516879 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content68% 
IMG OID643709612 
ProductCarbamoyl-phosphate synthase L chain ATP-binding 
Protein accessionYP_002567182 
Protein GI222480945 
COG category[I] Lipid transport and metabolism 
COG ID[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.391908 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGATA AGGTTCTGGT CGCGAACCGC GGAGAAATCG CGGTTCGGGT GATGCGCGCC 
TGTGCTGAGC TGGGTGTCGA CACCGTCGCC GTCTACAGCG ACGCGGACAA ACACGCGGGT
CACGTCCGGT ACGCGGACGA GGCGTACAAC GTGGGACCGG CCCGCGCTGC CGACTCGTAC
CTCGACGGCG AGGCGGTCGT CGAGGCCGCG AAGGCGGCCG ACGCCGACGC GATCCACCCC
GGCTACGGCT TCCTCGCGGA GAACGCCGAC TTCGCGGCCC GCGTCGAGGC GACCGACGGG
ATCACCTGGG TCGGTCCGTC GAGCGACGCG ATGGAGCGGC TCGGGGAGAA GACCCACGCG
CGCCGCGTGA TGGACGACGC CGACGTGCCC ATCGTCCCCG GGACGACCGA GCCCGTTACC
GACGTCGAGG CGGTGACCGA GTTCGGCGAC GAGCACGGCT ACCCCGTCGC GATCAAAGCC
GAGGGCGGCG GCGGCGGCCG CGGGATGAAA GTCGTCGAGA GTGCCGACGA GGCCGAGAAG
GCCCTCGAAT CCGCGAAACG CGAGGGCGAG GCGTACTTCT CGAACGACTC CGTCTACCTC
GAACGCTATC TACAGAACCC CCGCCACATC GAGGTCCAGA TCGTCGCAGA CGACCCCGAG
GGTAACGGGT CCCTCGACGA GAGCGACGTG GTCCACCTCG GCGAGCGCGA CTGCTCGCTC
CAGCGCCGCT ACCAGAAGGT GATCGAAGAG GGGCCTTCCC CCGCGCTCTC CGACGAACTG
CGCGAGCAGA TCGGCGAGTC CGCTCGCCGC GGCGTCGCCG CCGCCGACTA CACCAACGCC
GGCACCGTCG AGTTCCTCGT CGAGGAGGAC GTGGACCGCG ACCCGTCGGA CCTGCTCGGG
CCGGACACGC CCTTCTACTT CCTCGAAGTC AACACGCGGA TTCAGGTCGA ACACACCGTC
ACGGAGGAGC TGACGGGGAT CGACATCGTG AAAGAGCAGC TCCGGGTCGC CGCCGGGGAG
GGCATCTCCG TCTCGCAGGA CGACGTCGAG CTTGACGGCC ACGCGATCGA GTTCCGGATC
AACGCCGAGA ACGCCGCCGC CGACTTCCAG CCCGCCAACG AGGGACGCTT GGAGACCTAC
GACCCACCGG GCGGAATCGG CGTCCGCGTC GACGACGCGC TCCGACAGGG CGACGAGCTG
GTCACCGACT ACGATTCGAT GATCGCGAAG CTGATCGTGT GGGGTTCGGA CCGCGAGGAG
TGTCTCGCGC GGTCGAAGCG CGCGCTCGCG GAGTACGACC TCGAAGGAGT CGTCACCATC
GTCCCGTTCC ATCGTCTCAT GCTCGACGAC GAGCGGTTCG TCGCGGGCAC CCACACCACG
AAGTACCTCG ACGAGGAGCT CGATGAGGCG CTGGTCGCGG ACGCACAGGA GAAGTGGGGC
ACCGAGTCGT CCGCGAGCGG CGACGACGAT GAAGAGGTGT CCGAACGCGA GTTCACCGTC
GAGGTGAACG GCAAGCGTTT CGAGGTCGAA CTGGAGGAGC GCGGCGCACC CGCGATCCCG
GTGCCGGAGG GCGGGATGGG CGGCGCCGGC GGCAGTGGTG GCGAGCAGCG GCCCCCGCAG
GCGAAGTCCG ACGACGGGAG TGACGACGGT GTCGACATCG CTGAGGGCGG CGAGGCGATC
GAGGCGGAGA TGCAGGGGAC GATCCTCTCG GTCGACGTCG ACGAGGGCGA CGAGGTCGCC
GCCGGCGACG TGGTCTGTGT GCTCGAAGCG ATGAAGATGG AAAACGACGT GGTCGCCGAG
CGCGGCGGCA CCGTCGTGAG CGTTCACGCC GGCGAGGGCG ACAGCGTCGA TATGGGCGAC
GTGCTGATCG TGTTGGAGTA G
 
Protein sequence
MFDKVLVANR GEIAVRVMRA CAELGVDTVA VYSDADKHAG HVRYADEAYN VGPARAADSY 
LDGEAVVEAA KAADADAIHP GYGFLAENAD FAARVEATDG ITWVGPSSDA MERLGEKTHA
RRVMDDADVP IVPGTTEPVT DVEAVTEFGD EHGYPVAIKA EGGGGGRGMK VVESADEAEK
ALESAKREGE AYFSNDSVYL ERYLQNPRHI EVQIVADDPE GNGSLDESDV VHLGERDCSL
QRRYQKVIEE GPSPALSDEL REQIGESARR GVAAADYTNA GTVEFLVEED VDRDPSDLLG
PDTPFYFLEV NTRIQVEHTV TEELTGIDIV KEQLRVAAGE GISVSQDDVE LDGHAIEFRI
NAENAAADFQ PANEGRLETY DPPGGIGVRV DDALRQGDEL VTDYDSMIAK LIVWGSDREE
CLARSKRALA EYDLEGVVTI VPFHRLMLDD ERFVAGTHTT KYLDEELDEA LVADAQEKWG
TESSASGDDD EEVSEREFTV EVNGKRFEVE LEERGAPAIP VPEGGMGGAG GSGGEQRPPQ
AKSDDGSDDG VDIAEGGEAI EAEMQGTILS VDVDEGDEVA AGDVVCVLEA MKMENDVVAE
RGGTVVSVHA GEGDSVDMGD VLIVLE