Gene Hlac_0238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0238 
Symbol 
ID7401164 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp256422 
End bp257978 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content71% 
IMG OID643707301 
Productcobyric acid synthase 
Protein accessionYP_002564913 
Protein GI222478676 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1492] Cobyric acid synthase 
TIGRFAM ID[TIGR00313] cobyric acid synthase CobQ 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.267463 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.284689 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACC ACAATCTCGA CACCGAGACA ATCCTCGTCG CCGGCACCGC GAGCCACGCC 
GGTAAGAGCA CGCTCGTGGC CGGGCTCTGC CGACTCCTCG CGCGCCGGGG CGTCTCTGTC
GCGCCTTTTA AGGCTCAGAA CATGAGCAAC AACGCTCGGG TCGCGCTCAC GCCCGACGGC
GAGTGGGGCG AGATTGGCGT CTCACAGCAC GTGCAAGCGC GAGCCGCCCA GATCCCGGCG
ACGACCGACA TGAACCCCGT CCTGCTCAAA CCCCGCGGCG ACGGCGAGAG CCAGTTGGTG
ATCAACGGGG GGGCGGTCGG CCACTTCGCG GCCGGCGAGT ACTACGAGTC GCACTGGGAC
CAAGCACGGG ACGCGGCGGT CGCGGCTCAC CGTCGACTGG CGGCCGATCA CGACGTGATC
GTCGCGGAGG GAGCGGGGAG TATCGCCGAG ATCAACCTCC ACGACCGCGA CCTCGCGAAC
GTGGAGTGCG CCCGGTTCGC CGACGCCGCA ATCGTGATCG CGGTCGACAT CGAGCGCGGC
GGCGCCTTCG CGAGCCTCTA CGGCACCCTC GAACTCCTCC CCGACGACGT TCGCGAGCGC
GTCGCCGGCG CCGTGATCAC CAAATTCCGC GGCGACCCCG CCCTCTTGGA GCCCGGCATC
GAGGAGATCG AAGCGCGGAC CGAGGTCCCG ATCGTCGGCG TCGTCCCGCA CGACGACCCC
GGCCTCCCGG CCGAGGACAG TCTCTCGCTG CCGGACGCGG ACGGTGAGGG CGGACCCGGC
GTTTCCGGCA CCGACGACGG CGTCCCCGAG GAGGAGACGG TCCGGATCGC GGTCCCCCGG
CTCCCGCGCA TCTCGAACTT CACCGATCTG ACGCCGCTCG CTCGTGAGCC GGGGGTCCGG
GTGGCGTACG TCCCGCTCGA CGCGACCCTC GACGCCCCCC TCGCCGACGC CGACGCGGTC
GTCCTCCCCG GCTCGAAGAA CACGGTCGAC GACCTGCTCG CGCTGCGCGA GGCGGGCCTC
GACGAGGCGA TCCGCCGCTT CGAGGGCCCG ATCGTCGGGA TCTGCGGCGG CTACCAGCTG
CTCGGCGAGC GGATCACCGG CGCCGATATC GAGGGCACCG GCTCGGAATC AACCGTCGAG
GGCGTCGGCG TGCTCCCGGT CGAGACGCAC TTCTCGCCCG ACAAGCGCGT CGAGCGCGTA
ACGCGCACCG TCGCCGGTGT CGGCCCGCTG GCCGGGGCGA CCGGAACCGC AGCTGGCTAC
GAGATCCACA TGGGTCGATC GTCACCGACC GGAGAGACCG CCCGCCCGCT GGGCCCGGAG
AGCGCGGCGA CGGGCCGGGC GCTGGGGACC TACCTCCACG GGCTCTTCGA GAACGGGGCG
GTCCGTGACG CCTTCGTATC CACCGTCTTC GAGATGGCGG ACGAGTCGCG GCCCACGGAG
CCGTCGCGGT CGGCTGGAGA TCCAGACGAC ACCGATCGAT CCTCCTACGA CCGCGCCGCC
GACCTCGTCG CCGAGAATGT CGACCTCGCG GCCGCAGGGC TCGGCGAGTT GGAGTGA
 
Protein sequence
MTDHNLDTET ILVAGTASHA GKSTLVAGLC RLLARRGVSV APFKAQNMSN NARVALTPDG 
EWGEIGVSQH VQARAAQIPA TTDMNPVLLK PRGDGESQLV INGGAVGHFA AGEYYESHWD
QARDAAVAAH RRLAADHDVI VAEGAGSIAE INLHDRDLAN VECARFADAA IVIAVDIERG
GAFASLYGTL ELLPDDVRER VAGAVITKFR GDPALLEPGI EEIEARTEVP IVGVVPHDDP
GLPAEDSLSL PDADGEGGPG VSGTDDGVPE EETVRIAVPR LPRISNFTDL TPLAREPGVR
VAYVPLDATL DAPLADADAV VLPGSKNTVD DLLALREAGL DEAIRRFEGP IVGICGGYQL
LGERITGADI EGTGSESTVE GVGVLPVETH FSPDKRVERV TRTVAGVGPL AGATGTAAGY
EIHMGRSSPT GETARPLGPE SAATGRALGT YLHGLFENGA VRDAFVSTVF EMADESRPTE
PSRSAGDPDD TDRSSYDRAA DLVAENVDLA AAGLGELE