Gene Hlac_0231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0231 
Symbol 
ID7402160 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp250105 
End bp251415 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content72% 
IMG OID643707294 
Productcobyrinic acid a,c-diamide synthase 
Protein accessionYP_002564906 
Protein GI222478669 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1797] Cobyrinic acid a,c-diamide synthase 
TIGRFAM ID[TIGR00379] cobyrinic acid a,c-diamide synthase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.383459 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.297913 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGGCC TCGTCCTCGG TGGCACCGCT TCCGGGGTCG GCAAGACTGT CGCGACGCTC 
GCGACGATCC GGGCGCTGGA AGACGCCGGC CACGCCGTCC AGCCGGCGAA GGCAGGGCCG
GACTTCATCG ACCCGAGCCA CCACGAGCGC GTGACGGGGC GTCCCTCGCG CACGCTCGAC
CTGTGGTTAC AGGGTGAGGA CGGACTTCGT CGGAACTACG CCCGCGGCGA GGGCGACGTC
TGCGTCGTCG AGGGCGCCAT GGGGCTGTAC GACGGCGACG GGTCGAGCAC GGCCGCGGTC
GCCGAGACGC TCGGCCTTCC GGTCGTGCTC GTGGTCGACG CGAGCGCCGG CATGGAGAGC
GTCGCGGCGA CCGCACTCGG CTTCCGGGCG TACGCCGACC GGATCGGCCG CGGCATCGAC
GTGGTCGGCG TGATCGCCCA GCGCGCGCAC GGCGGGCGCC ACGCCGACGG AATCCGCGAG
GCGCTCCCGG ACGACCTCAC GTACTTCGGC CGAATTCCGC CGAACGACGA CCTCGCGGTA
CCCGACCGCC ACCTCGGCCT ACACATGGGC GACGAGTCGC CCGTGCCCGA CGACGCGCTC
GACGCGGCCG CGGAGGGACT CCGGACTGAG CGGCTCGTCG ATATCTCGCG GGAGCCGGCG
GGTGCGTTGG AGCCAGCGAC AGCGGTCGAG TCGACCGACG GCGACCGCCC CCGCGTTGCG
GTCGCCCGCG ACGACGCCTT CCGGTTCATG TATCCAGCGA CGATCGAACG CCTGCGCGAG
CGAGCGACGG TGGAGCCGTT CGCGCCGATC GCGGGCGATT CCCTCCCGCC CTGTGACGGC
GTCTACCTCC CCGGCGGTTA CCCGGAGCTG CACGCCGCAG AACTGGCGAT GAGCCCGGCG
CTTGACGAGG TCGCGAGCGC GGCCGCCGAG GGGACTCCCG TGCTCGGCGA GTGCGGCGGG
CTGATGGCGC TCGCCGAGTC GCTGACGACG GTCGACGGCG AGACGCACGC GATGGCCGGC
GTCCTCCCGG CCGACGTGCG AATGTGCGAC CGGTATCAGG CGCTCGATCA CGTCGAACTT
CGGGCGACGC GGGACGCGCC GACGGCGTCG GCGGGGTCGA CCCTGCGGGG TCACGAGTTC
CACTACTCGA CAGCCGAGAT CGGGACCGAC GCCCGGTTCG CCTTCGACGT CGAGCGCGGG
ACAGGGATCG ACGGCGACAA CGATGGCCTG ATCGAACACC AAACGCTCGG AACGTACTGT
CACGTCCACC CCGAAAGCGG GGCGTTCGAC GCGTTTCTCG ACGGACTGTG A
 
Protein sequence
MKGLVLGGTA SGVGKTVATL ATIRALEDAG HAVQPAKAGP DFIDPSHHER VTGRPSRTLD 
LWLQGEDGLR RNYARGEGDV CVVEGAMGLY DGDGSSTAAV AETLGLPVVL VVDASAGMES
VAATALGFRA YADRIGRGID VVGVIAQRAH GGRHADGIRE ALPDDLTYFG RIPPNDDLAV
PDRHLGLHMG DESPVPDDAL DAAAEGLRTE RLVDISREPA GALEPATAVE STDGDRPRVA
VARDDAFRFM YPATIERLRE RATVEPFAPI AGDSLPPCDG VYLPGGYPEL HAAELAMSPA
LDEVASAAAE GTPVLGECGG LMALAESLTT VDGETHAMAG VLPADVRMCD RYQALDHVEL
RATRDAPTAS AGSTLRGHEF HYSTAEIGTD ARFAFDVERG TGIDGDNDGL IEHQTLGTYC
HVHPESGAFD AFLDGL