Gene Hlac_0232 details

Gene Information

Locus tag: Hlac_0232
Symbol:
ID: 7402161
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 251421
End bp: 252425
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 74%
IMG OID: 643707295
Product: cobalamin biosynthesis protein CobD
Protein accession: YP_002564907
Protein GI: 222478670
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1270] Cobalamin biosynthesis protein CobD/CbiB
TIGRFAM ID: [TIGR00380] cobalamin biosynthesis protein CobD
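
The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. Both are assumptions, although they are consistent with the values listed. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 251421
END_BP = 252425


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))                   # 1005
print(protein_length(gene_length(START_BP, END_BP)))   # 334
```

Both results agree with the Gene Length (1005 bp) and Protein Length (334 aa) fields.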


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.700118
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.268973
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACTG CTCCCCTCCT TACCCCCGTA CTCGCCTCCC TCGCCACGCT GGCGATCGCG 
GTCGCCCTCG ACCTCGCGCT CGCGGAGCCA CCCGCCCGAG TCCACCCCGT CGCGCTGTTC
GGGTCGGTCG TCGGTCGGTT CGACCGCTCG TGGTCGCGCC CCCGGCTCGT CGGCGTCGCG
GTCGCGGTCG GGCTCCCGAT CGGTGTCGCG GCGTTTGCGG GCGGGATCGT CGCAGCCGCC
TCCTTCGCTC TTCCCGCCTT CTCCGCCCTC CCCGTTCTCG TCGCCGGAAC GATCCTCTTC
ACGACCGTCA GCCTCCGAAT GCTGCTGGCG ACGACCGCCG AGGTCGTCGA ACTGACGGAA
ACGGATCCGG ACGCGGCCCG GGAATCGGTG CGCGCGCTCG CGGGCCGGGA CGCGACCGAC
CTCTCCCCGG CCGACCTCCG GAGCGCGGCC GTCGAGAGCG CGGCCGAGAA CCTCGCCGAC
GGGTTCGTCG CGCCCCTCGG CGGGTTCGCG CTCGGAGCGA CGGTCGGACT CGCGGTCGGC
GGTTCCGAAG TCGCGCTCCC GCTTGCCGCG GGGGTTGCCG CCGCGGTCTG GGTGAAGGCC
GTCAACACGC TCGACTCGAT GCTCGGCTAC CGCTCGAAGC CGGTCGGGTG GGCGAGCGCT
CGGCTCGACG ACGCCGTGAT GTTCCTCCCG GCCCGCGTGA CCGCCGGCTG TCTTTCGGTC
GCGGCCGGAT CGATCGAAGC CCTCCGCTGG GCCGCATCGT GGGCCGGGAA GCCCGGATCG
CCGAACTCCG GGTGGCCGAT GGCGACCGCC GCGGCCGCGC TCGACGTGCG ACTGGAGAAA
CCCGGTCACT ACGTCCTCAA CCCGGACGCG AGCCCGCCCA GCGTCGTCGA CGCGGAGCGG
GCGGTACGGC TCGTCGGCGT TTCCGGCGGG GTCGCGGTCG CTCTCGCGGC GGGATGGCTA
CTCGCGACAA GATGGCTCCC GGCTCCTGCG GGGGTGATCG GATGA
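
A GC content figure such as the one reported for this gene can be recomputed from the sequence. The sketch below is one simple way to do it in Python, assuming GC content means the percentage of G and C among the A, C, G and T bases. How the source database computes or rounds its value is not stated, and `gc_percent` is a hypothetical helper name.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G/C among the A, C, G, T characters in seq.

    Spaces and line breaks (as in the 10-base blocks above) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence: 6 of 10 bases are G or C.
print(gc_percent("ATGGCGACTG"))  # 60.0
```

Passing the full gene sequence, with its spaces and line breaks left in, gives the gene-wide percentage.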
 
Protein sequence
MATAPLLTPV LASLATLAIA VALDLALAEP PARVHPVALF GSVVGRFDRS WSRPRLVGVA 
VAVGLPIGVA AFAGGIVAAA SFALPAFSAL PVLVAGTILF TTVSLRMLLA TTAEVVELTE
TDPDAARESV RALAGRDATD LSPADLRSAA VESAAENLAD GFVAPLGGFA LGATVGLAVG
GSEVALPLAA GVAAAVWVKA VNTLDSMLGY RSKPVGWASA RLDDAVMFLP ARVTAGCLSV
AAGSIEALRW AASWAGKPGS PNSGWPMATA AAALDVRLEK PGHYVLNPDA SPPSVVDAER
AVRLVGVSGG VAVALAAGWL LATRWLPAPA GVIG
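
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand instead of relying on a library. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, which does not matter here because the gene begins with ATG. The name `translate` is illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks can be pasted as-is.
    """
    dna = "".join(seq.upper().split())
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGCGACTG CTCCCCTCCT TACCCCCGTA"))  # MATAPLLTPV
```

The output matches the first ten residues of the protein sequence, and the final bases of the gene (GGGGTGATCG GATGA) translate to GVIG followed by a TGA stop codon, matching the protein's last four residues.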