Gene Hlac_1089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1089 
Symbol 
ID7400161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1092273 
End bp1093598 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content73% 
IMG OID643708155 
ProductMmgE/PrpD family protein 
Protein accessionYP_002565754 
Protein GI222479517 
COG category[R] General function prediction only 
COG ID[COG2079] Uncharacterized protein involved in propionate catabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCTG AGCCCGAACG CGATCTGGCG GCGTTCGCCG CGGAGCTTGA GACGGAAGCG 
ATTCCCGACC GGGTCCGCGA CCGGGCGGGA CTCACAATCG CCGACACGCT CGGCGCAATC
GTCGCGGGGT CGACCGACGA CGCGGTCGTC GCGCTCGCGC GGCGCTGGAC CGACGGCGTC
TCCGGCGGGG CGACGGTGCT CGGCGCCGAC GGCGGCGAGA CGGTCCCGCC GCTGGCGGCG
CTCTGCAACG GCGCGGCGGG CACCGTCCTC GAACTCGACG AGGGGCATCG GTTCGCCGCC
GGCCACCCGG CGATCCACGT GTTGCCCGCG CTGTTGGCCG ACGCCGAGAT CGGCTACGGC
GACAGTGACG CGTTCGTGCG CTCGTTCGTC GCGGGCTACG AGGTCGCCGT CCGAACCGCC
CGCGCGGTCG GGACGCTCGA ATCGGGGTAC CACCCGCACG GCGTGTGGGG GGCGGTCGGC
GGCGCGGCCG CAGTGGCACG CTCTCGCGGA CTCGACCCGG AGACGACGCG CTCGGCCATG
GCCATCGCGG CGAACTACGC GCAACACACC CGGTTCGAGG CGGCGACGGA GGGCGCGACC
GTGCGGAACG TCTACGCCGG CATGAGCAAC CTCGCGGCGC TGGTCGCCGT CGATCAGGCG
GAAGCCGGGT TCGGCGGCTT GGAGAACGGC GTCGCGCGGC ACCTCGAATC CGCCGCCGAC
GGGGTCGACG AGGCAGCCCT CTCGGCGGGA CTCGGCGAGC GCTGGGAGCT GGAACACGGC
TACTTCAAGA TCCACGCCGC GTGTCGGTAC ACCCACCCGA CGCTGGATGC CATCGCGGCC
CTCCCGGACG GGTTGGATGC GGCCGCGGTG GAGTCGGTCC GCGTCGAGAC GTATCCGGCG
GCCGCACGGC TGACGGAGTC GCGACCGCAA AACCAACTGC AGGCGAAGTT CTCGATCCCG
TTCGCGGTCG CGACGGCGCT GCTGCGCGGC GAGACCGGAC CGACCGCGTT CGTGGACGAG
GCGATAACTT CAGAAGCGAT CGCGCTCGCC GAACGCGTCA CGGTCGCTGT CGACGACGAG
ATCGCCGCCC GGGCTCCCGA ACAGCGGGGC GCACGGGTGA TCGTCGAGAC GGCGAACGAG
CGCTTCTCGC GAGAGGTCGT CGCCCCGCGA GGCGGCGAGC ACGACCCGTT CGACGAGGGG
CGGCTCGAAT CGAAGTTCCG AGAGCTGGTC GCGCCCGTGA TTGGCGCGGA CCGGGCGGCC
ACGCTCTGGG AGAGCGCCAG GGCGCCGGAG CCGCCGCGCG TGCTCTGTAC GCTCGCCCGG
CGCTGA
 
Protein sequence
MPPEPERDLA AFAAELETEA IPDRVRDRAG LTIADTLGAI VAGSTDDAVV ALARRWTDGV 
SGGATVLGAD GGETVPPLAA LCNGAAGTVL ELDEGHRFAA GHPAIHVLPA LLADAEIGYG
DSDAFVRSFV AGYEVAVRTA RAVGTLESGY HPHGVWGAVG GAAAVARSRG LDPETTRSAM
AIAANYAQHT RFEAATEGAT VRNVYAGMSN LAALVAVDQA EAGFGGLENG VARHLESAAD
GVDEAALSAG LGERWELEHG YFKIHAACRY THPTLDAIAA LPDGLDAAAV ESVRVETYPA
AARLTESRPQ NQLQAKFSIP FAVATALLRG ETGPTAFVDE AITSEAIALA ERVTVAVDDE
IAARAPEQRG ARVIVETANE RFSREVVAPR GGEHDPFDEG RLESKFRELV APVIGADRAA
TLWESARAPE PPRVLCTLAR R