Gene Hlac_0046 details

Gene Information

Locus tag: Hlac_0046
Symbol:
ID: 7401399
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 47675
End bp: 48769
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 70%
IMG OID: 643707105
Product: hypothetical protein
Protein accession: YP_002564722
Protein GI: 222478485
COG category:
COG ID:
TIGRFAM ID:
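
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. The helper names `gene_length` and `protein_length` are hypothetical and not part of the record. The sketch assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(47675, 48769)   # Start bp and End bp from the record
print(length, protein_length(length))
```

With the values in this record, the sketch reproduces the listed Gene Length (1095 bp) and Protein Length (364 aa).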


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.56451
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0146014
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGACTCA GGTGTCTGCT CGGGCACGAC TTCGGCGAGC CCGAACTACG GCGCGAGCGC 
GAGGAGGACG GGAACGAGGT TGTCACCACC GTCACCGAGG TAAAGACCTG CGCTCGCTGC
GGCGAGACGC AGGTGGTCAG CGAGAACACT GAGGTCACGA CGATGAAACA GCTGACCGAT
GAGGCCACCG TCGTGGGCGA CGAGCCGACG GGGCCCGACG CCGATTCGGA CCGCGAGACT
CCGGTAACCG GCGTCGAGGG GACCGGTCCC GACGGCGATA TCGACGGCGA CGACGCCGTG
ATCATCGGCA ACAGCCCCGA GGACGGCGAC GACACGGCCG ACATCCCCGC AGAGCCGGGA
GCGGCCGACG CCAGGACACC GGAGACGAAA CCGGGCGATA CGACCGCGTC GGAGTCGGAA
ACAGAAGCGG ACGTGGAGGC GGGAGCGGCC GGCGATGACG GCGGGGCAGA GCTGATCGAC
GAAGGGCCGT CGGGCGCGGG CGACGACGAC AGCAACGGCA GCCTCGAACG CGACGATGGT
GAGTACGCGG CGTACCCGGA GGCCGAGACG ACGGAGCCGA CCGCCGACGA GGAGCGCGCC
GAGACCGACG ACGGCGTGAT TCTCGACGAG GAGGGCGAAG ACGCTGACGA CCGCGAGCGC
GGCGCGTGGC CCGACGTGGA CGAGTCGGAC GAGGGTGGTG AGGAGCCGAC CCCGTGGCCC
GAACACGGCG GCGAAGACGA GGGGTTCAGC GCCGAGCTAG ACGACGGCAA CACGGGCGAC
GTGGAGTTCG GCGGGGGGCT CACGCCCGAG GCCGCCGACC AGCCGACCGA CGGTGAGGAC
GCGGACTACG TCGAGGCACC GGCGCAGACA GCGGTCGAAG CGAACGGTGC GGCCGAGACC
GGCAGCGCAG TCGACGACGG CGTCGGGATC ACCCGCGGCG ACAGCCCGGA CCTCGAAACG
TCGACCTCAG AGGTGACGAC AGAGTACTAC TGTCCCGAGT GCGAGATGAC TCGCGCCGCC
GACGGCAACT CCATGCGCGC GGGCGATATC TGTCCGGAGT GCAAGCGCGG GTACGTCGAC
GAGCGACCAA TCTAA
 
Protein sequence
MGLRCLLGHD FGEPELRRER EEDGNEVVTT VTEVKTCARC GETQVVSENT EVTTMKQLTD 
EATVVGDEPT GPDADSDRET PVTGVEGTGP DGDIDGDDAV IIGNSPEDGD DTADIPAEPG
AADARTPETK PGDTTASESE TEADVEAGAA GDDGGAELID EGPSGAGDDD SNGSLERDDG
EYAAYPEAET TEPTADEERA ETDDGVILDE EGEDADDRER GAWPDVDESD EGGEEPTPWP
EHGGEDEGFS AELDDGNTGD VEFGGGLTPE AADQPTDGED ADYVEAPAQT AVEANGAAET
GSAVDDGVGI TRGDSPDLET STSEVTTEYY CPECEMTRAA DGNSMRAGDI CPECKRGYVD
ERPI
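
The relationship between the two sequences can be illustrated with a minimal sketch that translates the gene sequence using NCBI translation table 11 and computes its GC fraction. The function names `translate` and `gc_fraction` are hypothetical. The sketch assumes the input contains only A, C, G and T (whitespace is ignored), starts in frame, and should stop at the first stop codon. It does not handle the alternative start codons that table 11 allows, which does not matter here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of NCBI translation table 11, in the standard codon order
# TTT, TTC, TTA, TTG, TCT, ... GGG ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 until the first stop codon or the end of the input."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGGACTCA GGTGTCTGCT CGGGCACGAC"))  # MGLRCLLGHD
```

Passing the full gene sequence to `translate` should give the protein sequence listed above, and `gc_fraction` can be compared against the reported GC content.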