Gene Hlac_2331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2331 
Symbol 
ID7401948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2328965 
End bp2330536 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content71% 
IMG OID643709404 
ProductHTTM domain protein 
Protein accessionYP_002566977 
Protein GI222480740 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.524598 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACGAAC GTCGAGACGC TGCCGTCGAC GCCTCCGGAT CCGCGGCCGA TGGTGCCGGA 
TCCACGATCG ACGACGCCGA GCCCTCGGTC CGCGGTCGTC TCCCCGCCGC GCTCCGGCGC
CGGATCGGGA TCGACGTGCG CGCGCTCGCG GCGTTCCGGA TCGCGCTCGG CGCCGTCCTC
CTCGTTGACC TCGCGCTGCG CGCCCGGAAC TTGACGGCCT TTTACACCGA CGCCGGCGTA
CTCCCCCGAT CGCTGCTCGC GGAGTCGTCC CCCCTCGCGC GATTCTCGCT GTACGCGGTC
TCCGGTGAGG CGTGGTTCGT CGGGCTGCTG TTCCTCATCG CCGCCGTCGC CGCCGTCGCG
CTCGCGGTCG GCTACCGGAC ACGGATCGCG GCTGCGGTCT CGCTGGTCCT GCTCGCGTCG
CTGCAGGCGC GGAACCCGTT CGTGCTCAAC GCCGGCGACA CGCTCCTCTG GCAGCTGCTC
GGGGCGGGCT TGCTGTGTCC CCTCGGCGCG CGCTGGTCGG TGGATGCTGT CCGGAGGCGC
GCCGCGTTGG GAGGGCGATC CCTGCCCGAA AGCAGCCGAT TTACCGGCCC CCAATCAGCC
CTCCTGTTGA CCGTCGTCGT CGCGGTCTAC GTCTCCAACG CAGTCGTGAA GCTCCGCGGC
GAGGCGTGGC CCGCGGGCGA GGCGGTCGGG ACCGTCTTCC GCCTCACGTA CCTCCACGGC
CCGCTCGGGG GACTGATGCC CGAGAGCCCG GCGCTGCTCG CGGCCGTCAC CTACGGCTGG
CTCGCGCTAC TCGTCGCGTC GCCGTTACTC GTCGCGGCCG CCGGACGAGT CCGGGCTGCG
CTCGCCGGCA TCCTCGTCGC TGCCCATCTC TCGATGGCGT TCACGCTCCA GATCGGCGTC
TTTCCGGTGG TGTCGGCGAC CGCACTGCTA CCCTTCTGTC CGCCGTTCGT CTGGGACCGG
ATCGAGTCGC TGGCCGCTCC GGAGATCGGG CGGTTCCGGT CGATGGCAGA GCGCCTCCTC
CGTTCCCTCC GGTCGACGCG ACCCGGATCG ACCCTCGTCG ATCTGGCCTC CAAAATTGTT
CCCGACAGAG CGACCCGCGA ACGTCTCGTC GCCGTCATCG CCGCGCTCCT GCTCGTCTCG
CTGCTCGCGT GGACCGCCAT GGGGGTCGGA GTCGTCGACG CGCCAGAGCC CGTCGTGGCG
GTGTCAGATC CGGCCGAGAG CGACTGGGAT ATGTTCGCGC CGGAGCCGCC GTCGACCGAC
GCGCTCGTGC TCGCGACGGC GACGACCGCC GACGGCGACC GGACCGATGC GTTGCACGGC
GACCCGGTCG CGACCGACCG CACCCCGTCC GACGCGCGGG GATATCCCAC CGCCCGCTGG
CGGAAGCACT TCTCGCTGCT GTCGGCCGAC GATACCGATC GCATCGACGC GACGCTCGCG
CACCTCTGTG ACCGCGCAGC GGGATTTTCC GGCGCGGAGA CGGAGGCGGT GACGGTCTCC
GCTGTCGAAG TCGACGTCGT CGGGAGCGAG GAGATCTGGG TCCGAGAAGC CGGTACGCGT
GAGTGCCGGT GA
 
Protein sequence
MDERRDAAVD ASGSAADGAG STIDDAEPSV RGRLPAALRR RIGIDVRALA AFRIALGAVL 
LVDLALRARN LTAFYTDAGV LPRSLLAESS PLARFSLYAV SGEAWFVGLL FLIAAVAAVA
LAVGYRTRIA AAVSLVLLAS LQARNPFVLN AGDTLLWQLL GAGLLCPLGA RWSVDAVRRR
AALGGRSLPE SSRFTGPQSA LLLTVVVAVY VSNAVVKLRG EAWPAGEAVG TVFRLTYLHG
PLGGLMPESP ALLAAVTYGW LALLVASPLL VAAAGRVRAA LAGILVAAHL SMAFTLQIGV
FPVVSATALL PFCPPFVWDR IESLAAPEIG RFRSMAERLL RSLRSTRPGS TLVDLASKIV
PDRATRERLV AVIAALLLVS LLAWTAMGVG VVDAPEPVVA VSDPAESDWD MFAPEPPSTD
ALVLATATTA DGDRTDALHG DPVATDRTPS DARGYPTARW RKHFSLLSAD DTDRIDATLA
HLCDRAAGFS GAETEAVTVS AVEVDVVGSE EIWVREAGTR ECR