Gene Hlac_2123 details

Gene Information

Locus tag: Hlac_2123
Symbol:
ID: 7400643
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2113471
End bp: 2115468
Gene Length: 1998 bp
Protein Length: 665 aa
Translation table: 11
GC content: 73%
IMG OID: 643709193
Product: putative molybdopterin biosynthesis protein MoeA/LysR substrate binding-domain-containing protein
Protein accession: YP_002566770
Protein GI: 222480533
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0303] Molybdopterin biosynthesis enzyme
TIGRFAM ID: [TIGR00177] molybdenum cofactor synthesis domain
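
The length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are assumed 1-based and inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1
```

With the values listed here, `gene_length_bp(2113471, 2115468)` gives 1998 and `protein_length_aa(1998)` gives 665, which agrees with the Gene Length and Protein Length fields.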


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.917147
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACC GCAAGGAGTT CCGCGACCTC GCGACGCCCG AGGCCGCCCG CGAGGCGATC 
GACGACCTCG ATCTCTCGCC GGCGCCGGAG ACAGTCCCCC TAGAGGACGC CCGCGGCCGC
GTCCTCGCGG AGCGGATCGA CGCCGCCATC GACGTGCCGG GGTTCGACCG CGCCTCGATG
GACGGGTACG CGGTCCGCGC CCGCGACACC TTCGGCGCCG ACGAGGCCGA TCCGGCCGAC
CTCGACCTCG TCGGGGCGGT CCACGCTGGC GCGGCGCCCG AGGTCACCGT CGAGCCCGGC
ACCTGCGCCG AGATATCCAC CGGGGCCGTG ATGCCGGACG GCGCTGACGC CGTGGTGATG
GTCGAGCGGA CGGACGAGGT CGGCGGGGAT CCGGACGCCG AGGGAGGCGG CCCCGACCGA
ATCGCGGTCC GGACCGCGGT CGCGCCGGGC GACCACGTGA TGAGCGCGGG CGCCGACATC
GCCGCCGGTG CCCGCGCGCT CGGTCCCGGA ACTCGCCTAA CGCCCCGCGA GATCGGCCTG
CTCTCGGCGC TCGGCGTCGA CGAGGTGCCG GTCGCGGGGA AACCGCGGGT CGGGATCGTC
TCGACCGGCG ACGAGCTGGT TCGCCCCGGA GAGGCGCTCG ATCCCTCGCG CGGCGAGATC
TACGACGTGA ACTCGACGAC GATCGCGGCC GGCGTCGAGG AGGCGGGGGG CGAGCCCGTT
CTCTACCCGC ACGCCGGCGA CGACTACACG GAGATGGAGC GGTTGCTCCG GCGGGCGGCC
GACGAGTGCG ACCTCGTACT CTCGTCCGGA TCGACCTCGG CGAGCGCGGT CGACGTGATC
TACCGCGTGA TCGAGGAGCG TGGCGACCTC CTGCTTCACG GCGTCGCGGT CAAGCCCGGC
AAGCCGATGC TGATCGGGCG ACTCGACCGC GGCGGGAGCG AGGCAGTGGG CGAGAGCGAT
GCCGACGGCG ACACCGACCC TCGTACCGGC GAGTCCGCCT ACGTCGGCCT CCCCGGCTAC
CCAGTCTCTG CGCTCACGAT CTTCCGGACG TTCGTCGCGC CCGCGATCCG CGAGGCGGCT
GGTCAGCCCG AACCCGCGAC GGCAACTATC GAAGGCCGGA TGGCCGTCGG CGAGCGCTAC
GGCGAGGGAC GCATGCGGCT GATGCCGGTC GGACTGCTCG ATCTGAACGA CGGCGATCTG
CCCCTCGTCT ACCCGGTCGA CAAGGGGTCT GGCGCGACGA CGAGCCTCGT CGAGGCCGAC
GGCGTCGTCG CGGTCGACCC GGACACCGAG TACCTCGACC GGGACGAGCG CGTCACGGTG
CAGCTGTTCT CGCCCGACGT GCGTCCGCCG ACCCTGCTCG GCGTCGGCGA GGACGACCCG
GCGCTCAACC GCCTGCTCGA CCGGCTCGAC GCTCCCCGGT ACCTCCCGGT CGGCTCCCGT
GAGGGGCTCC GCCGACTCCG CGACGGCGTG CCCGACGTGG CGGTCGTCGC GGGCCCGACA
GACCGCGACG TCGACGCCGT CGATCTCGGC GGCTGGGCTC GCGAGTGGGG GTTAGTCGTC
CCCGAGGGTA ACCCGGCCGG CGTGACGGGG CTTGCAGACC TCGTCGACGG CGAGATGCGC
TTCGTCAACC GGCCGACCGA CTCGGGGCTC CGCCGGAGCC TCGACGACGC GCTCGCCGAC
CTCGCGACCG ATCGCGACGC GTCGCGCGGT GACCTCGCCG ACCGGATCGA CGGCTACGAG
CTGACAGTTC GCGCGTTCGA GAGCCCGGTC AGGAAGGTGC TCGCGGGCGA CGCCGACGCC
GGACTCGGGC TCCGAGAGAC GGCCGACCGG CTCGACTGTG GGTTCGTCTC GCTCGGCGAG
CAGTCCGTCA CCGTCCGCGC CGCACCCGAC CGGGTCGAGC GGGACGCAGT CGAGGAACTC
GCGGACGCGC TGAATGATCC GACCGACCTC CTCGCCGATC TGGCGGGCTA CTCGCGGAAC
AGTCGGAACG ACCCCTGA
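
The GC content reported in the gene information can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted in as a string in the 10-base grouped layout shown here. The function name is illustrative, and the record does not say how its own percentage was rounded.

```python
def gc_content_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)
```

For example, `gc_content_percent("ATGC")` returns 50.0. Applying it to the full gene sequence gives a value to compare against the listed GC content.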
 
Protein sequence
MSDRKEFRDL ATPEAAREAI DDLDLSPAPE TVPLEDARGR VLAERIDAAI DVPGFDRASM 
DGYAVRARDT FGADEADPAD LDLVGAVHAG AAPEVTVEPG TCAEISTGAV MPDGADAVVM
VERTDEVGGD PDAEGGGPDR IAVRTAVAPG DHVMSAGADI AAGARALGPG TRLTPREIGL
LSALGVDEVP VAGKPRVGIV STGDELVRPG EALDPSRGEI YDVNSTTIAA GVEEAGGEPV
LYPHAGDDYT EMERLLRRAA DECDLVLSSG STSASAVDVI YRVIEERGDL LLHGVAVKPG
KPMLIGRLDR GGSEAVGESD ADGDTDPRTG ESAYVGLPGY PVSALTIFRT FVAPAIREAA
GQPEPATATI EGRMAVGERY GEGRMRLMPV GLLDLNDGDL PLVYPVDKGS GATTSLVEAD
GVVAVDPDTE YLDRDERVTV QLFSPDVRPP TLLGVGEDDP ALNRLLDRLD APRYLPVGSR
EGLRRLRDGV PDVAVVAGPT DRDVDAVDLG GWAREWGLVV PEGNPAGVTG LADLVDGEMR
FVNRPTDSGL RRSLDDALAD LATDRDASRG DLADRIDGYE LTVRAFESPV RKVLAGDADA
GLGLRETADR LDCGFVSLGE QSVTVRAAPD RVERDAVEEL ADALNDPTDL LADLAGYSRN
SRNDP
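
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which start codons are permitted, so the sketch below builds the standard table. It is an illustration under these assumptions, not the database's own method: `translate` is a hypothetical helper, it does not force an alternative start codon to methionine, and it raises `KeyError` on ambiguous bases.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)
```

As a spot check, the first six codons of the gene sequence, `ATGAGCGACC GCAAGGAG`, translate to `MSDRKE`, and the final line, `AGTCGGAACG ACCCCTGA`, translates to `SRNDP` before the TGA stop codon. Both match the start and end of the protein sequence listed above.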