Gene Hlac_2122 details


Gene Information

Locus tag: Hlac_2122
Symbol:
ID: 7400642
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2112218
End bp: 2113474
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 72%
IMG OID: 643709192
Product: molybdenum cofactor synthesis domain protein
Protein accession: YP_002566769
Protein GI: 222480532
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0303] Molybdopterin biosynthesis enzyme
TIGRFAM ID: [TIGR00177] molybdenum cofactor synthesis domain
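
The length fields in this record are consistent with its coordinates. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon of the unspliced CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2112218
end_bp = 2113474

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the codon count.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1257
print(protein_length)  # 418
```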


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCACG ACAGACGCGA GGCCGGATTC AAGCGACGGA CCCGCGTCGC GGACGCGCTG 
GCGACGCTAC TCGACGCCGC CGAGCCCCAC GGCCGGACCG AGTCGGCGCC GCTCGCCGAC
GCGGACGGAC GGGTCGTCGC CGAGCCGATC GACGCGCCGG CGCCGGTCCC GAGCTATGAC
CGGGCAGCGA TGGACGGGTA CGCGGTCCGC GCCGAGGATA CGTTCGGTGC GTCCGACCGG
TCACCGGCGG TGCTGCGGGC GGTCGGCACC GAAGCCGACT CGATCGCGCC GGGCGAGGCC
GCGCGGGTGC ACACGGGCAG CGCGGTCCCG GAAGGGGCCG ACGCAGTCGT GATGATCGAG
CAGGTGGAAA CGGTCGCGGA CGAGGTCGAG GCGTTCGACG CGGTCGCCGA GGGAGAGAAC
GTCGGTGAGG CGGGCGAAGA CGTGGCGGAC GGACAGCGCC TCTTCGAGGC CGGCCACGTC
CTCCGGCCCT CCGACCTCGG CCTCCTGAAG TCGGTCGGAC TCGACGCGGC CCCCGTCTAC
GAGCGCTCGA CCATCGCCGT GATCCCGACC GGCGAGGAGC TGGTGCAGTC CGATCCCGGC
CCGGGCGAGG TGATCGAGAC GAACGGGCTG ACGGTCTCGC GGCTGGTCGA GCGGTGGGGC
GGAGAGGCCC GCTACCGCGA CGTGGTCACC GACGACGAGG ACGCGCTCTC GGCCGCGATC
GAGGCCGACC TCGACGCCGA CGTTGTCGTC ACCACCGGGG GGTCCTCGGT TGGCGAGCGC
GACTTGCTCC CGGAGGTCAT CGATTCTATC GGGGAGGTAC TTGTCCACGG CGTCGCCCTG
AAGCCGGGGC ATCCGGTCTG TCTCGGCGTC GTCGACGACA CCCCGATCGT CTCGCTTCCC
GGCTACCCGG TCGCCTGTAT CGTCAACGCC GCGCAGTTCC TCCGGCCGCT CCAGAAGCAC
GTCGGCGGAA CGACCGCGAA CCCGTTCCCG ACGCGGCGCG CGACGCTTAC CCGAAAGGTA
CCGAGCGAGC CCGGCACGCG AACGTTCGCG CGGGTGTCGG TGGAGGGAAC CGGCGAAGGC
GGCGACGGGG ACGGAGACGA CTCGGCCCTC CCCGCCGCCA CGCCGACCCG CGCCAGCGGT
TCTGGCATTC TGTCGAGCGT CGCGCTCGCT GACGGCTGGG TCGTCGTCCC CGAACCGCGA
GAGGGGCTCG ACGCGGGCGA GATCGTCGAC GTGGAGCTGT GGGAGGCGAG CCAATGA
 
Protein sequence
MSHDRREAGF KRRTRVADAL ATLLDAAEPH GRTESAPLAD ADGRVVAEPI DAPAPVPSYD 
RAAMDGYAVR AEDTFGASDR SPAVLRAVGT EADSIAPGEA ARVHTGSAVP EGADAVVMIE
QVETVADEVE AFDAVAEGEN VGEAGEDVAD GQRLFEAGHV LRPSDLGLLK SVGLDAAPVY
ERSTIAVIPT GEELVQSDPG PGEVIETNGL TVSRLVERWG GEARYRDVVT DDEDALSAAI
EADLDADVVV TTGGSSVGER DLLPEVIDSI GEVLVHGVAL KPGHPVCLGV VDDTPIVSLP
GYPVACIVNA AQFLRPLQKH VGGTTANPFP TRRATLTRKV PSEPGTRTFA RVSVEGTGEG
GDGDGDDSAL PAATPTRASG SGILSSVALA DGWVVVPEPR EGLDAGEIVD VELWEASQ
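
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage for comparison with the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence from this record, used as sample input.
first_line = "ATGAGCCACG ACAGACGCGA GGCCGGATTC AAGCGACGGA CCCGCGTCGC GGACGCGCTG"
dna = clean_sequence(first_line)

print(len(dna))                   # 60
print(round(gc_percent(dna), 1))  # GC percentage of this line only
```

Passing the full gene sequence block to `clean_sequence` should give a string whose length matches the Gene Length field.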