Gene Hlac_2038 details

Gene Information

Locus tag: Hlac_2038
Symbol:
ID: 7402057
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2030938
End bp: 2031939
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 76%
IMG OID: 643709109
Product: beta-ribofuranosylaminobenzene 5'-phosphate synthase family
Protein accession: YP_002566686
Protein GI: 222480449
COG category: [R] General function prediction only
COG ID: [COG1907] Predicted archaeal sugar kinases
TIGRFAM ID: [TIGR00144] beta-RFAP synthase
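
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the protein length excludes the stop codon. The function and variable names are illustrative and are not part of the source database.

```python
# Values copied from the gene record; variable names are hypothetical.
start_bp = 2030938
end_bp = 2031939
gene_length_bp = 1002
protein_length_aa = 333


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

With these assumptions, the 1002 bp span gives 334 codons, one of which is the stop codon, which matches the listed 333 aa.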


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.172756
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.249661
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACGCG TGAGCGCCGG GGCGCGGCTC CACTTCGGCT TCTGTAACCT CAGCCTCTCG 
CACGAGCGGC TGTACGGCGC CCTCGGGCTC GGGCTCGCGG AACCTCGCGT CGTCGTCGAC
GCGGAGCCAG ATTCGGAGAT AACCGTCGCT GTCGAGACGC CCACAACTGA TTCAGCCACC
CGCAACGACA TCCGCGAGTA CGCCACCACC GCCACCGACC TGCTCGGCGT CGATGGCGCC
CAAATCACGG TCCACGAGAC GCTCCCGCGC CACGCCGGAC TCGGGAGCGG CACCCAGCTG
GCCGCGGCGA CGCTCGCAGC GGTCGCGGCC GCCCACGGGA AGGACCCCCG CGTCCGCGAG
CGCGCCCCGG CGCTCGGCCG CGGCGGGCGC TCGGGCGTCG GCGTCGCGAC CTTCGAGGCG
GGCGGGTTCG TGCTCGACGC GGGCCACCCC ACCGCACGGT TCACCACCGA CCGCCCCGCC
GACGGCGAGT GGACCGTGCC GCCGGTGGCC GCCCGCCACG CCGTCCCCGA CGACTGGCGG
TTCCTGCTCG TGCGCCCCGA CGCCGACCCA GGCCGGAGCG GTGACGCCGA GGACGACGCG
ATGCGGACCG CGGTCGAGCG GGCGGAACCC GGGCTCGCAG ACCGGATCGG CGGGATCGTC
ACCCGGCGCG TGCTCCCCGC GATCGCGACC GGGAACGCCG AGCGCTTCGG CGCCGCAGTC
GCGGAGATCG GCCGGCTCAA CGGTGCGTGG TACGCCGACG AACAGGGCGG GGTCTACCGC
CCGCCGGTCG GCGACGTGGT CGCATCGCTG TCGGACGCCG CGGCCGTGTT CGGCGCCGGG
CAGTCGTCGT GGGGGCCGAC CGTATACGGA ATCACGGACG CCGCGAACGC GACTGCGGCC
GCGAGCGCGG GCGAGCGCGC CCTCGACGAG GCGGGCGTTG ACGGGTCGGT CTCGGTCGTC
GAGGCGGCCA ACGGCGGGGC GCGGGTGACG GGGCGGGAGT GA
 
Protein sequence
MARVSAGARL HFGFCNLSLS HERLYGALGL GLAEPRVVVD AEPDSEITVA VETPTTDSAT 
RNDIREYATT ATDLLGVDGA QITVHETLPR HAGLGSGTQL AAATLAAVAA AHGKDPRVRE
RAPALGRGGR SGVGVATFEA GGFVLDAGHP TARFTTDRPA DGEWTVPPVA ARHAVPDDWR
FLLVRPDADP GRSGDAEDDA MRTAVERAEP GLADRIGGIV TRRVLPAIAT GNAERFGAAV
AEIGRLNGAW YADEQGGVYR PPVGDVVASL SDAAAVFGAG QSSWGPTVYG ITDAANATAA
ASAGERALDE AGVDGSVSVV EAANGGARVT GRE
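
The sequences above are printed in blocks of ten characters, so they need whitespace removed before any analysis. The following is a rough sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares; it does not handle alternative start codons, which is sufficient here only because this gene begins with ATG. All function names are hypothetical.

```python
# Standard codon table built from the conventional TCAG ordering.
# Translation table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons are not remapped to M in this sketch.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGGCACGCG TGAGCGCCGG GGCGCGGCTC"))  # MARVSAGARL
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein listing and the reported GC content.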