Gene Hlac_1387 details

Gene Information

Locus tag: Hlac_1387
Symbol:
ID: 7400706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1396004
End bp: 1397221
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 69%
IMG OID: 643708448
Product: dihydropteroate synthase
Protein accession: YP_002566045
Protein GI: 222479808
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0294] Dihydropteroate synthase and related enzymes
TIGRFAM ID: [TIGR01496] dihydropteroate synthase
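
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical and not part of any database API, and it assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1396004, 1397221)
print(length, protein_length(length))  # prints: 1218 405
```

Both results agree with the Gene Length and Protein Length fields.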


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.398618
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAAACG TGGACGCCGC GGGGCTCGAG ATCGGCGACG ACCACCCGCC TCGGATCATG 
GGCGTACTCA ACGTCTCCGC GGAGTCGCCG TACGACCCGA GCGTGTACGA CGACCCGGGC
GAGGCCGCCG AGTACGTCGA CGAGGAGCTG ATCGGCGAGG GCGCCGACAT CGTCGACGTC
GGGCTCGAAT CGGCCAACAA GGACTTAGAC GTGCTCTCGG CCGAACAGGA GTTGGATCGG
CTCGACACCG CGATCGAGAC GCTGGAGTCG ACCTCGGGCG ACGCCGTCTG GTCGATCGAG
ACCCGCTACC ACGAGGTCGC CGACGAGGCG CTTGCACGCG GGTTCGACAT GGTCAACGAC
ATCTGCGGCT TCGCCGATCC CGAGATGCCC CGCGTCTGCC GCGAACACGA CGCGGCCGTC
TCGAAGATGG CCTCGCCGCC AGATCTGGAG CGACCGGGTG CCATCGAGGA CGTGGACGAG
ATCTACGAAG CGCTGTCGAT GAACGGCCTC ACCGACAAGA CGATCCTCGA CCCCGCGTTC
GGTGGCTGGT CGAAGGCAAA AACCCACGCC GACGACCGCG AGACGTTCCA CCGGCTACGG
GAGTTCCGCG GCTACGGTCG CCCGCTGCTC GTCTCGATCA ACCGCAAGAG CTTCCTCAAG
ACGATCGCGG GACGGAGTAC CGAGGAGGCC CTTCCGGTGT CGCTCGCCGC CACCTCGATG
GCAGTCGAGC GCGGCGCACA CGTGATCCGC ACCCACGATG TGGCCGAGAC GCGGGACGCG
GCACTCGTCG GCGCCGAGTT CGCCCGCGAT CGGGTCCGTT CGGACGACGG GCCCAGCGAC
ATCGCCGTCG AGGAACTCGA CGTGACGACC GTTCGGGAGG CCGAGCGCCA CCTCGACCGG
CTGGACGCCG ACCAGTCCGT CGCCGGCGAC GCGGCCGTTC GCACCTACGA GCTACGCGGG
CTCACCGACG AGGCCGTCGG CGCGCTCCGA GCGGCGACCG CCGAGCCCGG CGTCGGCGCG
GCGTTCGCCC TCGCCGGTTC CGACGCCGCC GAGACCGCGG TTCCCCCCTC GGCCACCGAC
GGCGGGGCCC CCACAAATGG CGAATCCGGA CTGCTCGTCG GAACAGTAGC CGCGCTGTCT
GCGGTTCGAT CGGCCGTTTC GGGCGTTTCA GACGCGCTCG ACGCCGCGCT GGAATCGATC
GACGACGGCT CCAAGTAA
 
Protein sequence
MRNVDAAGLE IGDDHPPRIM GVLNVSAESP YDPSVYDDPG EAAEYVDEEL IGEGADIVDV 
GLESANKDLD VLSAEQELDR LDTAIETLES TSGDAVWSIE TRYHEVADEA LARGFDMVND
ICGFADPEMP RVCREHDAAV SKMASPPDLE RPGAIEDVDE IYEALSMNGL TDKTILDPAF
GGWSKAKTHA DDRETFHRLR EFRGYGRPLL VSINRKSFLK TIAGRSTEEA LPVSLAATSM
AVERGAHVIR THDVAETRDA ALVGAEFARD RVRSDDGPSD IAVEELDVTT VREAERHLDR
LDADQSVAGD AAVRTYELRG LTDEAVGALR AATAEPGVGA AFALAGSDAA ETAVPPSATD
GGAPTNGESG LLVGTVAALS AVRSAVSGVS DALDAALESI DDGSK
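
The sequences above are laid out in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that step, with a GC-content helper; `clean_sequence` and `gc_percent` are hypothetical names, and only the first and last lines of the gene sequence are included, to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGCGAAACG TGGACGCCGC GGGGCTCGAG ATCGGCGACG ACCACCCGCC TCGGATCATG"
last_line = "GACGACGGCT CCAAGTAA"

print(len(clean_sequence(first_line)))            # 60 bases per full line
print(clean_sequence(first_line)[:3])             # ATG start codon
print(clean_sequence(last_line).endswith("TAA"))  # True: ends in a stop codon
```

Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content field of the record.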