Gene Hlac_1897 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1897 
Symbol 
ID7400091 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1898149 
End bp1899648 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content66% 
IMG OID643708968 
Productphytoene desaturase 
Protein accessionYP_002566545 
Protein GI222480308 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID[TIGR02734] phytoene desaturase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.29228 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGTGG CCCCGGACAT CGACGCCGTG GCCGGCGAGT CCGTCGTCGT GATCGGCGGC 
GGATTCGGCG GCCTCTCGAC CGCCTGTTAC CTCGCTGACG CCGGCGCGGA TGTGACCCTC
TTGGAGAAGA ACGAACAGCT CGGCGGGCGC GCGAGCCGGC TGGAGGCCGA CGGCTTCCGG
TTCGACATGG GGCCGTCGTG GTACCTCATG CCCGACGTGT TCGAACGGTT CTTCGGCCAC
TTCGGACGCG AACCGTCGGA GTTCTACGAA CTGGAGCGAC TCGATCCCCA CTACCGGGTG
TTCTGGAAGG ACGGCGACAA GGTCGACGTG CTCCCCGACC GCGAGGCGAA CAAGGCCCTC
TTCGAGGAGT ACGAGCCCGG TGCGGGCGAG GCGTTCGAAG CGTATCTGGA GGAGTCCGAA
CGCACCTACG AGATCGGGAT GGAACACTTC GTGTACGAGG ACCGTCCGCG GCTCCGCGAC
TACGTCGACA AGGACGTGCT CCGGTACTCG TGGGGGCTCT CGCTTCTCGG CAAGATGCAG
GGACACGTCG AGGACTACTT CGACCACCCG AAGCTCCAGC AGCTGATGCA GTACACCCTC
GTCTTCCTCG GTGGCTCCCC GACGAACACC CCCGCGCTGT ACAACCTGAT GAGCCACGTC
GACTACAACA TGGGCGTCTA CTACCCCGAA GGCGGGATCG GCGCCGTCGT CGACGGCATC
GTCGAACTCG GCGAGGACCT CGGCGTCGAG TTCGTCACCG ACGCCGAGGT GACGGCCATC
GAGGGACGCC GCGGGGGCTT CGCGCTCGAC ACCGCCGACG GCGAGCGCTA CCTCACGGAT
CTGGTCGTCT CCGACGCCGA CTACGCCCAC ACCGAACAGG AGCTGCTCCC GAAACACAAG
CGCCAGTACA GCGACGAGTA CTGGGAGTCG CGGACGTACG CGCCCTCCGC GTTCCTGCTG
TACCTCGGCG TCGAGGGCGA CGTGCCGAAC CTCGAACACC ACACGCTGGT GTTGCCGACG
GGGTGGAACC ACCACTTCGA GCAGATATTC GACGACCCAG CGTGGCCCGA TGACCCCGCT
TACTACCTCT GTGCGCCCTC CGAGACCGAC GACACGGTCG CGCCAGAGGG ACACAGCAAC
CTGTTCGCCT TGGTCCCCAT CGCGCCCGGG CTGGAGGACA CTCCCGAACT GCGCGAAGAG
TACCGCGACC TCGTGTTGGA CGACATCGCC GAGAACACCG ACACGGAGCT GCGCGACCGG
ATCGTCTTCG AGGAGACGTT CTGCGTCGAC GACTTCGCGG ACCGGTACAA CAGCTACCAG
GGAAGCGCGC TCGGGCTGGC ACACACCCTG CGACAGACTT CGCTGCTTCG CCCGCCCCAC
CGGTCCGACG CGCTCGACGG GCTCTACTTC ACGGGGTCGA CGACGACCCC CGGTATCGGT
GTCCCGATGT GTCTCATCAG CGGACAGCTG ACCGCCGAGG AGCTGTCGAA GACGGCGTGA
 
Protein sequence
MDVAPDIDAV AGESVVVIGG GFGGLSTACY LADAGADVTL LEKNEQLGGR ASRLEADGFR 
FDMGPSWYLM PDVFERFFGH FGREPSEFYE LERLDPHYRV FWKDGDKVDV LPDREANKAL
FEEYEPGAGE AFEAYLEESE RTYEIGMEHF VYEDRPRLRD YVDKDVLRYS WGLSLLGKMQ
GHVEDYFDHP KLQQLMQYTL VFLGGSPTNT PALYNLMSHV DYNMGVYYPE GGIGAVVDGI
VELGEDLGVE FVTDAEVTAI EGRRGGFALD TADGERYLTD LVVSDADYAH TEQELLPKHK
RQYSDEYWES RTYAPSAFLL YLGVEGDVPN LEHHTLVLPT GWNHHFEQIF DDPAWPDDPA
YYLCAPSETD DTVAPEGHSN LFALVPIAPG LEDTPELREE YRDLVLDDIA ENTDTELRDR
IVFEETFCVD DFADRYNSYQ GSALGLAHTL RQTSLLRPPH RSDALDGLYF TGSTTTPGIG
VPMCLISGQL TAEELSKTA