Gene Hlac_1140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1140 
Symbol 
ID7400949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1146596 
End bp1147507 
Gene Length912 bp 
Protein Length303 aa 
Translation table11 
GC content62% 
IMG OID643708205 
Productdiacylglycerol kinase catalytic region 
Protein accessionYP_002565804 
Protein GI222479567 
COG category[I] Lipid transport and metabolism
[R] General function prediction only 
COG ID[COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase 
TIGRFAM ID[TIGR00147] lipid kinase, YegS/Rv2252/BmrU family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGACA CCGTGATTAT CTACAACCCG CAAAGCGGAG GCGGCTCACA TGCCGACGAC 
GTCGAGGACC GCGCGGATCT GAGCGGGTAT GCAGTCGAAC GGTCTGAACA CGCCGGCGAA
GCGGTCACGC TGACACAGGA GGCTATCGAG GCCGGATACT CGACGATCGT GGCCGGCGGC
GGTGACGGGA CGGTCAACGA GGTTGTTCAG GGGATCGACC GGGCCGACGC GTTCGACGAC
GTCACGTTCG GCATCCTGCC TCTCGGGACG GGTAACAACT TCGCGAAGCA GATCGGCATT
ACCGATCTCG AAACCGCGTT CATTGCCCTC GATGACGGTG TCAGGCGTAC TATCGATATC
GGGATGGCAA CCGATCGGCC CTTCGTGAAC TCCTGTGTCG CCGGGCTAAC CGCCGAGTCA
GTGAGCGGGA CGTCCGGAGC GTTGAAGTCT CGTATCGGTG GGTTGGCATA CGTGCTCACG
ACGCTCCGGA CCGTGACCGA TTTCGAGCCG CTACAGCTTA CGATTGATAA CGAGATGAGC
GACGGCGACA CGCCGACGTG GAGCGGTGAA GCGCTCTGTG TGGTGGTCGG GAACGGCCGT
CAGTTCGCGG CGGACGGGAC GACACAGGCC AACATGGAGG ACGGTCTCTT CGAGGTCGCG
ATCGTCACGG ACGTGCCCGC GATTGATCTG ATGAGTGATG CGGTACTTGA GCGCCTGTTC
GGCCAGGACT CGCCACACAT CGACCGGTTC CAAGCCGCAT CGGTGGATAT CAGGGGCCAC
TCGTCGGACC CCATCAGATT CAGCGTGGAC GGGGAGACCA TCGAGCAACG CGACCTCGTG
CTCACTGTTC GACCGAACAG GCTGCGGCTC GTCGTCGGGG AGGGATACGA CCCCTCTCCG
ATGGACACGT GA
 
Protein sequence
MADTVIIYNP QSGGGSHADD VEDRADLSGY AVERSEHAGE AVTLTQEAIE AGYSTIVAGG 
GDGTVNEVVQ GIDRADAFDD VTFGILPLGT GNNFAKQIGI TDLETAFIAL DDGVRRTIDI
GMATDRPFVN SCVAGLTAES VSGTSGALKS RIGGLAYVLT TLRTVTDFEP LQLTIDNEMS
DGDTPTWSGE ALCVVVGNGR QFAADGTTQA NMEDGLFEVA IVTDVPAIDL MSDAVLERLF
GQDSPHIDRF QAASVDIRGH SSDPIRFSVD GETIEQRDLV LTVRPNRLRL VVGEGYDPSP
MDT