Gene Hlac_2743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2743 
Symbol 
ID7401354 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2731570 
End bp2732751 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content67% 
IMG OID643709818 
Productaspartate kinase 
Protein accessionYP_002567384 
Protein GI222481147 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGTAG TCGCCAAGTT CGGCGGCACG AGCCTCGGGA GCGGCGACCG GATCGAGCGC 
GCGGCGGACT CGGTCGCGAA CGCGGTCGCG GCGGGCCACG AGATCGCAGT CGTCGCGAGC
GCGATGGGGT CGACGACCGA CGACCTCCTC GACGACATCA CCTTCGAGAC GGACGACGCC
GACCGCGCCG AGATCGTCTC GATGGGCGAG CGCACCAGCG TCCGAATGCT GAAGGCGGCG
CTCTCGGTTC GCGATGTCGA CGCCGTCTTC CTCGAACCCG GCCACCCGGA CTGGCCCGTC
ATCACGAACG AGGTCGGCGA GGTCGACGTT GAGGAGACGA AGAAACGCGC GCGCAAGATC
GCCGCCCGCA TGGATGGGGT CGTCCCCATC ATCACCGGGT TCCTCGCGGA GGACCACGAC
GGGAACGTCA CGACGCTCGG GCGCGGCGGG TCCGACACCA CTGCCGTGAT GCTCGGCAAC
TACATGGACG CCGACGAGGT CGTGATCGTC ACCGACGTCG AGGGCGTAAT GACCGGCGAT
CCGCGGGTGG TCGAGGGCGC GCGCAACGTC GGGCAGATCA CCGTCGACGA GCTACGGAAC
CTCTCGTTCC GCGGCGCGGA GGTCGTCGCG CCGTCCGCGC TCTCGTACAA GGACGAGGAC
CTCGCAGTCA GAGTCGTCCA CTACCAGCAC GGCGACCTGC TCAGAGGCGG TACCCGGATT
GAAGGCGAGT TCGAGAGTCT GATCGACATG CGCGAAGAAC CGCTCGCGTG TCTCACCATC
GCCGGTCGCG CGATCCGAAA CCGCTCGGGT ATCCTCTCGC AGCTCGCGAA CGCGCTCCGC
GAGGAGGAGA TCAACATCGA TGCGGTCGCC TCCGGAATGG ACTCGGTCAC CTTCTACGTC
GACGTCGACG TGGCCGAGAC GGCCGAGGCG CTACTCCACG AGGCCGTCGT CGAGGACGAG
GCGCTCTCCT CGGTGACCGT CGCCGACCCG ATCGCGGTCA TTCGGGTGAC CGGCGGCGAA
CTCCCGAACC AGTCCGGCGT CATCCAGGAG ATCATCGCGC CCCTCGCCGA CGACGGGATC
AACATCATCG ACCTGATCAC GAGCGCGACC TCCGTCGCGG TATTCGTCGA CTGGGACGAT
CGCGAAGACG CGCTCGAAAT CGTTCAGAGC CGGTTCGACT GA
 
Protein sequence
MRVVAKFGGT SLGSGDRIER AADSVANAVA AGHEIAVVAS AMGSTTDDLL DDITFETDDA 
DRAEIVSMGE RTSVRMLKAA LSVRDVDAVF LEPGHPDWPV ITNEVGEVDV EETKKRARKI
AARMDGVVPI ITGFLAEDHD GNVTTLGRGG SDTTAVMLGN YMDADEVVIV TDVEGVMTGD
PRVVEGARNV GQITVDELRN LSFRGAEVVA PSALSYKDED LAVRVVHYQH GDLLRGGTRI
EGEFESLIDM REEPLACLTI AGRAIRNRSG ILSQLANALR EEEINIDAVA SGMDSVTFYV
DVDVAETAEA LLHEAVVEDE ALSSVTVADP IAVIRVTGGE LPNQSGVIQE IIAPLADDGI
NIIDLITSAT SVAVFVDWDD REDALEIVQS RFD