Gene Hlac_1974 details

Gene Information

Locus tag: Hlac_1974
Symbol:
ID: 7399926
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1971630
End bp: 1972874
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 67%
IMG OID: 643709045
Product: alpha-L-glutamate ligase, RimK family
Protein accession: YP_002566622
Protein GI: 222480385
COG category: [H] Coenzyme transport and metabolism; [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0189] Glutathione synthase/Ribosomal protein S6 modification enzyme (glutaminyl transferase)
TIGRFAM ID: [TIGR00768] alpha-L-glutamate ligases, RimK family
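
The coordinate and length fields in this record are related by simple arithmetic. The short Python sketch below checks that relationship. It assumes the start and end coordinates are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1971630
end_bp = 1972874

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1245 414
```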


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0918709
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTCGG ACCCGGTACG GGTGGGAGTA CTGTCGCTGC ACAACAGCAA GGAGACGAAG 
GCGATCTGCA ACGCCGTCGA GGATCTGGGC CACGAGCCGG TGTGGCTCCG CGAGGAGAAC
GCGGCGATAA GCGTGGAAGA CGGCGACGTG TCGGTGGAGC CCGCCGTCGA CATCATCGCG
AACCGGCTGC TGTTGTCGAA CACGGATCAG CCCGCGGAGC TGCTCGGGCT GGCGACGACG
TTCGAGCGCA TCAGGCCGAT GTTGAACGAG CCGAACGCCG TGCTCGCGTC AATCCACAAG
TTCGCCACGG CGGCGACGCT CGCCGACTGG AACATCCGCG TGCCGGACGC GCTGCTCGCG
CTGTCGAACG ACCGGCTCAA CGAGGGGCGC GAGCGCTTCG GCGAGGTCGG AGTTTATAAG
ACGGCGATCG GCACGCACGG CGGCGGAACG TGGAAGGTCG ACCTCACCGA GCGCGTGAAC
CCGAAGGTGG GGAACCGGCA GGCGTTCCTC CAGAAGCTCA TCGACCGCGA TGACGAGGAA
CACCGCGACC TGCGCGTGTA CATCGTCGGC GACGAGATCA TCGGCTCGAT GTACCGCTAC
GCGCCCGAGG GCGACTGGCG GACCAACGTC GCGCTCGGCG GCGCCGTCGA GGACGCGACC
GACGACATGC CCGACGAAGC GGCGGAGACA GCGCTGTACT CGGCCGACGT GATGGACCTC
GACTACGCCG GCGTCGACCT CGTCGAGGGG TACGACGGCT GGTACGTCCT CGAAGTGAAC
CCGACGGCCG GGTTTAAAGG GCTGTTCGAG GCGACCGGCA CGAGCCCCGC CCCGTACATC
GCGAAGCTCG CGATCGAGGC GGTCGACGGC GAGGTCGACG ACGACGATGT CGAGCGCATC
GCGGCGACGC TCGACGACTC GCGACCCGCC TGTGCCCCGC TCCCGAAAAC GAACAGCTCC
GAGCAGCCCG ACATCGGGTA TATCGAGGAG GTCGTCGTCA CCGGCACCTC CGGGTCGACG
CAGGCGCTCG CGAAGTCGGA CACGGGCGCG ACCCGAACCA GCATCGACAC ACAGCTCGCC
GCCGAGATCG GCGCCGGGCC GATCAAGAGC ATGACGCGCG TAAAATCCGG CAGCATGAAG
GGCGGGAAGG CCCGCCCCGT CGTCGACCTC GTGATCGGTA TCGGCGGGAA CCAGCACACC
GTCACCGCCA GCGTCGAGGA CCGCGGCCAC ATGGACTATC TTTAA
 
Protein sequence
MSSDPVRVGV LSLHNSKETK AICNAVEDLG HEPVWLREEN AAISVEDGDV SVEPAVDIIA 
NRLLLSNTDQ PAELLGLATT FERIRPMLNE PNAVLASIHK FATAATLADW NIRVPDALLA
LSNDRLNEGR ERFGEVGVYK TAIGTHGGGT WKVDLTERVN PKVGNRQAFL QKLIDRDDEE
HRDLRVYIVG DEIIGSMYRY APEGDWRTNV ALGGAVEDAT DDMPDEAAET ALYSADVMDL
DYAGVDLVEG YDGWYVLEVN PTAGFKGLFE ATGTSPAPYI AKLAIEAVDG EVDDDDVERI
AATLDDSRPA CAPLPKTNSS EQPDIGYIEE VVVTGTSGST QALAKSDTGA TRTSIDTQLA
AEIGAGPIKS MTRVKSGSMK GGKARPVVDL VIGIGGNQHT VTASVEDRGH MDYL
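
The gene and protein sequences can be cross-checked against each other and against the GC content field with a few lines of Python. This is a minimal sketch, not the method the source database uses: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard set of amino-acid assignments, which translation table 11 is assumed to share (table 11 differs from the standard table in its permitted start codons).

```python
# Standard codon-to-amino-acid assignments in TCAG order.
# Assumption: translation table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGAGTTCGG ACCCGGTACG GGTGGGAGTA CTGTCGCTGC ACAACAGCAA GGAGACGAAG"
print(translate(first_line))  # MSSDPVRVGVLSLHNSKETK
```

Passing the full gene sequence to `translate` and `gc_content` should allow a comparison with the listed protein sequence and the GC content field.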