Gene Hlac_1973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1973 
Symbol 
ID7399925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1970495 
End bp1971532 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content66% 
IMG OID643709044 
ProductSuccinylglutamate desuccinylase/aspartoacylase 
Protein accessionYP_002566621 
Protein GI222480384 
COG category[R] General function prediction only 
COG ID[COG3608] Predicted deacylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0907521 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACG AACGGGTGTT CACGTACAAC GGCGGCGCGG TACCGCCGGG CGAGACGCAG 
AACATCCGCT ACGGCATCAG CGAGACGTAC CTCGGCGACC CGGTTCGGAT CCCCGTGACG
ATCGTCAACG GCGAGCGCGA CGGGCCGACA GCGTTCCTCA TGGCGGCCGC CCACGGCGAC
GAGCTCAACG GTATCGAGGT CGTCCGCGAG GTCGCCCACG AGTGGGACCT CTCGAAGCTC
GCTGGCACCC TCGTCTGTCT CCCAGTGCTC AACGTTCCGG GGTTCCTCGC CCAACAGCGC
TACCTCCCCG TCTACGACCG CGACCTGAAT CGGTCGTTCC CCGGGAAGGC CGGCTCGACC
AGCTCGAAGC GGATGGCGAA TCAGATCTAC TCGAACTTCA TCGCGCCCTG TGATTTCGGG
CTCGACTTCC ACACTTCCAC CCGCGGTCGA ACGAACATGC TCCACGTCCG CGGCGACATG
ACCGACGACG GCGTTCACCG CCTCGCGTTG GCCTTCGGCT CGAAGGTGGT CATCGACAGC
GACGGACCGA GCGGCACCCT CCGCGGCGAG GCGACCGCCG ACGGGATTCC CACGATCACG
ATCGAGATGG GCGAGGCGCA CCGGTTCCAG CGCCCGCTTA TCGACGACGC GCTCGCGGGG
GTACGCTCCG TCTTCGCCGA GTACAGCCTC TTAGATACCG ATACGGTGCG TTGGCCCGGC
TGGCGGACGA TCGTCGCCGG TACGGGCGAG AAGACGTGGC TCCGGGCAGA CTCCGGCGGG
ATCGTCGACA CCCACTTCGA GAGCGGCTCA CTCGTTCACG AGGGCCAGCG GATCGCGACG
ATCACCAACC CGTTCAAGAA AGACGAGGTC GTGGTCGAAG CACCCTTCAC CGGCCTGCTG
ATCGGCCTCC TAGAGAACCC GGTCGTTTAC CCCGGGAATC CGCTGTGTCA CCTCGTCGAG
ATCGATGAAT CGACTCGGCG AGCGATCGAA GCCGGTGACG CCCCGGAGCC CGTCGGACAG
CCGAACGCAG CGGAGTGA
 
Protein sequence
MSDERVFTYN GGAVPPGETQ NIRYGISETY LGDPVRIPVT IVNGERDGPT AFLMAAAHGD 
ELNGIEVVRE VAHEWDLSKL AGTLVCLPVL NVPGFLAQQR YLPVYDRDLN RSFPGKAGST
SSKRMANQIY SNFIAPCDFG LDFHTSTRGR TNMLHVRGDM TDDGVHRLAL AFGSKVVIDS
DGPSGTLRGE ATADGIPTIT IEMGEAHRFQ RPLIDDALAG VRSVFAEYSL LDTDTVRWPG
WRTIVAGTGE KTWLRADSGG IVDTHFESGS LVHEGQRIAT ITNPFKKDEV VVEAPFTGLL
IGLLENPVVY PGNPLCHLVE IDESTRRAIE AGDAPEPVGQ PNAAE