Gene Hlac_1057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1057 
Symbol 
ID7400129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1053365 
End bp1054645 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content68% 
IMG OID643708125 
Productamidohydrolase 
Protein accessionYP_002565724 
Protein GI222479487 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCACG ACGCGCGAGC CAGACTGAGC GACCTCCGGC GGACGTTCCA CCGCCACCCG 
GAGCCGGGGT GGCGCGAGTT TCAGACGACC GCCCGCGTCG TCGAGGAGCT GGAGCGAATC
GGCGTCGACG AGATTGCCGT CGGTCGCGAG GCGCTCGCGA CCGATGCGCG GATGGCCGTC
CCCGACGACG ACGAGATCCA GCCGTGGCTC GACCGCGCTC GTCGGGCCGG GGTCAGCGAC
GACCTCCTCG AACGCACTGC GGGTGGCCAC ACGGGCGTCG TCGCGACGCT CTCACAGGGC
GAGGGGCCGT GTATCGGGCT GCGCGTCGAT CTCGACGCGA TCTCGATTCA CGAATCGGAG
GAACGCGACC ACCGGCCGGA GGCGGAGGGG TTCCGCTCGG AACACGACGG GTACATGCAC
GCCTGCGGCC ACGACGCGCA CCTCGCGATC GCACTCGGGA CGCTAGAGGC GGTCAAACAG
AGCGCGTTCG AGGGAACGCT CAAGGTGCTC TTCCAGCCGG CAGAGGAGAT TTCCGGGGGC
GGCAAGGCGA TGGCCGAGAG CGGCCACCTC GACGGCGTCG ATTACCTGTT TGCGCTCCAC
GTCGGCCTCG ATCACCCAAC AGGTGAGATC GTCGCCGGCG TGGAGAGCCC GCTGGCGATG
GCACACCTGA CGGCCACGTT CGAGGGCGCG AGCGCACACG CGGGAAAGGC GCCGAACGAG
GGGGCGAACG CCATGCAGGC CGCCGCGGTC GCGATCCAGA ACGCGTACGG AATAGCCCGC
CACCGCGACG GAGCGACACG GGTGAACGTC GGCCGGATCG AAGGCGGCTC CGCGAGCAAC
GTCATCGCCG AGGAGGTGAC GATCGACGCC GAGGTCCGTG GTGAGACGAC CGCGCTGATG
ACGTACGCAC GCACCGAGCT CGAACGGATA CTGTACGCCG CCGCCGAGCT CCACGACTGT
GACGTCACGC CGCACGTGAT CAGCGAATCG CCGTGTGTCG ACAGCCACCC GGCGCTTCAA
GAGGTCGTCG GAAACGTGGC GTGGGGCGTC GACGGCGTCG AACATGTGAT CCCGTCCGAA
GAGTTCGGCG TGAGCGAAGA CGGAACCTAC CTGATGCAGC AGGTACAGGA CGCCGGCGGG
CTCGCGTCGT ACGTCCTCGT CGGGACGGAC CATCCGACGA GCCACCACAC CCCGACCTTT
GACATCGATG AAGAGAGTCT CGCGATCGGT GTCAATATTC TGTCAGAAAC GTTCGTCGAA
CTCTCGCGGC GTCGACCGTA G
 
Protein sequence
MSHDARARLS DLRRTFHRHP EPGWREFQTT ARVVEELERI GVDEIAVGRE ALATDARMAV 
PDDDEIQPWL DRARRAGVSD DLLERTAGGH TGVVATLSQG EGPCIGLRVD LDAISIHESE
ERDHRPEAEG FRSEHDGYMH ACGHDAHLAI ALGTLEAVKQ SAFEGTLKVL FQPAEEISGG
GKAMAESGHL DGVDYLFALH VGLDHPTGEI VAGVESPLAM AHLTATFEGA SAHAGKAPNE
GANAMQAAAV AIQNAYGIAR HRDGATRVNV GRIEGGSASN VIAEEVTIDA EVRGETTALM
TYARTELERI LYAAAELHDC DVTPHVISES PCVDSHPALQ EVVGNVAWGV DGVEHVIPSE
EFGVSEDGTY LMQQVQDAGG LASYVLVGTD HPTSHHTPTF DIDEESLAIG VNILSETFVE
LSRRRP