Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_1057 |
Symbol | |
ID | 7400129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 1053365 |
End bp | 1054645 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643708125 |
Product | amidohydrolase |
Protein accession | YP_002565724 |
Protein GI | 222479487 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCACG ACGCGCGAGC CAGACTGAGC GACCTCCGGC GGACGTTCCA CCGCCACCCG GAGCCGGGGT GGCGCGAGTT TCAGACGACC GCCCGCGTCG TCGAGGAGCT GGAGCGAATC GGCGTCGACG AGATTGCCGT CGGTCGCGAG GCGCTCGCGA CCGATGCGCG GATGGCCGTC CCCGACGACG ACGAGATCCA GCCGTGGCTC GACCGCGCTC GTCGGGCCGG GGTCAGCGAC GACCTCCTCG AACGCACTGC GGGTGGCCAC ACGGGCGTCG TCGCGACGCT CTCACAGGGC GAGGGGCCGT GTATCGGGCT GCGCGTCGAT CTCGACGCGA TCTCGATTCA CGAATCGGAG GAACGCGACC ACCGGCCGGA GGCGGAGGGG TTCCGCTCGG AACACGACGG GTACATGCAC GCCTGCGGCC ACGACGCGCA CCTCGCGATC GCACTCGGGA CGCTAGAGGC GGTCAAACAG AGCGCGTTCG AGGGAACGCT CAAGGTGCTC TTCCAGCCGG CAGAGGAGAT TTCCGGGGGC GGCAAGGCGA TGGCCGAGAG CGGCCACCTC GACGGCGTCG ATTACCTGTT TGCGCTCCAC GTCGGCCTCG ATCACCCAAC AGGTGAGATC GTCGCCGGCG TGGAGAGCCC GCTGGCGATG GCACACCTGA CGGCCACGTT CGAGGGCGCG AGCGCACACG CGGGAAAGGC GCCGAACGAG GGGGCGAACG CCATGCAGGC CGCCGCGGTC GCGATCCAGA ACGCGTACGG AATAGCCCGC CACCGCGACG GAGCGACACG GGTGAACGTC GGCCGGATCG AAGGCGGCTC CGCGAGCAAC GTCATCGCCG AGGAGGTGAC GATCGACGCC GAGGTCCGTG GTGAGACGAC CGCGCTGATG ACGTACGCAC GCACCGAGCT CGAACGGATA CTGTACGCCG CCGCCGAGCT CCACGACTGT GACGTCACGC CGCACGTGAT CAGCGAATCG CCGTGTGTCG ACAGCCACCC GGCGCTTCAA GAGGTCGTCG GAAACGTGGC GTGGGGCGTC GACGGCGTCG AACATGTGAT CCCGTCCGAA GAGTTCGGCG TGAGCGAAGA CGGAACCTAC CTGATGCAGC AGGTACAGGA CGCCGGCGGG CTCGCGTCGT ACGTCCTCGT CGGGACGGAC CATCCGACGA GCCACCACAC CCCGACCTTT GACATCGATG AAGAGAGTCT CGCGATCGGT GTCAATATTC TGTCAGAAAC GTTCGTCGAA CTCTCGCGGC GTCGACCGTA G
|
Protein sequence | MSHDARARLS DLRRTFHRHP EPGWREFQTT ARVVEELERI GVDEIAVGRE ALATDARMAV PDDDEIQPWL DRARRAGVSD DLLERTAGGH TGVVATLSQG EGPCIGLRVD LDAISIHESE ERDHRPEAEG FRSEHDGYMH ACGHDAHLAI ALGTLEAVKQ SAFEGTLKVL FQPAEEISGG GKAMAESGHL DGVDYLFALH VGLDHPTGEI VAGVESPLAM AHLTATFEGA SAHAGKAPNE GANAMQAAAV AIQNAYGIAR HRDGATRVNV GRIEGGSASN VIAEEVTIDA EVRGETTALM TYARTELERI LYAAAELHDC DVTPHVISES PCVDSHPALQ EVVGNVAWGV DGVEHVIPSE EFGVSEDGTY LMQQVQDAGG LASYVLVGTD HPTSHHTPTF DIDEESLAIG VNILSETFVE LSRRRP
|
| |