Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_2243 |
Symbol | |
ID | 7399953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 2231002 |
End bp | 2231901 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643709317 |
Product | fumarylacetoacetate (FAA) hydrolase |
Protein accession | YP_002566890 |
Protein GI | 222480653 |
COG category | [R] General function prediction only |
COG ID | [COG3970] Fumarylacetoacetate (FAA) hydrolase family protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0481778 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.293361 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGTCTG CGCATCAACA GGGTATGCGG TATTACCGAC TCCCCGGCGG AGAGCACAGC GAGGGGTCCG GCTCGCTCGT CGTCGTCGAC GACGGCGACG CCTACGACCT GTCAACGGCA TCAGACGATC TCGGGTCGTT CACCCAGCTC GCTCGCGCCG CGAACGCGTG CGACGAGTCC ATCGATTCCA TCGCTCGGGA CCGGATCTCC GACTCGGAGT CGGTGGCGTT CGACGACGAC GACGTACTCC TGCCGGTGAC GGCCGACGAG GTGTGGGCCG CGGGGGTCAC GTACCAGATC AGCGAACAGG CGCGCGAGGC CGAGAGCGGC AAGCCCGAGG TGTACATCGA CGTCTACGAC AGCGAGCGCC CGGAGCTGTT CTTGAAGGCG ACGCCCTCGC GGACAGTCGG CCCCAACGAG GCCATCGGGA TCCGCGGCGA CTCCACGTGG GACGTGCCGG AGCCGGAGCT CGGCGTCGTG CTCCACCGCG AAGAGGTCGT TGGCTACACG ATCGGCAACG ACGTATCGAG CCGAGCCATC GAAGGCGAGA ACCCGCTGTA CCTCCCGCAG GCGAAGGTGT ACGACCGATG CTGTTCGGTC GGCCCGTGCG TGGCGACAGC GGATGTCGTC GACGATCCCC ACGACCTCGA GATGGCGCTC AGCATCGAGC GCGACGGCGA GGTCGTCTTC GAGGACTCGA CGTCGACGAA CGAGATGGCG ACCACCTGCG AAAACCTCGT TTCGTACCTC CGGCGGCACA ACAACCTCCC GGAGACCGTC GTCCTGCTGA CCGGCACCGC ACTCGTGCCG CCGGAGTCGT TCACGCTCAC CGAGGGTGAC CAGGTCACGA TCGACATCGA CAAGATCGGA CAGCTCGTCA ACGACACTAT CGTCGTCTGA
|
Protein sequence | MASAHQQGMR YYRLPGGEHS EGSGSLVVVD DGDAYDLSTA SDDLGSFTQL ARAANACDES IDSIARDRIS DSESVAFDDD DVLLPVTADE VWAAGVTYQI SEQAREAESG KPEVYIDVYD SERPELFLKA TPSRTVGPNE AIGIRGDSTW DVPEPELGVV LHREEVVGYT IGNDVSSRAI EGENPLYLPQ AKVYDRCCSV GPCVATADVV DDPHDLEMAL SIERDGEVVF EDSTSTNEMA TTCENLVSYL RRHNNLPETV VLLTGTALVP PESFTLTEGD QVTIDIDKIG QLVNDTIVV
|
| |