Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0574 |
Symbol | |
ID | 4709642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 652045 |
End bp | 653370 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639855032 |
Product | N-ethylammeline chlorohydrolase |
Protein accession | YP_001002162 |
Protein GI | 121997375 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACGTG TCGACACTCT GATCCACGCC CGCTGGGTGA TCCCGGTCGA GCCGACCGGG CACGTGCTTG AGGACCACGG GGTCGCGCTA CGCGACGGGC GCATCGTGGC CGTGGCCGGT AACGACGACC TGCGCCGAGC CTACCAGGCC GACGAGCAAC GCGAACTCGG CGAGCACGTC CTCATCCCGG GGCTGATCAA CGCCCACACG CACACCGCCA TGACCCTGCT GCGCGGCATG GCCGACGACC TGCCGCTGAT GACCTGGCTG ACCGAGCACA TCTGGCCAGC CGAGCAGCGC TGGGTGAGCG AGGCCTTCGT GCGCGACGGC AGCACCCTGG CCATGGGCGA GATGCTCCGC GGCGGGGTGA CCTGCTTCAA CGACATGTAC TTCTACCCCG AAGTCACCGG CGAGGCCGCC CGCCAGGTGG GCATGCGGGC ACTGCTCGGG ATGATCGTCA TCGGCGTCCC CAGTGGCTAC GCGCAGAGCC TGGACGAGTA CCTGGAGAAG GGGCTGGCGC TCCACGAGCA GTTCCGCGAC GACCCTTTGG TGCGCACCCT GTTCGCGCCC CACTCGCCCT ACACCGTAGA CGACAGCTTC CTCGGCCGGA TCGGCGAGCA CGCCGAGCGG CTGGACGTGC CCATCCACAT CCACGTCCAG GAGACTGCCG ACGAGGTCCA GCAGAGCCTG CGTGAGACCG GCAAGCGGCC CCTGCAGCGG CTGGATGAGG TCGGTCTGGT CTCGCCCCGG CTGCTCGCCG TCCACGCCAC CCAGCTCGAA TCCGCCGAGA TCGAGCGCCT GGCTGCCGCG GGTGCCCACG TGCTGCATTG CCCGGAGGCC AACCTCAAGC TGGCCAGCGG CTTCTGCCCT GCCGCGGCGC TCACCCGGGC CGGGGTCAAT GTAGCCCTGG GCACCGATGG CGTGGCCAGC AACAACGATC TCGATCTGAT CGGCGAGATG CGCACCGCGG CCCTGCTGGC CAAGGCGGTC TCCGGCGACG CCGCTGCCCT CCCCGCCGAG CAGGCCCTGG CTATGGTCAC CATCAACGCG GCACGGGCCT TCGGCCTGGA CGACGAGATC GGCTCCATCG TCCCCGGCAA GGCCGCCGAC CTGACCGCCA TCTCGTTGGC GGACCTCAAC CAGCACCCGA TCTACAACCC GCTCTCGCAG CTGGTCTACG CCGCCAACCG CCAGCACGTC ACTGACGTCT GGGTGGGCGG TCAGCCCCGG GTGCGCAACG GCCAGCTGAC CACGCTGGAT ACCGCCGAGA CCATTGCCCG TGCCGAGCAG TGGCGGGAGC GGATCGCCGC AGAACGGGCG CAATAG
|
Protein sequence | MERVDTLIHA RWVIPVEPTG HVLEDHGVAL RDGRIVAVAG NDDLRRAYQA DEQRELGEHV LIPGLINAHT HTAMTLLRGM ADDLPLMTWL TEHIWPAEQR WVSEAFVRDG STLAMGEMLR GGVTCFNDMY FYPEVTGEAA RQVGMRALLG MIVIGVPSGY AQSLDEYLEK GLALHEQFRD DPLVRTLFAP HSPYTVDDSF LGRIGEHAER LDVPIHIHVQ ETADEVQQSL RETGKRPLQR LDEVGLVSPR LLAVHATQLE SAEIERLAAA GAHVLHCPEA NLKLASGFCP AAALTRAGVN VALGTDGVAS NNDLDLIGEM RTAALLAKAV SGDAAALPAE QALAMVTINA ARAFGLDDEI GSIVPGKAAD LTAISLADLN QHPIYNPLSQ LVYAANRQHV TDVWVGGQPR VRNGQLTTLD TAETIARAEQ WRERIAAERA Q
|
| |