Gene Hhal_0574 details


Gene Information

Locus tag: Hhal_0574
Symbol:
ID: 4709642
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 652045
End bp: 653370
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 70%
IMG OID: 639855032
Product: N-ethylammeline chlorohydrolase
Protein accession: YP_001002162
Protein GI: 121997375
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
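
The two length fields above can be cross-checked against the coordinates. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the record does not state either convention). The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(652045, 653370)
print(length)                     # 1326
print(protein_length_aa(length))  # 441
```

Both values agree with the Gene Length and Protein Length fields of this record.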


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid coverage values are all n/a for this gene:
Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACGTG TCGACACTCT GATCCACGCC CGCTGGGTGA TCCCGGTCGA GCCGACCGGG 
CACGTGCTTG AGGACCACGG GGTCGCGCTA CGCGACGGGC GCATCGTGGC CGTGGCCGGT
AACGACGACC TGCGCCGAGC CTACCAGGCC GACGAGCAAC GCGAACTCGG CGAGCACGTC
CTCATCCCGG GGCTGATCAA CGCCCACACG CACACCGCCA TGACCCTGCT GCGCGGCATG
GCCGACGACC TGCCGCTGAT GACCTGGCTG ACCGAGCACA TCTGGCCAGC CGAGCAGCGC
TGGGTGAGCG AGGCCTTCGT GCGCGACGGC AGCACCCTGG CCATGGGCGA GATGCTCCGC
GGCGGGGTGA CCTGCTTCAA CGACATGTAC TTCTACCCCG AAGTCACCGG CGAGGCCGCC
CGCCAGGTGG GCATGCGGGC ACTGCTCGGG ATGATCGTCA TCGGCGTCCC CAGTGGCTAC
GCGCAGAGCC TGGACGAGTA CCTGGAGAAG GGGCTGGCGC TCCACGAGCA GTTCCGCGAC
GACCCTTTGG TGCGCACCCT GTTCGCGCCC CACTCGCCCT ACACCGTAGA CGACAGCTTC
CTCGGCCGGA TCGGCGAGCA CGCCGAGCGG CTGGACGTGC CCATCCACAT CCACGTCCAG
GAGACTGCCG ACGAGGTCCA GCAGAGCCTG CGTGAGACCG GCAAGCGGCC CCTGCAGCGG
CTGGATGAGG TCGGTCTGGT CTCGCCCCGG CTGCTCGCCG TCCACGCCAC CCAGCTCGAA
TCCGCCGAGA TCGAGCGCCT GGCTGCCGCG GGTGCCCACG TGCTGCATTG CCCGGAGGCC
AACCTCAAGC TGGCCAGCGG CTTCTGCCCT GCCGCGGCGC TCACCCGGGC CGGGGTCAAT
GTAGCCCTGG GCACCGATGG CGTGGCCAGC AACAACGATC TCGATCTGAT CGGCGAGATG
CGCACCGCGG CCCTGCTGGC CAAGGCGGTC TCCGGCGACG CCGCTGCCCT CCCCGCCGAG
CAGGCCCTGG CTATGGTCAC CATCAACGCG GCACGGGCCT TCGGCCTGGA CGACGAGATC
GGCTCCATCG TCCCCGGCAA GGCCGCCGAC CTGACCGCCA TCTCGTTGGC GGACCTCAAC
CAGCACCCGA TCTACAACCC GCTCTCGCAG CTGGTCTACG CCGCCAACCG CCAGCACGTC
ACTGACGTCT GGGTGGGCGG TCAGCCCCGG GTGCGCAACG GCCAGCTGAC CACGCTGGAT
ACCGCCGAGA CCATTGCCCG TGCCGAGCAG TGGCGGGAGC GGATCGCCGC AGAACGGGCG
CAATAG
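
The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before any computation. The following is a small Python sketch of that step plus a GC fraction calculation. The helper names are hypothetical, and the page does not say how its GC content figure was derived or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record.
first_line = "ATGGAACGTG TCGACACTCT GATCCACGCC CGCTGGGTGA TCCCGGTCGA GCCGACCGGG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC percentage of this first line only
```

Running the same helpers on all lines of the gene sequence gives a value that can be compared with the GC content field.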
 
Protein sequence
MERVDTLIHA RWVIPVEPTG HVLEDHGVAL RDGRIVAVAG NDDLRRAYQA DEQRELGEHV 
LIPGLINAHT HTAMTLLRGM ADDLPLMTWL TEHIWPAEQR WVSEAFVRDG STLAMGEMLR
GGVTCFNDMY FYPEVTGEAA RQVGMRALLG MIVIGVPSGY AQSLDEYLEK GLALHEQFRD
DPLVRTLFAP HSPYTVDDSF LGRIGEHAER LDVPIHIHVQ ETADEVQQSL RETGKRPLQR
LDEVGLVSPR LLAVHATQLE SAEIERLAAA GAHVLHCPEA NLKLASGFCP AAALTRAGVN
VALGTDGVAS NNDLDLIGEM RTAALLAKAV SGDAAALPAE QALAMVTINA ARAFGLDDEI
GSIVPGKAAD LTAISLADLN QHPIYNPLSQ LVYAANRQHV TDVWVGGQPR VRNGQLTTLD
TAETIARAEQ WRERIAAERA Q
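
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal Python sketch, assuming the forward strand as printed. It uses the standard codon assignments, which translation table 11 shares. Table 11 differs in its set of permitted start codons, which this sketch does not model (the gene here starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
print(translate("ATGGAACGTG TCGACACTCT GATCCACGCC"))  # MERVDTLIHA
```

The output matches the first 10 residues of the protein sequence above.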