Gene Csal_1786 details


Gene Information

Locus tag: Csal_1786
Symbol:
ID: 4028585
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 2031634
End bp: 2032995
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 67%
IMG OID: 637966974
Product: guanine deaminase
Protein accession: YP_573837
Protein GI: 92113909
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID: [TIGR02967] guanine deaminase
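
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2031634, 2032995)
print(length_bp)                  # 1362
print(protein_length(length_bp))  # 453
```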


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGACT CCCGACTCCT GCGCGGTGCC GTGCTGACCT TCGACGACGA TCCCGGCGAG 
TCCCCCGTGC CACGCCCGGA CAGTCTGCGC TACTGGGAGG ACGGCGCCGT CTGGCTGGAG
AACGGCCATA TCCGCGCCGT CGATGACTAC ACCACGCTGG CACCGCACGT GCCGGCAGGG
CTCGAGATCG TCGACTACCG TGGCAAATTG ATCATGCCCG GCTTCATCGA CAGCCATGTG
CATTATTCGC AGCTCGACAT CATCGCCTCG TTCGGACGCG AACTGCTCGA CTGGCTCAAC
GACTACACCT TTCCCGCCGA ATGTCGCTTC GCCGAACGGG CGCATGCCGA GGAGGTCGCC
GAGCGATTCC TTGATGAACT CCTGCGCGGC GGCACCACCA CCGCCCAGGT GTTCTGCACC
TCGCATCCCG GCTCGGTGGA CAGCATCTTC TCCGCGGCCC GAGCCCGCCG ACTGCGAATG
CTGGCCGGCA AGGTACTGAT GGATCGCCAT GCCCCCGAGG CCCTGATCGA CACCGCCGTC
GGCGGCATCC GCGACAGCGA ACGGCTGATC GCCGACTGGC ACGGCAAGAA CCGTCTGGCG
TATTCGCTGA CACCCCGCTT CGCGCCGACA TCCAGCCGCG AGCAACTGGA TGCCGTGGGC
GGCGTGCTGC GCAACGATGC CAGCCTGTAT CTGCAAAGCC ACCTCTCGGA ACACCGTGGC
GAACTGGCCT GGGTCGCCGA GCTGTTTCCC GAATGCCGCG ACTATCTCGC CGTCTACGAA
CGCCATGGCC TGGTCGGTCC GCGCAGCACC TATGCCCACG GCATCCATCT TTCCGACGAC
GAACGCGCAC GACTCGCCGA GACCGGCGCC AACATCGCCT TTTCACCGAC CTCCAACCTG
TTTTTGGGCA GCGGGCTCTT CGACCGCATC GCCACACGCG AAGCGGGCGT GGTCACCTCC
CTGGCCAGCG ACGTGGGCGC TGGCACCGGC CTGTGCGGCT TGACGACCCT GCAAGGCGCC
TATCAGGTGG GCGCCTTGCT CGGCCAGCCG CTGACGGCAT GGCAAGGGTT CTATCGGCTC
ACGCTGGGCA ACGCCCGTGC CCTGCATCTG GAACATTGCA TCGGCCGCCT CGAGGCCGGC
CACGAAGCCG ACCTGGTCGT GCTGGACCTC GCCGCCACCC CCCTCATGGC ACGGCGAACC
CAGGTCGCCG AAACGCTCGG CGAGCGCCTT TTCGCGCTGA TGATGCTGGG TGACGACCGC
AGCGTCCACG CCACCTGGGC CAGCGGCCGG CCGGTGCACC AGCGTGATGC AAGCGATACG
CACGCCGCCC CCTCGAGGCG CATGGCACAT TCCCCCACAT GA
 
Protein sequence
MTDSRLLRGA VLTFDDDPGE SPVPRPDSLR YWEDGAVWLE NGHIRAVDDY TTLAPHVPAG 
LEIVDYRGKL IMPGFIDSHV HYSQLDIIAS FGRELLDWLN DYTFPAECRF AERAHAEEVA
ERFLDELLRG GTTTAQVFCT SHPGSVDSIF SAARARRLRM LAGKVLMDRH APEALIDTAV
GGIRDSERLI ADWHGKNRLA YSLTPRFAPT SSREQLDAVG GVLRNDASLY LQSHLSEHRG
ELAWVAELFP ECRDYLAVYE RHGLVGPRST YAHGIHLSDD ERARLAETGA NIAFSPTSNL
FLGSGLFDRI ATREAGVVTS LASDVGAGTG LCGLTTLQGA YQVGALLGQP LTAWQGFYRL
TLGNARALHL EHCIGRLEAG HEADLVVLDL AATPLMARRT QVAETLGERL FALMMLGDDR
SVHATWASGR PVHQRDASDT HAAPSRRMAH SPT
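
The gene and protein sequences above are related by translation table 11, and the GC content field is a simple base count. The sketch below is one way to reproduce both from the space-separated blocks as printed; it is an illustration, not the database's own method, and the helper names (`clean`, `gc_content`, `translate`) are hypothetical. Table 11 assigns the same amino acids to codons as the standard code; its extra start codons are not handled here, which does not matter for this gene because it begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino-acid assignments
# for translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as displayed in the record.
first_line = "ATGACCGACT CCCGACTCCT GCGCGGTGCC GTGCTGACCT TCGACGACGA TCCCGGCGAG"
print(translate(first_line))  # MTDSRLLRGAVLTFDDDPGE
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows a comparison with the protein sequence and the GC content listed in the record.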