Gene Rsph17029_1985 details


Gene Information

Locus tag: Rsph17029_1985
Symbol:
ID: 4895673
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2103787
End bp: 2105064
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 69%
IMG OID: 640112579
Product: cytosine deaminase
Protein accession: YP_001043861
Protein GI: 126462747
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
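
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; both assumptions are consistent with the listed values but are not stated in the record. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2103787, 2105064)
print(length)                     # 1278, matching "Gene Length"
print(protein_length_aa(length))  # 425, matching "Protein Length"
```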


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.457104
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGATC TGATCGTCAA GGGGGGCACG CTGCCCGATG GGCGCGTGGC CGATGTGGGC 
ATCCGGGGCG ACCGGATCGC GGCCATTGGG GCGCTCGGGA CCGACGCGGC GCGGCTGATC
GAGGCCACGG GCGATCTGGT GAGCCCGGCC TTCGTCGATC CGCATTTCCA CATGGATGCG
ACGCTCTCCT ACGGGCTGCC GCGCGTGAAT GCGAGCGGGA CGCTGCTGGA AGGGATCGGG
CTCTGGGGCG AGCTGAAGGA GATCGTGACC GTCGAGGCCA TGGTCGAGCG GGCGCTGGCC
TATTGCGACT GGGCGGCGAG CATGGGCCTT CTGGCGGTGC GGACCCATGT CGATGTCTGC
GACGACCGGC TGCTGGGCGT CGAGGCGATG CTCGCCGTGC GCGAAAAGGT CAAGGGCTGG
ATGGATCTCC AGCTCGTGGC GTTCCCGCAG GACGGGCTCT ACCGCGACCC GACCGCGCGG
GCGAACCTCT TGCGCGCGCT CGACATGGGC GTGGATGTGG TGGGGGGCAT CCCGCATTTC
GAGCGGACGA TGGCGGACGG CGCGGCCTCG GTGCGCGACC TCTGCGAGAT CGCGGCCGAC
CGCGGGCTGC CGATCGATTT CCACTGCGAC GAGACCGACG ATCCGCTGAG CCGCCATATC
GAGACCTATG CCGCCGAGGT GCTGCGCACG GGGCTTCAGG GCCGTGCCGC AGCGGGGCAC
CTGACCTCGA TGCATTCGAT GGACAATTAC TATGTCTCGA AGCTTCTGCC GCTGATCGCC
GAGGCCGGGA TCGCCGCCAT CCCGAACCCG CTCATCAACA TCGTGCTGCA GGGCCGCCAC
GACAGTTTCC CGAAGCGCCG CGGGCTCACG CGGATCAAGG AGATGCAGGC CATGGGCATC
ACGGTGGGCT GGGGGCAGGA TTGCGTGCTC GACCCGTGGT ATTCGCTGGG CACCGCCGAC
ATGCTCGACG TGGCCTTCAT GGGGCTGCAT GTGGCGCAGA TGACCCATCC CGACGAGATG
CGGCGCTGCT TCGACATGGT GACGGTCGAG AATGCCAGGA TCATGGGCCT CGACTACGGG
CTGCGGGAGG GGGCGGTGGC CTCGCTCGTG GTGCTCGACG CGGGCCATCC GGTCGAGGCG
CTGCGGCTCC GGCCCGACCG GCTCTGCGTG ATCGCGAAGG GGCGGGTGGT GTCGGAGAAG
GCGCGCAACG ACGCGCGCCT GAGCCTGCCG GGGAGGCCGG AGACCGTGCG CCGTCGCCAC
CTCCTGCCCC AGCGCTGA
 
Protein sequence
MFDLIVKGGT LPDGRVADVG IRGDRIAAIG ALGTDAARLI EATGDLVSPA FVDPHFHMDA 
TLSYGLPRVN ASGTLLEGIG LWGELKEIVT VEAMVERALA YCDWAASMGL LAVRTHVDVC
DDRLLGVEAM LAVREKVKGW MDLQLVAFPQ DGLYRDPTAR ANLLRALDMG VDVVGGIPHF
ERTMADGAAS VRDLCEIAAD RGLPIDFHCD ETDDPLSRHI ETYAAEVLRT GLQGRAAAGH
LTSMHSMDNY YVSKLLPLIA EAGIAAIPNP LINIVLQGRH DSFPKRRGLT RIKEMQAMGI
TVGWGQDCVL DPWYSLGTAD MLDVAFMGLH VAQMTHPDEM RRCFDMVTVE NARIMGLDYG
LREGAVASLV VLDAGHPVEA LRLRPDRLCV IAKGRVVSEK ARNDARLSLP GRPETVRRRH
LLPQR
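
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any computation such as GC content. The following is a minimal sketch of that cleanup; the helper names are hypothetical, and the demo input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace and uppercasing."""
    return "".join(block_text.split()).upper()


def gc_percent(block_text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) DNA sequence."""
    seq = clean_sequence(block_text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used only as a small demo input.
demo = "ATGTTCGATC TGATCGTCAA"
print(clean_sequence(demo))  # ATGTTCGATCTGATCGTCAA
print(gc_percent(demo))      # 40.0
```

Passing the full gene sequence text to `gc_percent` gives a value that can be compared with the "GC content" field in the Gene Information section.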