Gene Information |
Locus tag | Rsph17029_1985 |
Symbol | |
ID | 4895673 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 2103787 |
End bp | 2105064 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640112579 |
Product | cytosine deaminase |
Protein accession | YP_001043861 |
Protein GI | 126462747 |
COG category | [F] Nucleotide transport and metabolism; [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
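The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 2103787
END_BP = 2105064
GENE_LENGTH_BP = 1278
PROTEIN_LENGTH_AA = 425


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)    # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions, 2105064 - 2103787 + 1 = 1278 bp, and 1278 / 3 - 1 = 425 aa, so the reported gene and protein lengths are mutually consistent.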
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.457104 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGATC TGATCGTCAA GGGGGGCACG CTGCCCGATG GGCGCGTGGC CGATGTGGGC ATCCGGGGCG ACCGGATCGC GGCCATTGGG GCGCTCGGGA CCGACGCGGC GCGGCTGATC GAGGCCACGG GCGATCTGGT GAGCCCGGCC TTCGTCGATC CGCATTTCCA CATGGATGCG ACGCTCTCCT ACGGGCTGCC GCGCGTGAAT GCGAGCGGGA CGCTGCTGGA AGGGATCGGG CTCTGGGGCG AGCTGAAGGA GATCGTGACC GTCGAGGCCA TGGTCGAGCG GGCGCTGGCC TATTGCGACT GGGCGGCGAG CATGGGCCTT CTGGCGGTGC GGACCCATGT CGATGTCTGC GACGACCGGC TGCTGGGCGT CGAGGCGATG CTCGCCGTGC GCGAAAAGGT CAAGGGCTGG ATGGATCTCC AGCTCGTGGC GTTCCCGCAG GACGGGCTCT ACCGCGACCC GACCGCGCGG GCGAACCTCT TGCGCGCGCT CGACATGGGC GTGGATGTGG TGGGGGGCAT CCCGCATTTC GAGCGGACGA TGGCGGACGG CGCGGCCTCG GTGCGCGACC TCTGCGAGAT CGCGGCCGAC CGCGGGCTGC CGATCGATTT CCACTGCGAC GAGACCGACG ATCCGCTGAG CCGCCATATC GAGACCTATG CCGCCGAGGT GCTGCGCACG GGGCTTCAGG GCCGTGCCGC AGCGGGGCAC CTGACCTCGA TGCATTCGAT GGACAATTAC TATGTCTCGA AGCTTCTGCC GCTGATCGCC GAGGCCGGGA TCGCCGCCAT CCCGAACCCG CTCATCAACA TCGTGCTGCA GGGCCGCCAC GACAGTTTCC CGAAGCGCCG CGGGCTCACG CGGATCAAGG AGATGCAGGC CATGGGCATC ACGGTGGGCT GGGGGCAGGA TTGCGTGCTC GACCCGTGGT ATTCGCTGGG CACCGCCGAC ATGCTCGACG TGGCCTTCAT GGGGCTGCAT GTGGCGCAGA TGACCCATCC CGACGAGATG CGGCGCTGCT TCGACATGGT GACGGTCGAG AATGCCAGGA TCATGGGCCT CGACTACGGG CTGCGGGAGG GGGCGGTGGC CTCGCTCGTG GTGCTCGACG CGGGCCATCC GGTCGAGGCG CTGCGGCTCC GGCCCGACCG GCTCTGCGTG ATCGCGAAGG GGCGGGTGGT GTCGGAGAAG GCGCGCAACG ACGCGCGCCT GAGCCTGCCG GGGAGGCCGG AGACCGTGCG CCGTCGCCAC CTCCTGCCCC AGCGCTGA
|
Protein sequence | MFDLIVKGGT LPDGRVADVG IRGDRIAAIG ALGTDAARLI EATGDLVSPA FVDPHFHMDA TLSYGLPRVN ASGTLLEGIG LWGELKEIVT VEAMVERALA YCDWAASMGL LAVRTHVDVC DDRLLGVEAM LAVREKVKGW MDLQLVAFPQ DGLYRDPTAR ANLLRALDMG VDVVGGIPHF ERTMADGAAS VRDLCEIAAD RGLPIDFHCD ETDDPLSRHI ETYAAEVLRT GLQGRAAAGH LTSMHSMDNY YVSKLLPLIA EAGIAAIPNP LINIVLQGRH DSFPKRRGLT RIKEMQAMGI TVGWGQDCVL DPWYSLGTAD MLDVAFMGLH VAQMTHPDEM RRCFDMVTVE NARIMGLDYG LREGAVASLV VLDAGHPVEA LRLRPDRLCV IAKGRVVSEK ARNDARLSLP GRPETVRRRH LLPQR
|
| |
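The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the pipeline used to produce the record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for elongation codons (table 11 differs mainly in permitting alternative start codons, which does not matter here because this gene starts with ATG). Only the first three and last two 10-base blocks of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 18 bases of the gene sequence in the record.
first_blocks = "ATGTTCGATC TGATCGTCAA GGGGGGCACG"
last_blocks = "CTCCTGCCCC AGCGCTGA"

print(translate(first_blocks))  # MFDLIVKGGT
print(translate(last_blocks))   # LLPQR
```

The two fragments translate to the first ten and last five residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give the value to compare against the reported GC content field.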