Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A2788 |
Symbol | |
ID | 3836228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 3222171 |
End bp | 3223493 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637826899 |
Product | cytosine deaminase |
Protein accession | YP_427872 |
Protein GI | 83594120 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.22354 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACTT TTCCCCCCGC CCAGGACTTG ATCGTCCGCA ACGCCAGCCT GCCCGATGGC CGCCGCGCTC AGGACATCCT GATCCTGGCG GGCACGATCA AAGCCATCGG CCCGGCCCTG GACGCGCCCG AGGCCACCCC GGTTCTTGAC GCCAAAGGCG ATCTGGTCGC CCCGCCCTTC GTCGATGCCC ATTTCCATAT GGACTCGGCA CTGTCTTACG GCCTGCCGCG GATCAACGCC TCGGGCACCT TGCTCGAAGG CATCGCCCTG TGGGGGGAAC TCAAGCCCGA GTTGACCCAT CAGGCGATTG TTGATCGGGC GCTTGCCTAT TGCGACTGGG CGATCGGTCG CGGCCTGCTG GCCATCCGCA GCCATGTCGA TATCTGCGAT CCCCGCCTGC TCGCCGTCGA CGCCCTGCTC GAAGTCCGCG AAAAAGTCGC CCCCTGGCTG ACCTTGCAGC TTGTCGCCTT CCCCCAGGAT GGCTTCTTGC GCGCCCAAGG CAGCCGCGAG CGCCTGATCG CCGCCCTGGA TCGCGGCGTC GATGTGGTCG GCGGCATCCC GCATTTCGAA CGCACGATGG CTCAAGGCGC CGAAAGCGTG CGCGCGCTGT GCGAGATCGC CGCCGAGCGC GGCTTGCGCG TGGATATGCA TTGCGATGAA AGCGACGACC CGATGAGCCG CCACGTCGAG ACCCTGGCGG CGGAAACCAC CCGCCTCGGT CTGGAAGGAC GGGTCACCGG CTCGCACCTG ACCTCGATGC ATTCGATGGA CAGCTACTAC GTCTCGAAAC TGATCGCCCT GATGGCCGAG GCCGAGCTGG GGGTGATCGC CAATCCGCTG ATCAACATCA CCCTTCAGGG CCGCCACGAC GGCTATCCCA AGCGCCGGGG GATGACCCGG GTGCCCGAAT TGCTCGCCGC CGGTCTGACC GTGGCCTTCG GCCATGACTG CGTGATGGAC CCCTGGTATG GCCTGGGCAG CGCCGACATG CTTGAAGTCG CCGCCATGGG CCTTCACGTC GCCCAGATGA CCGGCCAGTC GGCGATGGCC GAGTGCTTCG CCGCCGTCAC CACGGCGCCG GCCAAGCTGA TGGGGCTCGA CGATTACGGC CTGATGCCCG GCTGTCGGGG CGATCTGGTG CTGCTGCAGG CCGGCGATCC GGTGGAGGCC CTGCGTCTGC GCGCGACCCG GCTGGCCGTG GTGCGCGGCG GCCGGATCAT CGCCCGCACC CCGGCGGCCA CCGCCACCTT GGATCTGGGC GACGGCGACC GCCCCACCCT AAACCCGGTG TCCTGGCGTG GCGGCGAGCG CCCGGGGTTC TAA
|
Protein sequence | MTTFPPAQDL IVRNASLPDG RRAQDILILA GTIKAIGPAL DAPEATPVLD AKGDLVAPPF VDAHFHMDSA LSYGLPRINA SGTLLEGIAL WGELKPELTH QAIVDRALAY CDWAIGRGLL AIRSHVDICD PRLLAVDALL EVREKVAPWL TLQLVAFPQD GFLRAQGSRE RLIAALDRGV DVVGGIPHFE RTMAQGAESV RALCEIAAER GLRVDMHCDE SDDPMSRHVE TLAAETTRLG LEGRVTGSHL TSMHSMDSYY VSKLIALMAE AELGVIANPL INITLQGRHD GYPKRRGMTR VPELLAAGLT VAFGHDCVMD PWYGLGSADM LEVAAMGLHV AQMTGQSAMA ECFAAVTTAP AKLMGLDDYG LMPGCRGDLV LLQAGDPVEA LRLRATRLAV VRGGRIIART PAATATLDLG DGDRPTLNPV SWRGGERPGF
|
| |