Gene Information |
Locus tag | Rru_A3656 |
Symbol | |
ID | 3837112 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | - |
Start bp | 4196060 |
End bp | 4198018 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637827780 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_428737 |
Protein GI | 83594985 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
| |
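The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the listed gene length is consistent with that convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Values copied from the record for Rru_A3656
length_bp = gene_length_bp(4196060, 4198018)
length_aa = expected_protein_length_aa(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the computed lengths agree with the listed Gene Length (1959 bp) and Protein Length (652 aa).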
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.219229 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGTCGC TGTTTACCCC GATGCCGCCC CGGGCGCGCA TCGTCTTCGC CCACGACGTG GTGATGGCGG CGCTGTCCTT CGTGATCTCG CTGTATCTGC GCCTGGGTGA CGGCCTGATC CGCTATGTGC CGCTACCCGA TCTGGGGCGC GAGGCCGCCC TGTTCGCCGC TTGCGCGGCG GTCGCCTTTT ACACCCAGCG CATGTATGTG GGCATCTGGC GCTATGCCTC GGTCGATGAC CTGATCGCCC TGTTGCGCGG CGCCACCCTG ACCCTGGTGC TGTTCCTGCC CGCCGCCTTT CTCATCACCC GCATGGAATA CGTGCCCCGG TCGGTGCTGG TCATCAATTG GGTGGTGCTG ATGGCCCTGC TGGGCGGCCC GCGCTTTTTG TACCGGCTGT TCAAGGACCG CAAATTCCAG CTGCGGCGCG CCCGCTCGCG CTCGGCGATC CCGGTCTTGC TGGTCGGGGC CGGTGACGGC TGCGAGATGT TCTTGCGCTC GGTCGCCCGC GAGACCGACA GCCCCTATCT GCCCGTCGGC ATCCTGTCCG AGCAGGAAAC CCGAGTCGGG CGCAATCTGC GCGGCGTCGA GGTTCTGGGC ACCTTGGATC GTCTGGCCGA GATCGTCGTC CGGCTTGAAC GCTGGGGCGA GCGGCCGCGC AAGCTGATCT TGACCTCCGA GACCCTCGAT CCCGCCCAGG TGCGCGGCCT GCTTGACGCC TGCGACGCCT TGGGGCTGTC GCTGGCCCGG TTGCCCCGGC TGATGGAGCT GCGCGACGGC GGCGCCGATA GCCTGGAGGT GCGCCCGGTG GCCATCGAGG ATCTGCTGGG CCGGCCCCAG GCCGTGCTTG ACCGCGCCCC CGTCGTCGCC CTGGTCGGCG GTCGCCGGGT GCTGGTCACC GGCGCCGGCG GCTCGATCGG CTCGGAGCTG GTGCGCCAGA TCGCCGCCCT CGACCCGGCC CGCCTGATCC TGGCCGACAG CAGCGAATTT GCCCTCTACA CCATCGATAT GGAGATCAAC GAGCGCTATC CGACGCTGTC GCGCCGGCCG GTGATCGCCG ATGTCCGCGA TGGCGACCGC GTCGATCGGG TGATGGCCGA GGAAAAGCCC GAGCTGGTGT TCCACGCCGC CGCCCTCAAG CATGTGCCGA TGGTCGAATT CAATCCGCTT GAGGGCCTGC GCACCAATGC CCTGGGCACG CGCACGGTGG CCGAGGCCTG CCGCCGTCAC GGCGTTGGCA CCATGGTGCT GATCTCGACC GACAAGGCGG TCAACCCGAC CAATGTGATG GGCGCCTCCA AGCGCATGGC CGAAAGCTGG TGTCAGGCCC TTGACCTCGC CGAGCGAACG AGCGCGGCGG CCACCCATTT CGTCACCGTG CGCTTTGGCA ATGTGCTGGG ATCGACCGGC TCGGTGGTGC CCTTGTTCCA GCGCCAATTG GCCGCCGGCG GTCCGCTGAC CGTGACCCAT CCCGAAATCA CCCGCTTCTT CATGACCATC CGCGAAGCCG TGGAGCTGGT CTTGCAGGCC GCCGCCCTGG GGTCGCGCGA GGAAAGCTAT CGCGGCTCGC TGTTCGTGCT GGATATGGGC GAGCCGGTGA AGATCGCCGA TCTCGCCCGC CAGATGATCC GTCTGGCTGG CCTCAAGCCC GAGGTCGATG TCCAGATCGC CTATACGGGC CTGCGTCCGG GCGAGAAGCT GTTCGAAGAG ATCTTCCACG GCAAGGAAAC CTCGCTGCCC ACGCCGACCG ACGGCGTGTT GGTGGCCGCG CCGCGCGTGC CCGATCCGGT GTTCCTGGGC 
ATCCAGTTCG ACGCCCTGGC CAGGGCCTGC GCGGCGGGCG ACGAGGCGCA GGCCCGCGCC GTGATCGCCG CCCTGGTCCC CGAATTCCAC AATCCCCCGG CCGAGACCAC GCCCCAACCC TTTGAGGGCG AGTGCGCCCA GGCGCGCGCC GTGCTATAG
|
Protein sequence | MRSLFTPMPP RARIVFAHDV VMAALSFVIS LYLRLGDGLI RYVPLPDLGR EAALFAACAA VAFYTQRMYV GIWRYASVDD LIALLRGATL TLVLFLPAAF LITRMEYVPR SVLVINWVVL MALLGGPRFL YRLFKDRKFQ LRRARSRSAI PVLLVGAGDG CEMFLRSVAR ETDSPYLPVG ILSEQETRVG RNLRGVEVLG TLDRLAEIVV RLERWGERPR KLILTSETLD PAQVRGLLDA CDALGLSLAR LPRLMELRDG GADSLEVRPV AIEDLLGRPQ AVLDRAPVVA LVGGRRVLVT GAGGSIGSEL VRQIAALDPA RLILADSSEF ALYTIDMEIN ERYPTLSRRP VIADVRDGDR VDRVMAEEKP ELVFHAAALK HVPMVEFNPL EGLRTNALGT RTVAEACRRH GVGTMVLIST DKAVNPTNVM GASKRMAESW CQALDLAERT SAAATHFVTV RFGNVLGSTG SVVPLFQRQL AAGGPLTVTH PEITRFFMTI REAVELVLQA AALGSREESY RGSLFVLDMG EPVKIADLAR QMIRLAGLKP EVDVQIAYTG LRPGEKLFEE IFHGKETSLP TPTDGVLVAA PRVPDPVFLG IQFDALARAC AAGDEAQARA VIAALVPEFH NPPAETTPQP FEGECAQARA VL
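The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons), and it strips the space-separated 10-base grouping used in the record. The `translate` helper is hypothetical, and only the first and last codons of the gene are used here rather than the full sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping in the record) is ignored.
    Note: an alternative start codon (e.g. GTG) would need to be read as M;
    that case is not handled because this gene starts with ATG.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in the record
print(translate("ATGCGGTCGC TGTTTACCCC GATGCCGCCC"))  # MRSLFTPMPP
print(translate("GTGCTATAG"))                         # VL
```

The two fragments translate to the first ten residues (MRSLFTPMPP) and the last two residues (VL) of the listed protein sequence, with the final TAG codon read as the stop.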