Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A3684 |
Symbol | |
ID | 3837140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | - |
Start bp | 4231018 |
End bp | 4232646 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637827808 |
Product | sulfotransferase |
Protein accession | YP_428765 |
Protein GI | 83595013 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.843363 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGGCC ATCACGCGGC GGCGCGCGCC TTGCGCCGTC AGGGGCGCCT AGGCGAGGCG ATCGCCCGCT ATCGCGCGGC CCTGGCCGCC ACCCCGGGCG CGGCCGTTCT CCAAGCCGAA ATGGCCGAAT GCCTGTTCGC CGCCGGTCAG GCCGAAGCGG CGGTGAAGGC GCTGGGTCAG GCGGTCCGGC TCGACCCCGA TAACGCCACC CACCGCGGCA ATCTGGCGAT GCTGCTGGCC CGCAAGGGCG ATCTGGCCGG CGCCATTGAT CATGCCCGGG CGGCGGTGCG CCTCGCCCCC AAGGACGGCG CCTTGCGCCT GCGTCTGGCC CGTCTGCTGA TCGCGGCCGG CCATCCCCGC GAGGCCGAGG CCGAGGCCCT GGACGCCACC CGCCGTCTGC CGCGCGAGGC CGCCAGTTGG ATTGCCCTGG CCGGAGCCCG GCTGCTTGAC CAGCGCCCCA ATGACGCCGA AGCGCCCAGC GCCCGCGCCC TAGCCCTGGC CCCCAATGAC GCCGAGGCGC TCAGCCTGCG CACCGATCTG CTGCTCAGCC TCGGCCGTCT GGCCGAGGCC GAGGCGACGG CCCGCCTTGC CTTGAAGCGG CAACCCACGT CGCTGGCCGC CCTGGTCGCG CTGTCGAAAG CCAAGACCTT CCGCCCCGAC GACCCGGACT GGCCGGCCCT GGCGGCCCTG CTGCCCAGCC TGGGCGAGAG GAGCGCCGAA GAGGCGGTAA AGCTGCATTT CGCCAGCGCC AAGGCGCTGG AGGACATGGG CCGCGACGAC GAGGCCTTCG CCCATTATCA GGCGGGCAAC AGCCTGAAGG GCAGGGGCCT GCCCGATGAA CTGCCGGCGC TGAGCGCCAT GGTCGACAGC CTGGAACGCT GGACGCCCAA GCTGCGGCCG GTCGGCGAGG GCGATCCTTT GCCGGTGTTC ATCGTCGGCA TGCCGCGCTC GGGCACCACG CTGGTCGAAC AGATCCTTGA CCGCCACCGG GCGATCCATG GCGCCGGCGA GATCTTGCTG TTTGGCGAAA GGGTGGTCGC CAACGGCCTG GGCGGCTATA GCGCCGATCC GCAAGGCCTT GATCCCGAGC GGCTAGCGGC TTTGGGGGCG GATTATCGCG ACCGCCTGCG CGGCCTCGCC CCCCGGGCCG GGCGCATCAT CAACAAGACC CCGGGTAACT GGCTGCATCT GGGGCTGATC GCCGCCGCCC TGCCCGGCGC CAGGATCATC TGGTGCCGGC GCGATCCCGT CGATTGCTGC CTGTCGTGCT TTCGCAATCT GTTCGGCCAG GGCCACGCCT GGACCACCGA TCTGGGGCGG GCCGGGCGCT ACTATCGCCT TCAAGAGCGG CTGACCGGCC ACTGGCAGGC CGTGCTGGGC GATGAGCGCA TGACCGCCGT TGATTACGAG GCCCTGGTCG CCGACCCGGA GGCCGAGGCC CGCCGGCTGG TCGCCCACCT TGGCCTGGAG TGGGACGAGG CCTGCCTTGA CCATACCCGG GGCGGGCGGG CGGTCACCAC CTTGTCCCAG GTCCAGGTCC GCCAGCCGAT CACCGACGCC TCCGTCGGTC GCGGCCGGCG GTTCCAGACC CACCTCGGGC CGCTTCTCAC CGCCCTGGAC GGGCGCTGA
|
Protein sequence | MQGHHAAARA LRRQGRLGEA IARYRAALAA TPGAAVLQAE MAECLFAAGQ AEAAVKALGQ AVRLDPDNAT HRGNLAMLLA RKGDLAGAID HARAAVRLAP KDGALRLRLA RLLIAAGHPR EAEAEALDAT RRLPREAASW IALAGARLLD QRPNDAEAPS ARALALAPND AEALSLRTDL LLSLGRLAEA EATARLALKR QPTSLAALVA LSKAKTFRPD DPDWPALAAL LPSLGERSAE EAVKLHFASA KALEDMGRDD EAFAHYQAGN SLKGRGLPDE LPALSAMVDS LERWTPKLRP VGEGDPLPVF IVGMPRSGTT LVEQILDRHR AIHGAGEILL FGERVVANGL GGYSADPQGL DPERLAALGA DYRDRLRGLA PRAGRIINKT PGNWLHLGLI AAALPGARII WCRRDPVDCC LSCFRNLFGQ GHAWTTDLGR AGRYYRLQER LTGHWQAVLG DERMTAVDYE ALVADPEAEA RRLVAHLGLE WDEACLDHTR GGRAVTTLSQ VQVRQPITDA SVGRGRRFQT HLGPLLTALD GR
|
| |