Gene Information |
Locus tag | Rsph17025_3132 |
Symbol | |
ID | 5083330 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | - |
Start bp | 3207221 |
End bp | 3209383 |
Gene Length | 2163 bp |
Protein Length | 720 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640484704 |
Product | sulfotransferase |
Protein accession | YP_001169321 |
Protein GI | 146279162 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
|
|
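The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, not part of the record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


# Values copied from the record for Rsph17025_3132.
length = gene_length_bp(3207221, 3209383)
print(length)                      # 2163
print(protein_length_aa(length))   # 720
```

Both results agree with the Gene Length (2163 bp) and Protein Length (720 aa) rows.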
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.915715 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000116891 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATCCCA TGCTGCCCCT GAACCCTGCC CACCTCCCAC GGCTTCTGAC CGAGGCGCTG CGCCTGCACG AGGCCGGCCG CCTGCCGGAG GCCGAGGAGC GTTACCGCTC GATGCTGGCC ATCGCGCCCG ACGATGCGGC GGCGAGCTTC CACCTCGGCC GTCTGCTCGC CACCCGCCGC GCGCCCGAGG CGCTGGACCA TCTCGGCCGC GCCGCCCGGA CCCGCCCGCA GGAGGCCGCC GTCTGGCAGG CCTGGAGCAC CGCCGCAGCC ACCTTCGGCG ACGGCGCCGC CCGCCAGTCC GTGATCGAGG CCGCGCGCGA GGCCCGCCTG CCTGCTCTGC TCCTGCAGCA GATCGAGGGG CGGCTCTCGG CCGGGCAGGC GAAGCCCGTG GCGCAGCCCC GCATCGGGCG CGCGGCGCCC GCCGAGGTGA AGGCGCTCCT GACCGACTAT CAGGCGGGGC GGATGCCGGC CGCCGAGCGG CGCGCCCTCG CGATCCTCAA GGCCGCTCCC GACTGCGCAC TCGCCGCGGA TGTGCTGGGC AATGCGCGGA TGGCGCTCGG CCAGCCCGCA ACCGCCCTCG CCGCCTTCCA GCGCGCCACG GCGCTCGAAC CGGGCTGGCC GGATGGGCAT CTCCATCTGG CGCAGGCGCT TCTCGCACTC GGACGCCCGG CCGAGGCGCT CGATCCGCTG GGTCGCGCCG CGAGCCTGTC CCGCAAGCCC GCCCGCGCGC TCATCCTGCT TGCGGTGACG CTGGCCCGGC TGGGCCGACC CGCGCAGGCG CTGGCGGCTC TCAAGCGCGC GGTGGCCGCC GAACCGGACC ATGCCGAGGC GCACCTCCAG CTCGGAATCC TGCAGACCGA CCTGCGCAAT CTGGCCGAGG CCGAAGCCGC CTTCCGCGCC GCCGAGGCGG CCGGCAACCG CTCGGCCGAT CTGCAGCTGC GGCTCGGGCA GGTGCGGCTG TTGCGGGGCG ACGAGGCGGG GGCGGACGCC GCCTACGACA AGGGGCTCGC GCGCGAGCCC GACCATGCGA TGCTGCTCTC GCGCAAGGGG CTGCTGCTGC AGGGCCGCGG CGCCTTCGCC GAGGCCGAGG CGCTGCTGCG CCGGGCGATC GCCCTCGCGC CGGACCGGGG AGAGTTCTAC CGCATCCTGT CGGCGAGCCT GAAGATGACG CCCGACGACC CGCTTCTGGC CGACATGCAG CGCCGCTTCG ACGACCCCGC CACGCCCGAG GCCGACCGGA TGCATCTCGG CTTTGCGCTG GCCAAGGCGA TGGACGACCT CAAGCGCCCC GAGAGCCTAT TCGCCTATCT GCACCCGGCC AACCGGCTGA TGCGCAAGGC CCACCCCTAT GACATCGCCA CCCGCCGGGC GGAGCTTGAC GGGATCTTCA GGACCTTCGC CGATTTCACC CCCGCGCCGG CGCCCGGTGC CACGGACTAC GCCCCCGTCT TCGTGACCGG CATGCCGCGC TCGGGCACGA CCCTTGTCGA GCAGATCATC GCCAGCCACA GCCGCATGAC CGGCGCGGGT GAGGCCGGCG TTGCCGCGCG CGAGGTGCAG AAGGTCGTGC TCGACGCCGA GGGCCGCCAC CGCCGCTGGG GCGACATCCC GCCCGAAGAG GTCGCCGCAA TGGGCCGCCG CTACGAGGCC GAGATGCGCC GCCGCTTTCC CGACGCGGCT CAGGTCACCG ACAAGTCGAT CCAGACCCAC GCCTGGATGG GCTTCATCGC CTCGGCCCTG CCCAAGGCGA AGTTCGTGGT CGTGCGCCGC GACCCGCGCG ATACCGCGCT CTCGATCTAT 
CGGAATGTCT TTGCCGAGAA CACGCATCTC TACGCCTATG ACCTGCGCGA CCTCGGCCTC TACTTCCGCA TGTTCGAGGA GCTGATCGAC TTCTGGCGCG AAAAGCTGCC GGGGGGCTTC CACGAGATCC AGTATGAGGA TCTGGTGGCC CATCCCGAAG AGGAGTCGCG CCGCCTGATC GCCGCCTGCG GCCTGCCGTG GGAGGATGCC TGCCTCAATT TCCACGAGAA CACGCGCCGC GTGCAGACGC TGAGCCTCTA TCAGGTGCGC CAGCCGATCT ACCGCAGCTC GACCCGGGCG TGGGAGCGCC ATGCCGCGGA CCTCAAGGAG TTCATCGACG CGCTGGAGGG CACGAATGCT TGA
|
Protein sequence | MDPMLPLNPA HLPRLLTEAL RLHEAGRLPE AEERYRSMLA IAPDDAAASF HLGRLLATRR APEALDHLGR AARTRPQEAA VWQAWSTAAA TFGDGAARQS VIEAAREARL PALLLQQIEG RLSAGQAKPV AQPRIGRAAP AEVKALLTDY QAGRMPAAER RALAILKAAP DCALAADVLG NARMALGQPA TALAAFQRAT ALEPGWPDGH LHLAQALLAL GRPAEALDPL GRAASLSRKP ARALILLAVT LARLGRPAQA LAALKRAVAA EPDHAEAHLQ LGILQTDLRN LAEAEAAFRA AEAAGNRSAD LQLRLGQVRL LRGDEAGADA AYDKGLAREP DHAMLLSRKG LLLQGRGAFA EAEALLRRAI ALAPDRGEFY RILSASLKMT PDDPLLADMQ RRFDDPATPE ADRMHLGFAL AKAMDDLKRP ESLFAYLHPA NRLMRKAHPY DIATRRAELD GIFRTFADFT PAPAPGATDY APVFVTGMPR SGTTLVEQII ASHSRMTGAG EAGVAAREVQ KVVLDAEGRH RRWGDIPPEE VAAMGRRYEA EMRRRFPDAA QVTDKSIQTH AWMGFIASAL PKAKFVVVRR DPRDTALSIY RNVFAENTHL YAYDLRDLGL YFRMFEELID FWREKLPGGF HEIQYEDLVA HPEEESRRLI AACGLPWEDA CLNFHENTRR VQTLSLYQVR QPIYRSSTRA WERHAADLKE FIDALEGTNA
|
| |
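The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It is an illustration under stated assumptions, not the database's own method: the codon table is built from the standard NCBI base order (T, C, A, G), whose amino acid assignments translation table 11 shares with the standard code; alternative start codons are not handled, which does not matter here because this gene begins with ATG. The helper names `clean`, `gc_percent`, and `translate` are hypothetical. Only the first 30 bases of the gene sequence are used as input.

```python
from itertools import product

# Codons enumerated in NCBI order: first base varies slowest, bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon in frame 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record.
prefix = "ATGGATCCCA TGCTGCCCCT GAACCCTGCC"
print(translate(prefix))  # MDPMLPLNPA
```

The translated prefix matches the first ten residues of the protein sequence shown above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content row.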