Gene Rsph17029_2501 details


Gene Information

Locus tag: Rsph17029_2501
Symbol:
ID: 4897668
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2640027
End bp: 2642180
Gene Length: 2154 bp
Protein Length: 717 aa
Translation table: 11
GC content: 74%
IMG OID: 640113099
Product: sulfotransferase
Protein accession: YP_001044375
Protein GI: 126463261
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF
TIGRFAM ID:
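
The start, end, gene length, and protein length fields above are tied together by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (both are assumptions; the record does not state them). The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids, assuming one terminal stop codon in the CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Values taken from the Gene Information fields of this record.
length_bp = gene_length(2640027, 2642180)
print(length_bp, protein_length(length_bp))
```

With the values listed in this record, the sketch gives 2154 bp and 717 aa, which agrees with the Gene Length and Protein Length fields.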


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.957515
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCTGCCCC TGAACCCTGC CCACCTTCCC CGGCTTCTGA CCGAGGCGCT GCGCCTGCAC 
GAGGCCGGCC GCCTGCCCGA GGCCGAAGAG CGCTATCGCT CCGCGCTGGC CGTGGCGCCG
GACGAACCCG CCGCCAACTT CCATCTGGGC CGCCTGCTCG CCGCCCGCCG TGCGCCCGAG
GCGCTGGACC GGCTGGCCCG CGCCGCCCGC GCCCGCCCGC AGGAGGCCGC GGGCTGGCAG
GCCTGGACCG CCGCCGCCGC GGCCTTCGGC GATGCCACCG CCCGCAAGGA GGTCATCGAG
GCCGCCCGGC AGGCGCGACT GCCCGCCCCG CTCCTGAGCC AGATCGAGAC GCGCCTTTCG
GGGGCGCCCG TGAAGCCCGT GGCGCAGCCC CGCATCGGCC GCGCCGCGCC GGCCGAGATC
AAGGCGCTGC TCTCCGACTA TCAGGCGGGC CGCATGGCCG CGGCCGAGCG CCGGGCCGGC
GCGATCCTCG CCGCCGCGCC CGATTGCGCG CTGGCCGCCG ACGTGCTGGG CAACGCGCAG
ATGGCGCAGG GGCGGCAGGC CCCGGCCCTC GCCGCCTTCC AGCGGGTCGT GGCGCTGGAG
CCCGGCTGGC CGGACGGGCA TCTGCATCTC GCGCAGGCGC TCTTGGCGCT CGGCCGGGCG
GCCGAGGCGC TCGATCCGCT CGCGCGCGCG GCGGCGCTGA CCCCGAAGCC CGCCCGGGCG
CTGATGCTGC TCGCGGTGAC GCTCGCGCGG ATGGGGCGGC AGACCGCGGC GCTGGCGGCG
CTGAAGCGCG CGGTCGCGGC CGAGCCCCGC CACCACGAGG CGCAGTTCCA GCTGGGCATC
CTGCAGACCG ACCTGCGCGC CTATGCCGAG GCGGAGGCCG CCTTCCGCGC GGCCGAGGCT
GCGGGCAATC GCTCGGCCGA CCTCCAGCTC AGGCTGGGAC AGGTGCTGCT CATGCAGGGC
GACGAGGCCG GCGCCGCGGC GGCCTACGAG ACGGGCCTCG CGCGCGAGCC CGATCATGCG
ATGCTCCTGT CGCGGAAGGC GCTGCTCCTG CAGGGCCGCG GCGCCTTCGC CGAGGCCGAG
GCGCTGCTGC GGCGCGCCAT CGAACTCGCG CCCGGCACCG GCGAGTTCTA CCGCATCCTG
TCGGCCAGCC TGAAGATGAC GCCCGAGGAT CCGCTGCTTC TCGAGATGCA GCGCCGGTTC
GACGACCCGG CCACGGCCGA GGCCGACCGG ATGCATCTGG GCTTCGCGCT GGCCAAGGCG
ATGGACGACC TGAAGCGGCC CGAGAGCCTG TTCACCTACC TGCGGCCCGC CAACCAGCTG
ATGCGCAAGG CGCACCCCTA CGACATCGCG AGCCGCCGGG CCGAGCTCGA GGGGATCTTC
ACCACCTTCG CCGACTTCAC CCCCGCCCCC GCCCCGGAGG CGAGCGACTT CGCCCCGGTC
TTCGTGACGG GAATGCCGCG CTCGGGCACC ACGCTCGTCG AGCAGATCAT CGCGAGCCAC
AGCCGCATGA CCGGCGCGGG CGAGGCGGGC GTGGCCGTGC GCGAGGTGCA GAAGGTGGTC
CTCGATGCCG CGGGCCGCTA CCGCGCCTGG AGCGAGATCG CGGGCGCGGA GGTGGCGGCC
ATGGGCCGGC GCTACGAGGC GGAGATGCGC CGCCGCTTCC CCGAGGCGGT GCAGGTCACC
GACAAGTCGA TCCAGACCTA TGCCTGGATG GGCTTCATTG CCTCGGCCCT GCCGCAGGCG
AAGTTCGTCG TCGTGCGCCG CGATCCGCGC GACACGGCGC TGTCGATCTA CCGCAACGTC
TTTGCCGAGA ACACGCATCT CTATGCCTAC GACCTGCGCG ACCTCGGACT CTATTTCCGC
ATGTTCGAGG AGCTGATCGA CTTCTGGCGC GAGAAGCTGC CGGGCGGCTT CCACGAGATC
CAGTATGAGG AGCTGGTGGA CCATCCCGAG GAGCAGTCGC GCCGCCTGAT CGCCGCCTGC
GGCCTGCCGT GGGAGGAGGC CTGCCTCAGC TTCCACGAGA ACAAGCGGCG GGTGCAGACG
CTGAGCCTCT ATCAGGTGCG CCAGCCGATC TATCGCAGCT CGACCCGCGC CTGGGAGCGG
CATGCGGAGG AGCTGAAGGA CTTTACCGAC GCGCTGGAGG GCCGACATGC TTGA
 
Protein sequence
MLPLNPAHLP RLLTEALRLH EAGRLPEAEE RYRSALAVAP DEPAANFHLG RLLAARRAPE 
ALDRLARAAR ARPQEAAGWQ AWTAAAAAFG DATARKEVIE AARQARLPAP LLSQIETRLS
GAPVKPVAQP RIGRAAPAEI KALLSDYQAG RMAAAERRAG AILAAAPDCA LAADVLGNAQ
MAQGRQAPAL AAFQRVVALE PGWPDGHLHL AQALLALGRA AEALDPLARA AALTPKPARA
LMLLAVTLAR MGRQTAALAA LKRAVAAEPR HHEAQFQLGI LQTDLRAYAE AEAAFRAAEA
AGNRSADLQL RLGQVLLMQG DEAGAAAAYE TGLAREPDHA MLLSRKALLL QGRGAFAEAE
ALLRRAIELA PGTGEFYRIL SASLKMTPED PLLLEMQRRF DDPATAEADR MHLGFALAKA
MDDLKRPESL FTYLRPANQL MRKAHPYDIA SRRAELEGIF TTFADFTPAP APEASDFAPV
FVTGMPRSGT TLVEQIIASH SRMTGAGEAG VAVREVQKVV LDAAGRYRAW SEIAGAEVAA
MGRRYEAEMR RRFPEAVQVT DKSIQTYAWM GFIASALPQA KFVVVRRDPR DTALSIYRNV
FAENTHLYAY DLRDLGLYFR MFEELIDFWR EKLPGGFHEI QYEELVDHPE EQSRRLIAAC
GLPWEEACLS FHENKRRVQT LSLYQVRQPI YRSSTRAWER HAEELKDFTD ALEGRHA
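
The protein sequence can be cross-checked against the gene sequence by translating the DNA codon by codon. The sketch below assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (the tables differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, chosen for illustration; the 10-base spacing of the sequence blocks is stripped before processing.

```python
# Compact codon table: bases in T, C, A, G order, with the amino-acid
# assignments of NCBI translation table 11 (same as the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence; upper-case it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
first_block = "ATGCTGCCCC TGAACCCTGC CCACCTTCCC"
print(translate(first_block))  # prints MLPLNPAHLP
```

Passing the full gene sequence to `translate` and dropping the trailing `*` should reproduce the protein sequence listed above; `gc_fraction` on the same input can be compared with the GC content field.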