Gene Rsph17029_3911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3911 
Symbol 
ID4898982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp1045760 
End bp1047259 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content68% 
IMG OID640114514 
Productpermease for cytosine/purines, uracil, thiamine, allantoin 
Protein accessionYP_001045761 
Protein GI126464648 
COG category[F] Nucleotide transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG1953] Cytosine/uracil/thiamine/allantoin permeases 
TIGRFAM ID[TIGR00800] NCS1 nucleoside transporter family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.971412 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACTGG CCGACCAGCT GCGACGCGCC GAGCATGACC GCTCGGCCCT GATCGAGGAA 
TCGATCCTGC CCACCCACCT CAACCAGCGT CCCATCGGGA TGATCGGCTA TGGCTGGATC
TGGGTCGGCA TCGCCGTCAT CATCGCCACC TACTCGCTGG GCGCCGCGGG CGTGGGCGGA
GGCGTGCCGC TTGCCACGGT GATCCTCACC ATCTTTGCGG CCAACCTCAC CATCGGCGCC
TTCATGCTGC TCACGGCCGA CATCGGCACC GAGCATGGCG TGCCCTTCGC GGTCTATCTG
CGCGCGCCCT TCGGCATCCA CGGCACCCAC CTGCCCTCGC TCTCGCGCGG GCTGGTGGCC
GCGATGTGGT TCGGCATCCA GACCTATCTC GGTGCGCTGG CCCTGAACGG GATCGGCGAA
TATGTCCTCG GCGTCTCGAA CTGGTTCGTC TGGTATCTCC TCTTCGGCCT CCTCCAGATC
GCGAGCACCA TGGCGGGCAT CCGCTCGGTC GAGCGGCTGG CGGCGCTGGC CGCGCCCGCG
ATCATCGCCA TCTCGGTCTG GATGTATTTC AGCCTCGAGG GGATCGCCGA GACCAAGGGG
CTCAACATCT GGACCTTCCG CGCCGAGGGG CAGATGTCGC TGCTCGCGCT CTTCATCGCC
AACCTCGGCT TCTGGTCGAC CATGGCCATC GACATTCCGA ACCTGACCCG CTTCATCGCG
GTGAAGCCCG GCGCGCGCGG CTTCTTCAGC CGGAACCGGG CGGTGTTTCT GGGCCAGCTG
GTGGCGCTGC CCGTCACTCA GGCGCTGGTT GCGGGCATCG GCGGGGTCTC GTTCATCGCC
ACCGGCAACT GGAACCCGAT CGAGGTGATC CAGGGGGATG CGCAGGGCCT CTCGCTCCTC
ACGCTTCTCC TTCTGGTCGT GCTGGCGCAA TGGTCGACGA ACAACTCGGC CAACCTGATC
CCGGCCGCGC TGACCTTCGT CAACCTCGCG CCGCGCCGGA TCGACTACCG CATCGGCGTG
GCGCTGGCGG GGATCGTGGG CACGCTCTGC TTCCCGTGGG AGATCCTGAA CAACCTCTTC
ACCTTCCTCG GTTACTGCGG CGCCTTCCTG CTCTCCATCG GCGGGATCAT GGTGGCGGAT
TACTATGTGC TGCGCGGCCG GCGGCTGAAC GTGCCTGCCC TCTACGACCC GCAGGGCCAG
TACCGCTACG CGGGCGGCTT CAATCCCGCG GGCCTCGTGG CCTGGATCGT CGCGGGGGCG
GCGGCAGCCT GGTGGTCGGA CTATTCCGTC TTCGTGGGCT TCCCGCTGGG CGCCCTCCTT
TATCTCGCGC TGATGAAGCT CGTGGTGCTG CCGCGCCATC CGCAGCCCGA GATGGCCGCG
GCCGAGGGCT ATCTCGCCAC CTCCGAGGGG GTGAGCTGGG CCTATCTCGG CGGCGGCCGG
TTCACCCGCC TGCGCCCCGG CGAGACGGCG GGCGCGGTCG TCCCGCGCGA GGATCTGTAA
 
Protein sequence
MTLADQLRRA EHDRSALIEE SILPTHLNQR PIGMIGYGWI WVGIAVIIAT YSLGAAGVGG 
GVPLATVILT IFAANLTIGA FMLLTADIGT EHGVPFAVYL RAPFGIHGTH LPSLSRGLVA
AMWFGIQTYL GALALNGIGE YVLGVSNWFV WYLLFGLLQI ASTMAGIRSV ERLAALAAPA
IIAISVWMYF SLEGIAETKG LNIWTFRAEG QMSLLALFIA NLGFWSTMAI DIPNLTRFIA
VKPGARGFFS RNRAVFLGQL VALPVTQALV AGIGGVSFIA TGNWNPIEVI QGDAQGLSLL
TLLLLVVLAQ WSTNNSANLI PAALTFVNLA PRRIDYRIGV ALAGIVGTLC FPWEILNNLF
TFLGYCGAFL LSIGGIMVAD YYVLRGRRLN VPALYDPQGQ YRYAGGFNPA GLVAWIVAGA
AAAWWSDYSV FVGFPLGALL YLALMKLVVL PRHPQPEMAA AEGYLATSEG VSWAYLGGGR
FTRLRPGETA GAVVPREDL