Gene RSP_3174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_3174 
Symbol 
ID3721465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007494 
Strand
Start bp226359 
End bp227858 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content68% 
IMG OID640072850 
ProductNCS1 nucleoside transporter 
Protein accessionYP_354690 
Protein GI77465187 
COG category[F] Nucleotide transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG1953] Cytosine/uracil/thiamine/allantoin permeases 
TIGRFAM ID[TIGR00800] NCS1 nucleoside transporter family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACTGG CCGACCAGCT GCGACGCGCC GAGCACGACC GCTCGGCCCT GATCGAGGAA 
TCGATCCTGC CCACCCACCT CAACCAGCGT CCCATCGGGA TGATCGGCTA TGGCTGGATC
TGGGTCGGCA TCGCCGTCAT CATCGCCACC TACTCGCTGG GCGCCGCGGG CGTGGGCGGA
GGCGTGCCGC TTGCCACGGT GATCCTCACC ATCTTTGCGG CCAACCTCAC CATCGGCGCC
TTCATGCTGC TCACGGCCGA CATCGGCACC GAGCATGGCG TGCCCTTCGC GGTCTATCTG
CGGGCGCCCT TCGGCATCCA CGGCACCCAC CTGCCCTCGC TCTCGCGCGG GCTGGTGGCG
GCGATGTGGT TCGGCATCCA GACCTATCTC GGTGCGCTGG CCCTGAACGG GATCGGCGAA
TATGTCCTCG GCGTCTCGAA CTGGTTCGTC TGGTATCTCC TCTTCGGCCT CCTCCAGATC
GCGAGCACCA TGGCGGGCAT CCGCTCGGTC GAGCGGCTGG CGGCGCTGGC CGCGCCCGCG
ATCATCGCCA TCTCGGTCTG GATGTATTTC AGCCTCGAGG GGATCGCCGA GACCAAGGGG
CTCAACATCT GGACCTTCCG CGCCGAGGGG CAGATGTCGC TGCTCGCGCT CTTCATCGCC
AACCTCGGCT TCTGGTCGAC CATGGCCATC GACATTCCGA ACCTGACCCG CTTCATCGCG
GTGAAGCCCG GCGCGCGCGG CTTCCTCAGC CGCAACCGCG CGGTGTTCCT GGGCCAGCTC
GTGGCGCTGC CCGTCACTCA GGCGCTGGTC GCGGGCATCG GCGGGGTCTC GTTCATCGCC
ACCGGCAACT GGAACCCGAT CGAGGTGATC CAGGGGGATG CGCAGGGCCT CTCGCTCCTC
ACGCTTCTCC TCCTCGTCGT GCTGGCGCAA TGGTCGACGA ACAACTCGGC CAACCTGATC
CCGGCCGCAC TGACCTTCGT CAACCTCGCG CCGCGCCGGA TCGATTACCG CATCGGCGTA
GCGCTGGCGG GGGTCGTGGG CACGCTCTGC TTCCCGTGGG AGATCCTGAA CAACCTCTTC
ACCTTCCTCG GCTACTGCGG CGCCTTCCTG CTTTCCATCG GCGGGATCAT GGTGGCGGAT
TACTATGTGC TGCGCGGCCG GCGGGTGAAC GTGCCTGCCC TCTACGATCC GCAGGGCCAG
TACCGCTACG CGGGCGGCTT CAATCCCGCG GGCCTCGTGG CCTGGATCGT CGCGGGGGCG
GCGGCAGCCT GGTGGTCGGA CTATTCCGTC TTCGTGGGCT TCCCGCTGGG CGCCCTCCTT
TATCTCGCGC TGATGAAGCT CGTGGTGCTG CCGCGCCATC CGCAGCCCGA GATGGCCGCG
GCCGAGGGCT ATCTCGCCAC CTCCGAGGGG GTGAGCTGGG CCTATCTCGG CGGCGGCCGG
TTCACCCGCC TGCGCCCCGG CGAGACGGCG GGCGCGGTCG TCCCGCGCGA GGATCTGTAA
 
Protein sequence
MTLADQLRRA EHDRSALIEE SILPTHLNQR PIGMIGYGWI WVGIAVIIAT YSLGAAGVGG 
GVPLATVILT IFAANLTIGA FMLLTADIGT EHGVPFAVYL RAPFGIHGTH LPSLSRGLVA
AMWFGIQTYL GALALNGIGE YVLGVSNWFV WYLLFGLLQI ASTMAGIRSV ERLAALAAPA
IIAISVWMYF SLEGIAETKG LNIWTFRAEG QMSLLALFIA NLGFWSTMAI DIPNLTRFIA
VKPGARGFLS RNRAVFLGQL VALPVTQALV AGIGGVSFIA TGNWNPIEVI QGDAQGLSLL
TLLLLVVLAQ WSTNNSANLI PAALTFVNLA PRRIDYRIGV ALAGVVGTLC FPWEILNNLF
TFLGYCGAFL LSIGGIMVAD YYVLRGRRVN VPALYDPQGQ YRYAGGFNPA GLVAWIVAGA
AAAWWSDYSV FVGFPLGALL YLALMKLVVL PRHPQPEMAA AEGYLATSEG VSWAYLGGGR
FTRLRPGETA GAVVPREDL