Gene RPD_2232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2232 
Symbol 
ID4022717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2495632 
End bp2496882 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content68% 
IMG OID637962427 
ProductNa+ dependent nucleoside transporter-like 
Protein accessionYP_569368 
Protein GI91976709 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1972] Nucleoside permease 
TIGRFAM ID[TIGR00804] nucleoside transporter 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.363094 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCAAC TGCAATCTGC TTTCGGAATC TTCGCGCTGC TGATGCTCGC CTGGGCGTTC 
GGCGAGCACC GCGACCGCGT CTCGGTTCGG ACGACGGCGA TCGGAATAGC GGTGACGCTC
GCAACCGCCG CGCTGATGTT GAAGCTGCCC GGCGCCGTGC ACGTCTTCGG CGCCGTCAAC
GACGCGGTCG GGGTGATCGG CGCGGCGACG CGCGCCGGCA CCTCGTTCGC GTTCGGCTAT
CTCGGCGGCG GTCAGGCGCC GTTCGACATC AGGGCGCCCG GCGCCGATTT CATCCTGGCG
TTCCAGGCGC TGCCCGTGGT GCTGGTGATG AGCGTGCTCA CCACGCTGCT GTTCTACTGG
AAGATCCTGC CGCCGATCGT GCACGGCATG GCGTGGATTC TGGAGCGCAC GCTCGGCGTC
GGCGGCGCGG TCGGGCTGTC GACCGCGGCC AATGTGTTTC TCGGCATGGT CGAAGCGCCG
CTGTTCATCA GGCCGTATCT GGCGCAGCTC ACCCGCAGCG AACTGTTTCT GGTGATGACC
GGCGGCATGG CCGGCATCGC CGGCACCGTG CTGGTGCTGT ACGCGACCCT GCTCGCGCCG
ATCCTGCCCG ACGCCGCGGC CCATTTCGTG ATCGCCTCGG TGCTCGGCGC GCCGGCGGCG
ATTCTCGTCA GCCTGATCAT GGTGCCGGAA ACCGAACCGC GCGCGACCGG CGCCGCGCTC
GCCGATCCTG AAGCGATCGC CAAGAGCGGC ATGGACGCCG TGGTCAAAGG CACCGCTGCC
GGCCTCGAAC TGCTGCTCAA CATCATTGCC ATGCTGATCG TGCTGGTGGC GCTGGTCTAT
CTCGCCAACG CAGCTCTCGG CCTGCTGCCA CAGATCGGCG GCGCGCCGAT CTCGCTGCAG
CGATTGCTGG GCTATGCGAT GGCGCCGATC TGCTGGCTGC TCGGCCTGCC GTGGCCGCAG
GCGGTCACCG CCGGCGCGCT GATGGGCATC AAGACCGTGC TCAACGAGCT GATCGCCTAT
GTCGAACTCG CCAAGCTCGC GCCCGACGCG CTGGATCCGC GCTCGAAACT GATCATGCTG
TACGCGATGT GCGGCTTCGC CAATTTCGGC AGCCTCGGCA TCATGCTCGG CGGCCTCACC
GCGATGGCCC CGCAACGCCG CGAGGAGATC GCCTCGCTCG GCCTGAGGTC GATCGTCTCC
GGCACCCTCA CCACCTGCCT GATCGGCGCG GTGGTGGGAG TGATGACGTG A
 
Protein sequence
MLQLQSAFGI FALLMLAWAF GEHRDRVSVR TTAIGIAVTL ATAALMLKLP GAVHVFGAVN 
DAVGVIGAAT RAGTSFAFGY LGGGQAPFDI RAPGADFILA FQALPVVLVM SVLTTLLFYW
KILPPIVHGM AWILERTLGV GGAVGLSTAA NVFLGMVEAP LFIRPYLAQL TRSELFLVMT
GGMAGIAGTV LVLYATLLAP ILPDAAAHFV IASVLGAPAA ILVSLIMVPE TEPRATGAAL
ADPEAIAKSG MDAVVKGTAA GLELLLNIIA MLIVLVALVY LANAALGLLP QIGGAPISLQ
RLLGYAMAPI CWLLGLPWPQ AVTAGALMGI KTVLNELIAY VELAKLAPDA LDPRSKLIML
YAMCGFANFG SLGIMLGGLT AMAPQRREEI ASLGLRSIVS GTLTTCLIGA VVGVMT