Gene RPC_3167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3167 
Symbol 
ID3972603 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp3511342 
End bp3512592 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content67% 
IMG OID637926277 
ProductNa+ dependent nucleoside transporter-like 
Protein accessionYP_533028 
Protein GI90424658 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1972] Nucleoside permease 
TIGRFAM ID[TIGR00804] nucleoside transporter 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.726241 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.226636 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCAAT TGCAATCGGC GTTTGGTGTC TTTGCACTGC TCATGCTGGC GTGGACGTTC 
GGCGAAAACC GCGGCGCGGT GTCGCTGAAA CCGGTCGTGA TCGGCCTGGC CGCGAGCCTG
CTCACCGCGG TGACGCTGTT GAAGCTGCCG ATCGTCGCGC ACGGCTTCGG CGTCATCAAC
GACGCCGTCG GGGTAATTTC CGCCGCCAGC CGCGCCGGCA CCAGTTTCGT GTTCGGCTAT
CTCGGCGGCG GTCCGCTGCC GTTCGATCCG AAGGCGCCCG GCGCCGGCTT CATCCTGGCG
TTCCAGGCGC TGCCGATCGT GCTGGTGATG AGCGTGCTGA CCACGCTGTT GTTCTATTGG
AAAATCCTGC CGCCGGTGGT GCGCGGCCTG GCGTGGGCGT TGCAGCGCAC GCTCGGCGTC
GGCGGCGCGG TCGGGCTGTC GACCGCCGCC AACGTGTTTC TCGGCATGGT GGAAGCGCCG
CTGTTCATCC GGCCGTATCT GGCGCAATTG ACCCGCGCCG AACTGTTCCT GGTGATGACC
GGCGGCATGG CCGGCATCGC CGGCACCGTG CTGGTGCTGT ACGCCACGCT GCTGGCGCCG
CTGATCCCCG ACGCCGCGGC GCATTTCGTC ATCGCCTCGG TGGTCGGCGC GCCGGCGTCG
ATCCTGATCA GCCTGATCAT GGTGCCGGAG ACCGCCGCGC AGCGCACCGG CGGCCTCGCC
GTCGATCCTT CAGGGCTGGC GTCGAGCACC ATGGACGCGG TGGTCAAGGG CACCAGCGCC
GGGCTCGAAC TGTTGCTCAA CATCGTCGCG ATGCTGATCG TGCTGGTGGC GCTGGTCTAT
CTGGTCAATG CCGGGCTCGG GCTGTTGCCG GCGTTCGGCG GCGAAGCGGT GTCGCTGCAG
CGGCTGCTCG GCTACGCGAT GGCGCCGGTG TGCTGGCTGC TCGGCCTGCC CTGGGACCAG
GCGGTCACTG CGGGATCGCT GATGGGCATC AAGACCGTGC TCAACGAACT GATTGCCTAT
GTCGAGTTCG CCAAGCTGCC GCCCGAGGCG CTGGATGCCC GCTCGCGGCT GATCATGCTC
TATGCGATGT GCGGCTTCGC CAATTTCGGC AGCCTCGGCA TCATGATCGG CGGGCTCGGC
ACCATGGCGC CGCAGCGCCG CGAAGAGATC GCCGCGCTGG GGCTGCGATC GATCGTGTCG
GGCACGCTGA CCACCTGTTT GATCGGGGCG ATTGTGGGGG TGATGACTTA G
 
Protein sequence
MLQLQSAFGV FALLMLAWTF GENRGAVSLK PVVIGLAASL LTAVTLLKLP IVAHGFGVIN 
DAVGVISAAS RAGTSFVFGY LGGGPLPFDP KAPGAGFILA FQALPIVLVM SVLTTLLFYW
KILPPVVRGL AWALQRTLGV GGAVGLSTAA NVFLGMVEAP LFIRPYLAQL TRAELFLVMT
GGMAGIAGTV LVLYATLLAP LIPDAAAHFV IASVVGAPAS ILISLIMVPE TAAQRTGGLA
VDPSGLASST MDAVVKGTSA GLELLLNIVA MLIVLVALVY LVNAGLGLLP AFGGEAVSLQ
RLLGYAMAPV CWLLGLPWDQ AVTAGSLMGI KTVLNELIAY VEFAKLPPEA LDARSRLIML
YAMCGFANFG SLGIMIGGLG TMAPQRREEI AALGLRSIVS GTLTTCLIGA IVGVMT