Gene RPD_2625 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2625 
Symbol 
ID4023122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2943144 
End bp2944976 
Gene Length1833 bp 
Protein Length610 aa 
Translation table11 
GC content65% 
IMG OID637962823 
Productextracellular solute-binding protein 
Protein accessionYP_569755 
Protein GI91977096 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.15211 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000486729 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCCCTGC GCGCGGGCGC GCTGGCGGCG GTGATGGTCG TTGGCGTCGT CGCTTTGTTC 
AGCGGCGTCG CGCAGGCCGG AGCAGAGGAC GCGGCGAAAC CGTCGCATGC GCTGGCGATG
CACGGCGAGC CTGCTCTGCC TGCCGACTTC ACCGCGATGC CCTATGTCAA CCCCGATGCG
CCGAAAGGCG GTCGTCTGGT GGAAGGTCTG CTCGGCACCT TCGACAGCCT CAATCCGTTC
ATCGTCAGGG GCATCGCGGT GCAGAGGATG CGCGGCTACG TCGTCGAGAG CCTGCTGGCG
CGCGGCAACG ACGAAGCGTT CACGCTTTAC GGCCTGCTGG CGCAATCGGT CGAGACCGAC
GATGCGCGCA GCTACGTCAC CTTTCGCATC GATCCGCGCG CGCGGTTCTC GGACGGCAAG
CCGGTGCTGG CGCAAGACGT GCTGTTCTCC TGGCAATTGC TGCGCGACAA GGGCCGCCCC
AATCATCGCA TTTACTACGC CAAGGTCGCG CGCGCCGAGG CGCCCGATCC GCGCACGGTG
CGGTTCGATT TCGGCGACGT GAACGATCGG GAACTGCCAC TGATCCTCGG CCTGATGCCG
ATCTTTCCCA AACACGCGGT CAACCCCGAC ACCTTCGAGG AGACGACGCT GGCGCCGCCG
ATCGGGTCAG GTCCGTACCG TGTCGGCGCG GTGAAGGCCG GCGCCAGCGT CACGCTGATC
CGGAACCCCG ACTATTGGGG GCGCGATCTT CCGATCAATC GCGGACTATG GAACTTCGAC
GAGATCCGGA TCGATTATTT CCGTGAGGCT AACTCACATT TCGAAGCCTT CAAACGCGAG
CTGTATGACT ACCGCGTCGA AAACGAGCCG CTGCGCTGGC ATGACGGCTA TGACTTTCCG
GCGGCCCGCA ACGGCGACGT GATCCGCGAC GCCTTCAAAA TCCGCATGCC GCAGCCGACC
GAATTTCTGG TGTTCAACAC CCGCCGTCCG GTATTCGCCG ATATCCGGGT CCGCGAAGCG
CTGTTGCAAT TGTTCGATTT CGCATGGATC AACCGCAACT ACTTTTTCGG CCTGTATGCG
CGCGCAGGTG GCTTCTTCGC GGGTTCGGAG CTCTCCGCCT ACACCCGCCC GGCGGAAGCC
GGCGAACTTC AACTGCTGAA GCCCTATCTG GCGCGACTGC GCGCCGACGT CATCGACGGC
AGCTACCGCC TGCCCGTCAG CGACGCCTCC GGCCGCGATC GCGCCACGCT CGGCCGGGCG
CTGTCGCTGC TGGCGGAGGC CGGCTATCAG CTCGACGGCA CGGTGCTGCG GCGGCGCGAC
AATCACCAGC CGCTGACCTT CGAAATCCTG GTCACCACGC GCGATCAGGA GCGCATCGCG
CTGGCCTTCG CCCGCGACGT CAAGCGTGTC GGCATCCAAA CCTCGGTCCG CGTGGTGGAC
GCGGTGCAGT TCGATCAGCG GCGGATCTCT TACGACTTCG ACATGATCCC CAACCGCTGG
GACCATTCGC TGTCGCCGGG CAATGAGCAA TCGTTCTATT GGGGTGCGGA AGCGGCCGAC
ACCCAGGGCA CCCGCAACTA CATGGGCGCG AAGGATCCAG CGATCGACGC CATGATCGCG
GCCATGATCG CGGCGCGCGG GCATCCGGAA TTCGTCGATG CGGTGCGGGC GCTCGATCGC
GTCCTGACCT CGGGCTTCTA CGTGATCCCG CTCTACAACA TCCAGGAACA ATGGATCGCG
CGTTGGAATC GGATAGAACG GCCGAAAGCG AACGCACTGA CCGGCTACCT GCCCGAGACC
TGGTGGGCCC GGCCACCGAC GCAGCAAAGG TGA
 
Protein sequence
MALRAGALAA VMVVGVVALF SGVAQAGAED AAKPSHALAM HGEPALPADF TAMPYVNPDA 
PKGGRLVEGL LGTFDSLNPF IVRGIAVQRM RGYVVESLLA RGNDEAFTLY GLLAQSVETD
DARSYVTFRI DPRARFSDGK PVLAQDVLFS WQLLRDKGRP NHRIYYAKVA RAEAPDPRTV
RFDFGDVNDR ELPLILGLMP IFPKHAVNPD TFEETTLAPP IGSGPYRVGA VKAGASVTLI
RNPDYWGRDL PINRGLWNFD EIRIDYFREA NSHFEAFKRE LYDYRVENEP LRWHDGYDFP
AARNGDVIRD AFKIRMPQPT EFLVFNTRRP VFADIRVREA LLQLFDFAWI NRNYFFGLYA
RAGGFFAGSE LSAYTRPAEA GELQLLKPYL ARLRADVIDG SYRLPVSDAS GRDRATLGRA
LSLLAEAGYQ LDGTVLRRRD NHQPLTFEIL VTTRDQERIA LAFARDVKRV GIQTSVRVVD
AVQFDQRRIS YDFDMIPNRW DHSLSPGNEQ SFYWGAEAAD TQGTRNYMGA KDPAIDAMIA
AMIAARGHPE FVDAVRALDR VLTSGFYVIP LYNIQEQWIA RWNRIERPKA NALTGYLPET
WWARPPTQQR