Gene RPC_3657 details


Gene Information

Locus tag: RPC_3657
Symbol:
ID: 3972029
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4072992
End bp: 4074245
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 50%
IMG OID: 637926767
Product: periplasmic protein-like
Protein accession: YP_533511
Protein GI: 90425141
COG category: [S] Function unknown
COG ID: [COG3904] Predicted periplasmic protein
TIGRFAM ID:
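
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Values copied from the record for RPC_3657.
    length = gene_length_bp(4072992, 4074245)
    print(length)                              # 1254
    print(expected_protein_length_aa(length))  # 417
```

Under these assumptions both results agree with the Gene Length (1254 bp) and Protein Length (417 aa) fields.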


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.568365
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCATAA CGTGCGTATG TCGCGCCTCA ATGGGGTCAC ACCAAAGATT TGGATTGAAT 
GTGTTGCCAA GGCTACTGAT AGACCGATTT TCGTTGTGTT GTGTCTTGCT CACAATTGCG
ACCCCCACGT ACGGCGCGGA GAAGTTCGAT TACGACATCA AATCGCCGAT GGTATTTCTT
GCTGGTTACA ACGGCGGCAA TTGCAATGGT TGCGAATGGA TTATTGCCGA AGGAACGATT
ACTTCGGAAA CACCCGCGCT GTTTAAAGCC TACCTAGAAA AGAAGAAGCC AAATCGCGGT
GCATCCATTC AATTGAATTC TCCTGGCGGG AATCTGATAG CGGGGATTAA GCTTGGCGAA
CTGATCCGGT CTGCCGGGCT CATCACGTTG GTCGGAAAGA CCGTAGGTGA GCGGGCGTCC
TTCGATCGAG AAGCCATAAA AGATGAAGGC GAAGTAGCTC CCGAAAGCGC GAAGAGAGAG
CCCATTTGTG CCTCTGCGTG CGTATATGCA TTTGTAGGCG GTGAGACGCG GTTTGCCACT
AAGGCAAAGA TAGGAGTTCA CCAGTTCTAT GATGGCAAAT CGGCTAGCGA CCCTCTCGCA
AAGACAGCGA GCTCCATTGA TCGATCCGCA GATCAGCTGT TGGCTGGCCT CCTCCTCGAA
TATGTAATCA GGATGGGAAT TGATCCAAAG CTTATCGCAA TAGCGAGTTC AGTCCCTCCG
TGGGGCGAAA TGAAATGGCT TACGGACCAA GAACTTACCG AGTTGAAGAT CGACAATTCG
GAAGTATCAT ACACGCCCCT TTCTGTCGAA CCTTTCGGGA CGCAAGGTTC ATTTGCGGAA
ACCAGGAGCA GATCAATCTA CTATAGCTTC CACCACCGTA TTTACTGCAA GGATCGACCA
GACAGCGTTT ATCTCGCCTT CAGTTTCGAC GCAAAAGGTG GCAACGCTGA TTATGTGAAG
AGTATGTTTG AAGGAGTTCT ATCAAGCTCA TCGATTGAAC TTGGCACCAG CAAAGGCGCA
CAGTTCTTTC CCTCCAAACT GTTTGGCGTA GTTGTTACAA AAGGGGAGCA ACCAACGGTC
CAAGCCTCTA CCTTGGTCGT AGGAGCGACG ATGGCAGACT TTCAAGCTGC TGATCGCGTC
TCCATCAACA GCAACCTCTC TAAGCACGAG GGCAATATGG CGTTCTGGAT GTCGTTTCCC
ATTAAGGGCG ACCGCCGAAA GATTGGAATC GTAGCTCGAT CCTGTGTTAA ATGA
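
The GC content field can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as text in the same layout (blocks of ten bases separated by spaces and line breaks); the helper names are hypothetical, and how the source database rounds its percentage is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C, between 0.0 and 1.0."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence only, as a short illustration;
    # paste the full block to compute the value for the whole gene.
    first_line = (
        "ATGCGCATAA CGTGCGTATG TCGCGCCTCA "
        "ATGGGGTCAC ACCAAAGATT TGGATTGAAT"
    )
    print(f"{gc_fraction(first_line):.1%}")
```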
 
Protein sequence
MRITCVCRAS MGSHQRFGLN VLPRLLIDRF SLCCVLLTIA TPTYGAEKFD YDIKSPMVFL 
AGYNGGNCNG CEWIIAEGTI TSETPALFKA YLEKKKPNRG ASIQLNSPGG NLIAGIKLGE
LIRSAGLITL VGKTVGERAS FDREAIKDEG EVAPESAKRE PICASACVYA FVGGETRFAT
KAKIGVHQFY DGKSASDPLA KTASSIDRSA DQLLAGLLLE YVIRMGIDPK LIAIASSVPP
WGEMKWLTDQ ELTELKIDNS EVSYTPLSVE PFGTQGSFAE TRSRSIYYSF HHRIYCKDRP
DSVYLAFSFD AKGGNADYVK SMFEGVLSSS SIELGTSKGA QFFPSKLFGV VVTKGEQPTV
QASTLVVGAT MADFQAADRV SINSNLSKHE GNMAFWMSFP IKGDRRKIGI VARSCVK