Gene RPC_1969 details

Gene Information

Locus tag: RPC_1969
Symbol:
ID: 3973642
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 2143102
End bp: 2144091
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 66%
IMG OID: 637925080
Product: periplasmic binding protein
Protein accession: YP_531845
Protein GI: 90423475
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4558] ABC-type hemin transport system, periplasmic component
TIGRFAM ID:
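
The coordinate and length fields above are related by simple arithmetic, which can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2143102, 2144091)
print(length)                  # 990
print(protein_length(length))  # 329
```

Both values agree with the Gene Length and Protein Length fields of this record.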


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGCC GCGCTTTCAA ACCGTTTTCA TGGTCCGCGG GACGCCACTT CGCGTTCGTG 
GCGACGATGA TCTGCGGCGT GGCCGCGCCC CTCGCGGCCG TCCACGCCGG CGGCGTCGTG
GTCCGCGACG CGCGAGATCG CGATGTGGAG ATCGAAGATC CCTCGCGCAT CATCGCCATC
GGAGGTTCGA TCACCGAGGT TCTGTTCGCG CTCGGTCTTG ACGGCCGGAT CGCGGGAGTG
GATTCCACCA GCCTGTATCC GCCGACTGCG CTGCAAGAAA AGCCCAATGT CGGCTATCTG
CGGCAGCTGT CCCCGGAAGG CGTGATCGGG TTGAACCCGA CGCTGATCCT CGCCATGCAG
GGCGCAGGCC CGAAGGAAAC CATGCAGGTG ATCGAAGCCG CACGAATTCC GCTGGTCGTG
GTTCCGGAGG ATTTCTCCGA GCAAGGCCTG CTCGACAAGA TCAGCCTGGT CGGCCACGCC
ATGGGCGCCG ATCGCGGCGC CGCCTGCCTC ACCGCCGCGG TGTCCGGCGA TCTGGCGAAA
CTGCGCGAGC TGCGCGCCAG GGTGACGAAG CCGGTGCGGG TCATGTTCGT GATGGCGCTG
GTCAATGGCC AGGCGATGGC CGCCGGCCAC AACACCGCGG CCGACGAGAT CATCAAGCTG
GCCGGCGGCA TCAATGCGGT CGACGGCTAT GACGGCTACA AGCTGATCAA CGACGAAGCC
ATCGTCGCGT TGCGGCCGGA GGTAGTGCTG TCGATCCAGC GCAGCAAGGA TTCGCTCGAG
GCCGAGGCGA TCTACCATCA TCCCGCCTTC GCGCTGACAC CGGTGGCCGC GAACAAGGCC
TTCATCTCGA TGGAGGGCCT CTATCTGCTG GGCTTCGGCC CGCGCACCGC GGCTGCCGCC
CGTGACGTCG CGGCGAAACT ATATCCGGAG CTCGCGGACG AAGCTGCGAA ATTCCAATCC
GCGGCGTTGA CGGCGAACTG TCGCCAATGA
 
Protein sequence
MSRRAFKPFS WSAGRHFAFV ATMICGVAAP LAAVHAGGVV VRDARDRDVE IEDPSRIIAI 
GGSITEVLFA LGLDGRIAGV DSTSLYPPTA LQEKPNVGYL RQLSPEGVIG LNPTLILAMQ
GAGPKETMQV IEAARIPLVV VPEDFSEQGL LDKISLVGHA MGADRGAACL TAAVSGDLAK
LRELRARVTK PVRVMFVMAL VNGQAMAAGH NTAADEIIKL AGGINAVDGY DGYKLINDEA
IVALRPEVVL SIQRSKDSLE AEAIYHHPAF ALTPVAANKA FISMEGLYLL GFGPRTAAAA
RDVAAKLYPE LADEAAKFQS AALTANCRQ
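
The relationship between the gene sequence and the protein sequence can be checked with a short script. The sketch below is illustrative only: it builds the codon table from the amino-acid string that NCBI lists for translation table 11 (identical to the standard code for amino-acid assignments; alternative start codons are not handled here), and the helper names `clean`, `translate`, and `gc_content` are hypothetical. Whitespace in the 10-base blocks is stripped before processing.

```python
from itertools import product

# Codon order T, C, A, G at each position, matching the NCBI table layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCCGCC GCGCTTTCAA ACCGTTTTCA"))  # MSRRAFKPFS
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows the result to be compared against the protein sequence and the GC content field of this record.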