Gene RPC_3646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3646 
Symbol 
ID3972017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4055091 
End bp4056416 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content63% 
IMG OID637926755 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_533500 
Protein GI90425130 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.17089 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.182494 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGGAT TTACCCCGGA TCGCCGAACC TTGCTGAAGG GCAGCGCGCT GACGCTGGCC 
GCCGCGGCGA CGATGTCGGC CGAGCAATTG CTGGGCTATG CCAAGGCCTG GGCGCAGTCC
GCGCCGTGGA AGCCGGAAGC CGGCGCCAAG ATCAACCTGC TGCGCTGGAA GCGCTTCGTC
GAAGCCGAGG ACGTCGCCTT CATGAAGATC GTCGAGGCGT TCCAAAAGGC CACCGGCTGC
GCCGTCAGCG TCTCCAACGA ATCCTATGAC GACATTCAGC CCAAGGCCTC GGTGGCGGCG
AACACCGGGC AGGGACTCGA CATGGTGTGG GGGTTGTATT CGCTGCCGCA TTTGTTGGGT
AACAAGGTCA CCGACGTCGC CGACGTCGCG AATTATCTCG GTGGCAAATA CGGCGGCTGG
ACCAAGTCGG CCGAGGATTA CTGCAAAGTC GGCAACAAAT GGGTCGGCGT GCCGATCGCC
ACCACCGGCG CGCTGATCAA CTACCGCATC GCCGCCTGCG AAAAGGCCGG CTTCAAGGAA
TTTCCGAAGG ACACCGCGGG CTTCTTGGAA TTGTGCAAGG GGCTGCAGAA GAACGGCACC
CCGGCCGGCA TGGCGCTCGG CCACGCCTCG GGCGACGCCA ACACCTGGCT GTATTGGGCG
CTGTGGACGT TCGGCGGCAA TCTGGTCGAC GCCAACAACA AGGTGGTGAT CAACTCGCCG
GAAACCGCGG CCTCGCTGGA ATATATCAAG CAGCTCTACG GCACGTTCAT CCCCGGCACG
GTGTCGTGGA ACGATTCCTC CAACAACAAG GCGTTCCTCG GCGGGCAGTT GCACCTCACC
GTGAACGGCA TTTCGATCTA CGTCACCGCG AAACGCGAGG CGCCGGCGAT CGCCGAGGAC
ATGAACCACG CCTATATGCC GATCGGCCCC TACGGCAAGC CGAGCGAAAT GCATCTGGCG
TTCCCGATGC TGATCTTCAA TTTCACCAAG TATCCGCAGG CCTGCAAGGC GTTCACCGCC
TTCATGCTGG AAGCGCCGCA GTTCAATCCG TGGATCGAGG CGGCGCAGGG CTATCTGTCG
CACTTCCTCA ACGCCTACGA CGCCAACCCG ATCTGGACCG CCGACCCCAA GACCACACCG
TATCGCGACG TCGCCAAGCG CGCCCGCACG CCCGCCGGGC TCGGCACGCT CGGCGAGAAC
GCGGCGTCGG CGATCGCCGA CTTCATCCTG GTCGACATGT TTGCGAACTA TTGCACCGGC
CGCGAAGACG TGAAGGGCTC GATCGCCTCG GCGGAACGGC AATTGAAGCG GATCTATCGG
GCGTGA
 
Protein sequence
MTGFTPDRRT LLKGSALTLA AAATMSAEQL LGYAKAWAQS APWKPEAGAK INLLRWKRFV 
EAEDVAFMKI VEAFQKATGC AVSVSNESYD DIQPKASVAA NTGQGLDMVW GLYSLPHLLG
NKVTDVADVA NYLGGKYGGW TKSAEDYCKV GNKWVGVPIA TTGALINYRI AACEKAGFKE
FPKDTAGFLE LCKGLQKNGT PAGMALGHAS GDANTWLYWA LWTFGGNLVD ANNKVVINSP
ETAASLEYIK QLYGTFIPGT VSWNDSSNNK AFLGGQLHLT VNGISIYVTA KREAPAIAED
MNHAYMPIGP YGKPSEMHLA FPMLIFNFTK YPQACKAFTA FMLEAPQFNP WIEAAQGYLS
HFLNAYDANP IWTADPKTTP YRDVAKRART PAGLGTLGEN AASAIADFIL VDMFANYCTG
REDVKGSIAS AERQLKRIYR A