Gene RPC_0404 details

Gene Information

Locus tag: RPC_0404
Symbol:
ID: 3970856
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 437561
End bp: 438877
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 63%
IMG OID: 637923519
Product: extracellular solute-binding protein
Protein accession: YP_530298
Protein GI: 90421928
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
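
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one terminal stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 437561, End bp 438877.
length = gene_length_bp(437561, 438877)
print(length, protein_length_aa(length))
```

With the coordinates listed here, this reproduces the stated Gene Length (1317 bp) and Protein Length (438 aa).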


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACTTC GACATGTCGG CCTGGCGGCG ACGCTTTCGC TTGCGTTGGG ACTCGCTTCG 
CCTGCGCTTG CTGCGACCGA GATCCAGTGG TGGCACGCCA TGACCGGCGC CAACAACGAC
GTCATCGTCA AGCTCGCCGA AGAGTTCAAC GCCGCGCAGA CCGACTACAA GGTGGTGCCG
TCCTATAAGG GCAGCTATCC CGACACCATG AACGCCGGCA TCGCGGCGTT CCGCGCCGGC
AACGCGCCGC ATATCATCCA GGTGTTCGAG GTCGGCACCG CCACCATGAT GGCAGCGACC
GGGGCGGTGA AGCCGGTCTA CAAGTTGATG GCGGAGGCCG GAGAGAAATT CGACTCGCAG
GCCTATCTGC CGGCGATCAC CGGCTACTAC TCGACCTCGA AGGGTGAGAT GCTGTCGTTC
CCGTTCAACT CGTCCTCGAT GGTGATGTGG ATCAACAAGG ACGCCTTGAA GAAGGCCAAC
ATCGCCGAGA TCCCGAAGAC CTGGCCGGAG GTGTTCGAAG ACGCCAAGAA ATTGAAGGCG
GCGGGCTACG CCACCTGCGG CTTCTCCACC GCCTGGGTGA CCTGGGCCAA TCTCGAGCAA
TTGTCGGCCT GGCACAACGT GCCGCTGGCG AGCCGGGCCA ACGGCCTCGA CGGCTTCGAC
ACCAAGCTCG AATTCAACGG CCCGCTGCAG ATCAAGCATC TGGAGACGCT GATCGCGCTG
CAGAAGGACA AGACCTACGA TTATTCCGGC CGCACCAACA CTGGAGAAGG CCGTTTCACC
TCCGGCGAAT GCCCGATCTT CCTGAGTTCC TCGGGCTTCT TCGGCCAGGT CAAAGGCAAC
GCCAAGTTCG ATTGGACCAA CGCGCCGATG CCGTATTATC CGGACGTTCA AGGCGCGCCG
CAGAACTCGA TCATCGGCGG CGCCTCGCTG TGGGTGATGG GCGGCAAGTC GCCGGCGGAA
TACAAGGGCG TCGCCAAGTT CCTCAGCTTC CTGTCCGACA CCGACCGTCA GGTCGCGATC
CACAAGGCCT CTGGCTATCT GCCGATCACC AAGGCGGCCT ACGCCAAGGC CCAGGAGGAA
GGCTTTTACG TCAACGCGCC GTATCTGGAG ACGCCGCTCA GGGAATTGAC CAACAAACCG
CCGACCGAAA ACTCCCGCGG ACTGCGGCTC GGCAACATGG TGCAGCTGCG CGACATCTGG
GCGGAAGAAA TCGAATCCGC GCTGGCCGGC AAGAAGACCG CCAAGGACGC GCTCGACACC
GCAGTGACCC GCGGCAACGC CATGCTGCGG CAGTTCGAAC GCACGGTGAG CAAGTAG
 
Protein sequence
MALRHVGLAA TLSLALGLAS PALAATEIQW WHAMTGANND VIVKLAEEFN AAQTDYKVVP 
SYKGSYPDTM NAGIAAFRAG NAPHIIQVFE VGTATMMAAT GAVKPVYKLM AEAGEKFDSQ
AYLPAITGYY STSKGEMLSF PFNSSSMVMW INKDALKKAN IAEIPKTWPE VFEDAKKLKA
AGYATCGFST AWVTWANLEQ LSAWHNVPLA SRANGLDGFD TKLEFNGPLQ IKHLETLIAL
QKDKTYDYSG RTNTGEGRFT SGECPIFLSS SGFFGQVKGN AKFDWTNAPM PYYPDVQGAP
QNSIIGGASL WVMGGKSPAE YKGVAKFLSF LSDTDRQVAI HKASGYLPIT KAAYAKAQEE
GFYVNAPYLE TPLRELTNKP PTENSRGLRL GNMVQLRDIW AEEIESALAG KKTAKDALDT
AVTRGNAMLR QFERTVSK
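
The sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before they can be analysed. The sketch below shows one way to do that, compute a GC percentage, and translate a coding sequence. It assumes that translation table 11 assigns amino acids to codons in the same way as the standard code (the tables differ in which start codons are permitted), and every helper name here is hypothetical rather than taken from any library.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments assumed identical
# to the standard code, as used by translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGGCACTTC GACATGTCGG CCTGGCGGCG"))  # MALRHVGLAA
```

Applied to the first thirty bases of the gene sequence, the translation matches the first ten residues of the listed protein sequence.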