Gene RPC_2525 details


Gene Information

Locus tag: RPC_2525
Symbol:
ID: 3970991
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 2735714
End bp: 2737174
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 67%
IMG OID: 637925633
Product: TRAP transporter solute receptor TAXI family protein
Protein accession: YP_532395
Protein GI: 90424025
COG category: [R] General function prediction only
COG ID: [COG2358] TRAP-type uncharacterized transport system, periplasmic component
TIGRFAM ID: [TIGR02122] TRAP transporter solute receptor, TAXI family
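
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon (consistent with an unspliced CDS). The function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2735714, 2737174)
print(length, protein_length_aa(length))  # prints: 1461 486
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields reported for RPC_2525.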


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.839724
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.154925
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTTCCA GCCTCGAACG ATTGTTTTCC AGACAGATCG TGGATGGCCC TGACCCGGTG 
GGACCCACGG TGCAATCGGT ATCGCGTTCT CGACGACAAA AAGCCGCCTT CGTGGTGCTG
GCCGCGGTGC TCGCCGTGGT CGGCGCGGTG GCCGGCGCCT ATTACTTTGC GATGCGGCCG
GAGACGCTGC GCATTGCGGT GGGACCGCAG AATTCCGACG ACGTCCGGGT GGTGCAGGCG
CTGACCCAGG CGTTCGCCCG CGACCACAAT TCGGTGCGGC TGCGCCCGGT GTTGACCGAC
GGCGCGCTCG GCAGCGCCAC CGCGCTGGCC GACGGCAAGG CCGACCTGGC GGTGATCCGC
GGCGACCTCG AAGTGCCGAA GAACGCCCAG GCGGTGGCGG TGCTGCGCAA GAACGTCGCG
GTGCTGTGGG TGCCGGCCAA GCCGAAGTCC GCGTCCAAGG CCAAGAAGGC CGCGGCCAAG
ATCAGCAAGA TCGCGCAGCT CGACGGCAAG CGGATCGGCA TCATCGGCCG CACCCAGGCC
AATGTGAACC TTTTGAAAGT GGTGCTGCAG CAATACGGCG TCGATCCGGC CAAGGTCGAC
ATCGTGCAGT TCTCCACCAG CGAGGTCGCC GAGGCGATCA AGGACCAGAA GGTCGACGCC
TTCCTGGCGG CCGGCCCGGT CAACAGCAAG ATTACCGCCG ACGCCATCGC CGCCTCGATC
CGGGAGGGCG GCGCGCCGAC TTTCCTGCCG ATCGATTCCG CCGAGGCGAT CGCGCAGAAC
CATCCGATGT ACGAGGCCGC GGAAATTCCC GCCGGGGTGT TCGGCGGCGC GCCGGCGCGG
CCGGAGGACG AGGTCAAGAC CATCAGCTTC GCGCACCACA TCGTGGCGCG GAAGGGGCTG
CCGGACCCCA CCGTCTCGGC GTTCACCCGG CAATTGTTCG CGATCCGCCA GACCGTGATG
AACGAATTCC CGCTGGCGGC GAAGATCGAG ACCCCGGATA CCGACAAGGA CGCCGCGATC
CCGGTGCATC CGGGTGCTGC GGCGTATGTT GACGGCGAGG AGAAGACCTT CCTCGACCGC
TACAGCGATT ACATCTGGTG GTCGCTGATG GGCGCCTCGG CGCTGGGCTC GATCGGCGCC
TGGTTCGCCG GCTATCTGCG CAAGGACGAG CGCACCAACA ACTCCTGGCT GCGCGAGCGG
CTGCTCGACA TGATCGCGAT GGCCAGGAAG AGCGATTCGA CCGACGAACT CGACGCCATG
CAGGCCGAGG CCGACAACAT CCTGCGCGAT ACGCTGACCT GCTTCGAGAA CGGCGCGATC
GAGGAGGGCA CCCTGACCGC GTTCAACATC GCGCTGGAGC AGTTCCACAA CGCGGTGGCC
GACCGCAAGG CGCTGCTGGC GCTGGCGCCG CAAAGCCCGC CGCGCGCCAA CGTGCAGTTG
CTGGCGGGCA GCGGCCTGTA G
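
The gene sequence is displayed in space-separated blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch of how that could be done and how a GC percentage could be computed from the result. The helper names are hypothetical, and the page does not state how its own "GC content" figure was calculated or rounded, so this is one plausible method rather than the database's.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed row of the gene sequence, used only as sample input.
first_row = "ATGGGTTCCA GCCTCGAACG ATTGTTTTCC AGACAGATCG TGGATGGCCC TGACCCGGTG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the complete gene sequence to `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content field in the Gene Information section.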
 
Protein sequence
MGSSLERLFS RQIVDGPDPV GPTVQSVSRS RRQKAAFVVL AAVLAVVGAV AGAYYFAMRP 
ETLRIAVGPQ NSDDVRVVQA LTQAFARDHN SVRLRPVLTD GALGSATALA DGKADLAVIR
GDLEVPKNAQ AVAVLRKNVA VLWVPAKPKS ASKAKKAAAK ISKIAQLDGK RIGIIGRTQA
NVNLLKVVLQ QYGVDPAKVD IVQFSTSEVA EAIKDQKVDA FLAAGPVNSK ITADAIAASI
REGGAPTFLP IDSAEAIAQN HPMYEAAEIP AGVFGGAPAR PEDEVKTISF AHHIVARKGL
PDPTVSAFTR QLFAIRQTVM NEFPLAAKIE TPDTDKDAAI PVHPGAAAYV DGEEKTFLDR
YSDYIWWSLM GASALGSIGA WFAGYLRKDE RTNNSWLRER LLDMIAMARK SDSTDELDAM
QAEADNILRD TLTCFENGAI EEGTLTAFNI ALEQFHNAVA DRKALLALAP QSPPRANVQL
LAGSGL
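
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons in frame. The sketch below is hand-written for illustration and its names are hypothetical. It relies on translation table 11 (bacterial, archaeal and plant plastid code) assigning the same amino acids to codons as the standard code, differing only in which codons may act as start codons; alternative start codons are not handled here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # tolerate block-formatted input
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first 12 bases and the last 11 bases of the gene sequence shown above.
print(translate("ATGGGTTCCA GC"))   # prints: MGSS
print(translate("GCGGCCTGTA G")[-1:])
```

The first four codons translate to MGSS, matching the start of the listed protein sequence. Translating the complete gene sequence the same way should yield a string that can be compared against the full 486-residue protein.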