Gene RPD_2722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2722 
Symbol 
ID4023220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3039404 
End bp3040423 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content64% 
IMG OID637962921 
ProductTRAP dicarboxylate transporter, DctP subunit 
Protein accessionYP_569852 
Protein GI91977193 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.194387 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTTT CACGCAGGAC GTTGTTGAAG GCATCGGCGG CAGCGGCTGC GCTCGGTGGT 
ATCGGAATGC CGACTGTCGC GCGCGCGGCG GAGGCCGAGT TCGTCTACAA GTATGCCAAC
AACCTGCCCG ACACCCATCC GCTGAATGTC CGTGCGCGCG AGATGTCGGC GGCGATCAAG
GCCGAGACCA ACGGCCGGTT CGACCTTCAG ATCTTCCCGA ACAATCAGCT CGGTTCCGAC
ACCGACATGC TGAGCCAGAT CCGCTCCGGC GGCGTCGAGT TCTTCACGCT GTCCGGCCTG
ATCCTGTCGA CCCTGGTGCC GGCGGCCTCG ATCAACGGCA TCGGCTTCGC CTTCCCGGAT
TACGACACGG TCTGGAAGGC GATGGACGGC GAACTCGGCG GCCATGTCCG TGGCGAGATC
ACCAAGGCCG GCCTCGTGGT GATGGACAAG ATCTGGGACA ACGGCTTCCG CCAGACCACC
TCCTCGACCC GTCCGATCAA CGGCCCGGAA GATTTCAAGG GTTTCAAGAT CCGGGTGCCG
GTTTCGCCGC TATGGACCTC GATGTTCAAG GCGTTCGACG CCTCGCCGGC CTCGATCAAT
TTCAGCGAAG TCTATTCGGC GCTGCAGACC AAGGTGGTCG AGGGCCAGGA GAATCCGCTG
GCGCTGATTT CCACCGCCAA GCTCTATGAA GTGCAGAAGT ACTGCTCGCT GACCAACCAT
ATGTGGGACG GCTTCTGGTT CCTGGCGAAC CGCCGGTCCT GGGAGCGGCT GCCGGTCGAC
GTCCGGGAGA TCGTCGCCAG GAACATCAAC GCCGCCGCGG TCAAGGAGCG CGAGGACACG
GCCAAGCTGA ATGCCACGGT GCGCGAGGAA CTCGCCGGCA AGGGCCTGAT CTTCAATCAG
CCGACGGTGA CGCCGTTCCG CGACAAGCTG CGGTCGGCCG GGTTCTACGC CGAGTGGAAG
GGCAAATACG GCGACCAGGC GTGGTCGCTG CTCGAGAAGT CCGTCGGCAA GCTCGCGTAA
 
Protein sequence
MSVSRRTLLK ASAAAAALGG IGMPTVARAA EAEFVYKYAN NLPDTHPLNV RAREMSAAIK 
AETNGRFDLQ IFPNNQLGSD TDMLSQIRSG GVEFFTLSGL ILSTLVPAAS INGIGFAFPD
YDTVWKAMDG ELGGHVRGEI TKAGLVVMDK IWDNGFRQTT SSTRPINGPE DFKGFKIRVP
VSPLWTSMFK AFDASPASIN FSEVYSALQT KVVEGQENPL ALISTAKLYE VQKYCSLTNH
MWDGFWFLAN RRSWERLPVD VREIVARNIN AAAVKEREDT AKLNATVREE LAGKGLIFNQ
PTVTPFRDKL RSAGFYAEWK GKYGDQAWSL LEKSVGKLA