Gene RPB_2686 details


Gene Information

Locus tag: RPB_2686
Symbol:
ID: 3910479
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3069926
End bp: 3070945
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 65%
IMG OID: 637884586
Product: TRAP dicarboxylate transporter, DctP subunit
Protein accession: YP_486299
Protein GI: 86749803
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID: [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family
            [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
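
The start, end, gene length, and protein length fields above agree with one another if the coordinates are read as 1-based and inclusive and the final codon is taken to be an untranslated stop. Both readings are assumptions, since the record does not state its conventions. A minimal sketch of that arithmetic:

```python
# Values copied from the record above.
start_bp = 3069926
end_bp = 3070945

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1020 339
```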


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGTTT CGCGCAGGAC ATTGTTGCAG GCATCGGCGG CGGCGGTCGC GCTCGGCGGC 
ATCGGGGCGC CGTTCGTGGC GCGCGCGGCG GAGGCCGAGT TCGTCTACAA ATACGCCAAC
AACCTGCCCG ACACCCATCC GATGAACATC CGCGCCCGCG AGATGGCGGC GGCGATCAAG
GCCGAGACCA ATGGCCGGGT CCAGATCGAC ATTTTCCCGA GCAACCAGCT CGGCTCCGAC
ACCGACATGC TGAGCCAGAT CCGCTCCGGC GGCGTCGAGT TCTTCACACT GTCCGGCCTG
ATCCTGTCGA CGCTGGTGCC GGCGGCCTCG ATCAACGGTA TCGGCTTCGC ATTCCCGGAC
TACGACACGG TCTGGAAAGC GATGGACGGC GAGCTCGGCG GCTATGTGCG CGGCGAGATC
GGCAAGGCCG GGCTGGTGGT GATGGACAAG ATCTGGGACA ACGGCTTCCG CCAGACCACG
ACCTCGACCC GGCCGATCAC CGGCCCGGAC GACTTCAAGG GCCTCAAGAT CCGCGTGCCG
GTGTCGCCGC TGTGGACCTC GATGTTCAAG GCGTTCGACG CCTCGCCCGC CTCGATCAAT
TTCAGCGAGG TCTATTCGGC GCTGCAGACC AAGGTCGTCG AAGGCCAGGA GAACCCGCTG
GCGATCATTT CGACCGCGAA GCTCTATGAA GTGCAGAAAT ACTGCTCGCT GACCAATCAT
ATGTGGGACG GCTTCTGGTT CCTCGCCAAC CGCCGCGCCT GGGAACGGCT GCCAGCCGAT
CTGCGCGACA TCGTCGCCAG GAACATCAAC GCCGCCGGCG TCAATCAGCG CGCCGACGTC
GCCAAGCTCA ACGCCGGCCT GAAGGACGAA CTCGCCACCA AGGGCCTGAC CTTCAACCAG
CCGACCATCG GGCCGTTCCG CGACAAGCTG CGCGCCGCCG GCTTCTACGC CGAATGGAAA
GGCAAATACG GCGAGCAGGC CTGGTCGCTG CTGGAGAAAT CCGTCGGCAA GCTCGCCTGA
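
The gene sequence is printed in space-separated blocks of ten bases, so the spaces have to be stripped before computing anything from it. The sketch below shows one way to derive a GC percentage comparable to the "GC content" field. The helper name `gc_percent` is hypothetical and not part of the source database.

```python
def gc_percent(grouped_sequence: str) -> float:
    """Return the GC content (percent) of a sequence that may contain whitespace."""
    # Remove the spaces and line breaks used for visual grouping.
    bases = "".join(grouped_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGAGCGTTT CGCGCAGGAC ATTGTTGCAG GCATCGGCGG CGGCGGTCGC GCTCGGCGGC"
print(round(gc_percent(first_line), 1))
```

Passing the full 1020 bp sequence in place of `first_line` gives the value for the whole gene.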
 
Protein sequence
MSVSRRTLLQ ASAAAVALGG IGAPFVARAA EAEFVYKYAN NLPDTHPMNI RAREMAAAIK 
AETNGRVQID IFPSNQLGSD TDMLSQIRSG GVEFFTLSGL ILSTLVPAAS INGIGFAFPD
YDTVWKAMDG ELGGYVRGEI GKAGLVVMDK IWDNGFRQTT TSTRPITGPD DFKGLKIRVP
VSPLWTSMFK AFDASPASIN FSEVYSALQT KVVEGQENPL AIISTAKLYE VQKYCSLTNH
MWDGFWFLAN RRAWERLPAD LRDIVARNIN AAGVNQRADV AKLNAGLKDE LATKGLTFNQ
PTIGPFRDKL RAAGFYAEWK GKYGEQAWSL LEKSVGKLA