Gene RPB_2056 details

Gene Information

Locus tag: RPB_2056
Symbol:
ID: 3909871
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2338230
End bp: 2339147
Gene Length: 918 bp
Protein Length: 305 aa
Translation table: 11
GC content: 68%
IMG OID: 637883949
Product: ABC transporter related
Protein accession: YP_485674
Protein GI: 86749178
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1116] ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component
TIGRFAM ID: [TIGR01184] nitrate transport ATP-binding subunits C and D
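
The coordinate and length fields above can be cross-checked with simple arithmetic. The following Python sketch assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2338230, 2339147)
print(length, protein_length(length))  # prints: 918 305
```

Under these assumptions the result agrees with the listed gene length of 918 bp and protein length of 305 aa.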


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGACA AGTTCATTTC GATCGAGGGC ATCGCCAAGC GCTATCCCGG CGCGGCCGGC 
GCCGGCGACA CCACGATCTT CGAGAACCTC TGGTTGTCGC TGCCGCGCGG CGAGTTCGGC
TGCGTCATCG GCCATTCCGG CTGCGGCAAG ACCACGGTGC TCAACATCCT CGCCGGGCTC
GACGCGCCCA GCGAAGGCGC GGTGATCGTC GACGGCCAGG CGATCGAGGG CACCAGCCTC
GACCGCGCGG TGATCTTCCA GAGCCACGCG CTGCTGCCGT GGCGCACGGT GATGGGCAAC
GTCGCCTATG CGGTGAGTTC GAAATGGCGC AAATGGGACA AGGCGCGCGT CCGCGCCCAC
GCCCAGCAAT TCATCGACCT CGTCGGCCTG ACCGGTTCGG AGCACAAGCG GCCCTCGGAA
CTGTCCGGCG GCATGAAACA GCGCGTCGGC ATCGCCCGCG CGCTGAGCAT CACGCCGAAG
ATCATGCTGA TGGACGAGCC GTTCTCGGCG CTCGACGCGC TGACCCGCGG CTCGCTGCAG
GACGAGGTCC GCCGGATCTG TCTGGAGACC GGCCAGACCA CCTTCATGAT CACCCACGAC
GTCGACGAGG CGATGTATCT CGCCGATAAA ATCTTCCTCA TGACCAACGG CCCCGGCGCC
GTGGTGGCGG AGATCGTCGA GAACCCGCTG CCGAAGGATC GCGCAAGGAT CGATCTGCAC
CGGCATCCTT ATTACTACGC GCTGCGCAAC CACATCGTCG ACTTCCTGGT GACGCGCAGC
AAGACCTTCG CCGCCGCCAA TCCGAACCAC GATCCGCTCG CCGTGCCGGT GGTGCGCCCC
GGCCTCGGCG AACCCGCTCT GGTGCCGGCC GCGAACGGCG CCGGCGCGTC GGCTCCGGCG
CAGCTCCGCG CGCGCTGA
 
Protein sequence
MIDKFISIEG IAKRYPGAAG AGDTTIFENL WLSLPRGEFG CVIGHSGCGK TTVLNILAGL 
DAPSEGAVIV DGQAIEGTSL DRAVIFQSHA LLPWRTVMGN VAYAVSSKWR KWDKARVRAH
AQQFIDLVGL TGSEHKRPSE LSGGMKQRVG IARALSITPK IMLMDEPFSA LDALTRGSLQ
DEVRRICLET GQTTFMITHD VDEAMYLADK IFLMTNGPGA VVAEIVENPL PKDRARIDLH
RHPYYYALRN HIVDFLVTRS KTFAAANPNH DPLAVPVVRP GLGEPALVPA ANGAGASAPA
QLRAR
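
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The Python sketch below is one possible way to strip the grouping, compute a GC fraction, and translate the gene into protein. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; it ignores alternative start codons, which should not matter here because the gene begins with ATG. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the code assumes the input contains only A, C, G and T.

```python
# Codon table built from the conventional TCAG ordering.
# Amino acid assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
print(translate("ATGATCGACA AGTTCATTTC GATCGAGGGC"))  # prints: MIDKFISIEG
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is a quick consistency check on the record, and `gc_fraction` can be compared against the listed GC content in the same way.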