Gene RPB_1760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1760 
Symbol 
ID3909747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2013046 
End bp2014350 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content64% 
IMG OID637883654 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_485379 
Protein GI86748883 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.135938 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0152873 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAGCA AGGACCGCAC CGGAAGCAGA GTGAGCCGCC GCCAACTCCT GAAGGCCGGT 
GGCAGCGCAG CCGCATTGCT CGCTGCAGCG AAGCTGAACT TGCCCGGCGG CGCCTTCGCG
CAGGAGGGCG GCCCCGAGGT GAAGGGCGCC AAGCTCGGCT TCATCGCGCT GACCGACGCC
TCGCCGCTGT TCGTCGCCAA GGAGAAGGGC ATCTTCGCCA AATACGGCAT GCCCGACGTC
GAAGTGCTGA AGCAGGCGTC GTGGGGCACC ACGCGCGACA ACCTCGTGCT CGGCTCCGAA
GGCAACGGCA TCGACGGCGC GCACATCCTC ACGCCGATGC CCTATCTGAT CAGCGCCGGC
AAGGTGACGC AGAACAACGT GCCGACGCCG ATGTACATAC TGGCGCGGCT CAATCTGAAC
GGCCAGTGCA TCTCGGTCGC CAAGGAATAT GCCGACATCA AGGTCGGTCT CGACACCGCG
CCGTTCAAAG TCGCGCTGGA GAAGAAGAAG GCCTCCGGCA AGGCGATCAA GGCGGCGATG
ACCTTCCCCG GCGGCACCCA CGATCTGTGG ATCCGCTACT GGCTCGCGGC CGGCGGCATC
GATCCCGACA AGGACATCGA AACCATCGTG GTGCCGCCGC CGCAGATGGT GGCCAACATG
AAGGTCGGCA CGATGGACTG CTTCTGCGTC TGCGAGCCGT GGAATCTGCA GCTGATCCAC
CAGAACATCG GCTACACCGC GATCACCACC GGCGAACTCT GGAACAAGCA TCCGGAAAAA
TCCTTCGGCA TGCGCGCCGC CTGGGTCGAC AAGTACCCGA AGGCCGCCAA GGCGCTGCTG
ATGGCGGTGC TGGAAGCGCA GCAATGGTGC GACAGGCCGG AGAACCGCGA CGAGGTCGCG
GCGATCTGCG CCAAGCGGCA GTGGATCAAC TGCCCGGTCG ACGATGTCAC CGACCGGGTC
AAGGGCAAGT TCGACTACGG CACCGGCAAG GTGGTCGAGA ACTCGCCGCA CATCATGAAG
TTCTGGGACG ACTTCGCCTC CTACCCGTAT CAGAGCCACG ATCTGTGGTT CATGACCGAG
GACATCCGCT GGGGCAAGTA CGAACCGGGC TTCGACAGCA AGGCGCTGAT CGCCAAGGTC
AACCGCGAGG ATTTGTGGAA GGACGCCGCC AAGGCGCTGG GCGTCAGCGC GCTGCCGGCC
TCGACCTCGC GCGGCCAGGA GACGTTCTTC GACGGCAAGG TGTTCGACCC GGCAGATCCC
GCCGCGTACC TGAAGTCGCT GTCGATCAAG CGCGTCGATG CGTAA
 
Protein sequence
MISKDRTGSR VSRRQLLKAG GSAAALLAAA KLNLPGGAFA QEGGPEVKGA KLGFIALTDA 
SPLFVAKEKG IFAKYGMPDV EVLKQASWGT TRDNLVLGSE GNGIDGAHIL TPMPYLISAG
KVTQNNVPTP MYILARLNLN GQCISVAKEY ADIKVGLDTA PFKVALEKKK ASGKAIKAAM
TFPGGTHDLW IRYWLAAGGI DPDKDIETIV VPPPQMVANM KVGTMDCFCV CEPWNLQLIH
QNIGYTAITT GELWNKHPEK SFGMRAAWVD KYPKAAKALL MAVLEAQQWC DRPENRDEVA
AICAKRQWIN CPVDDVTDRV KGKFDYGTGK VVENSPHIMK FWDDFASYPY QSHDLWFMTE
DIRWGKYEPG FDSKALIAKV NREDLWKDAA KALGVSALPA STSRGQETFF DGKVFDPADP
AAYLKSLSIK RVDA