Gene RPB_0308 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0308 
Symbol 
ID3908687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp348932 
End bp350176 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content64% 
IMG OID637882192 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_483930 
Protein GI86747434 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.263961 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCATCA ATTGCGGAAA TGACATCGTT CATGCCGATG ACGATCGCTT CCGGTTCGCG 
ATCAACCGCC GGCAGACGCT CGACATGCTC GCCGCCGGCG GCCTCGCCGC GCTCGGCACC
ATGCTGGGCG GCTTCGGCCA ATCCGCACGC GCGGCCGACG ACGACGTCGT CCGCATCGGC
TATCTGCCGA TCACCGACGC CACCGCGCTC CTGGTCGCGC ACGGCATGGG CTACTTCAAG
GACGAGGGTC TCGAGGCCGA GCGCCCGACG CTGATCCGCG GCTGGTCGCC GCTGGTGGAG
AGCTTCGCGG CCGGCAAGTT CAACCTGGTG CATCTGCTCA AGCCGATCCC GGTGTGGATG
CGCTACAACA ACAACTTCCC GGTCAAGATC ATGGCCTGGG CCCACACCAA CGGCTCCGGC
GTCGTGGTCG GCGGCGAGAG CGGCATCGCG TCGTTCAAGG ATTTCGGCGG CAAGCAGGTC
GCGGTGCCGT TCTGGTACTC GATGCACAAC ATCGTGCTGC AATACGCGCT GCGCAAATCC
GGCATCAAGC CGGTGATCAA GGGCCAGGGC GAGACGCTCG CGGCCGACGA GTGCAATCTG
CAGGTGATGG CGCCGCCCGA CATGCCGCCG GCGCTCGCGG CCAAAAAGAT CGACGCCTAC
ATCGTCGCCG AGCCGTTCAA CGCGCTGGGC GAAACCAAGG CCGGCGGCCG GATGCTGCGC
TTCACCGGCG ACATCTGGAA GAATCACCCC TGCTGCGTGC TGTGCATGAA CGAGGAGGTG
ACCAGAAAGA AGCCGGAATG GACCCAGAAG GTGATGAACG CCCTGGTCCG CGCCGAGATC
TACGCCAGCG CCAACAAGAA GGAAGTCGCC AAGCTGCTGT CGAAGGACGG CGAAGGCTAT
CTGCCGCTGC CGGCGCCGGT GATCGAGCGC GCGATGACTT ACTACGACGA CAAGACCTAT
GGCGAGAGCG GCGCCATTAC CCATCCGGAC TGGAAGCTCG GCCGGATCGA CTTCCAGCCC
TGGCCGTATC CGTCGGCGAC CAAGCTGATC GTCGGCGCGA TGAACGAGAC GGTGGTCTCG
GGCGACACCA CCTTCCTGAA GAAGCTCGAT CCGGAATTCG TCGCCAAGGA TCTGGTCGAC
TATCACTTCG TCAAGCAGGC GATGACGAAG TACCCGGACT GGAAGACATC GCCGAGCGTC
AATCCCGACG ATCCGTTCGC CCGGACCGAG GTGCTGTCGC TGTGA
 
Protein sequence
MCINCGNDIV HADDDRFRFA INRRQTLDML AAGGLAALGT MLGGFGQSAR AADDDVVRIG 
YLPITDATAL LVAHGMGYFK DEGLEAERPT LIRGWSPLVE SFAAGKFNLV HLLKPIPVWM
RYNNNFPVKI MAWAHTNGSG VVVGGESGIA SFKDFGGKQV AVPFWYSMHN IVLQYALRKS
GIKPVIKGQG ETLAADECNL QVMAPPDMPP ALAAKKIDAY IVAEPFNALG ETKAGGRMLR
FTGDIWKNHP CCVLCMNEEV TRKKPEWTQK VMNALVRAEI YASANKKEVA KLLSKDGEGY
LPLPAPVIER AMTYYDDKTY GESGAITHPD WKLGRIDFQP WPYPSATKLI VGAMNETVVS
GDTTFLKKLD PEFVAKDLVD YHFVKQAMTK YPDWKTSPSV NPDDPFARTE VLSL