Gene RPB_4434 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4434 
Symbol 
ID3912249 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp5024548 
End bp5025684 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content65% 
IMG OID637886339 
Productextracellular ligand-binding receptor 
Protein accessionYP_488031 
Protein GI86751535 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGATGA AGAGATTGAT GGTGGCGATC GGGCTTGCGC TCGCTGCGCC GGTGGCGGCC 
GATGCGCAGA ACATCGTGAT CGGCGCCAGC ATCCCCGATA CCGGGCCGGC CGCCGCGCCC
GCGATATGGC AGCGGTGGGG TTACCAGCTG GCGCTCGACG AGGCCAATGC GGCGGGCGGC
GTGCTCGGCA AGAAAGTCGA GATGCTCGCC TACGACAACC GCTGCAATCC GTCCGAAGCG
GTGAACGTCG CCAACAAGCT GATCGAGGCC AAAGTCGTCG CCATCGTCGG CGCGCATTGT
TCGTCGGCGA CGCTCGCCAC GATGCCGTTG ATCGCCGCGG CGAAGATTCC GCTGGTGGAC
GGCATCGCGT CGAGCCCGAA GATCACCGAT CTGTCCGGCG TCGGCGGCAA CGAATGGACG
TTCCGCATCA ACCCCTCCGA CGACGACATG ATGAACGCGC TCGGCATCTA TCTGAGCGGC
AGCTCGAAGA TCAAGCGTGT GGCCATATTG GGCGAGGACA CCGACTTCGG ACGCGGCGGC
GCCGCGGCCT TCGCGGCGGT CGCCAAGAAG CACGGACTCG AGGTGATCTC GACCGACTTC
CACCCGCAGA GCTATCCGGA CTTCACCGCG CTGCTGACGC GTATCCAGCA GAGCAAGCCG
GACGCGATCG CGATCTTCCA GCTCGCCGGC GATCAGCTCA ACTTCCTGCG CAATGCGATG
CAGCTCGGCG TGCGGATTCC GTTCATCGGC CGTTTCGACC CCGGCGGCAA CAATCTGCAG
ATCATCCAGG CCGGCGGCAT GGAGGGTTCG ATCACCGCGT GGACCTACAG CTATCTGGTC
GATACGGCCG CCAACAAGGC CTTCGCCGCG GAGATCGAGA AGCGCCACAA GACAACGCCG
GTGCTGCAGA CCTGGGCCGG CTACGACGCC ATGCGCCTGC TGCTGGCGGC GATCAAGAAC
GCCGGCGCGA CCGACCCGAC GGCGATCCGC GACGCGATCA AGAAGATCGA GTTCACCAAC
GTCATGGGGG CCAAAGTCAC CTTCGACGAC CACAACCAGG GTGGCAAGGT CGTGCTGATC
GAAGGCGTGG CCGACAAGAA GGTCAAGATC CTGAAGGAAG TCTCGCTGGC GAACTGA
 
Protein sequence
MLMKRLMVAI GLALAAPVAA DAQNIVIGAS IPDTGPAAAP AIWQRWGYQL ALDEANAAGG 
VLGKKVEMLA YDNRCNPSEA VNVANKLIEA KVVAIVGAHC SSATLATMPL IAAAKIPLVD
GIASSPKITD LSGVGGNEWT FRINPSDDDM MNALGIYLSG SSKIKRVAIL GEDTDFGRGG
AAAFAAVAKK HGLEVISTDF HPQSYPDFTA LLTRIQQSKP DAIAIFQLAG DQLNFLRNAM
QLGVRIPFIG RFDPGGNNLQ IIQAGGMEGS ITAWTYSYLV DTAANKAFAA EIEKRHKTTP
VLQTWAGYDA MRLLLAAIKN AGATDPTAIR DAIKKIEFTN VMGAKVTFDD HNQGGKVVLI
EGVADKKVKI LKEVSLAN