Gene RPB_3566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3566 
Symbol 
ID3911368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4086879 
End bp4088114 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content63% 
IMG OID637885468 
Productextracellular ligand-binding receptor 
Protein accessionYP_487172 
Protein GI86750676 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.172998 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.680161 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGCCC TCTCCCGTTC GATCGCCACC CTGGCGGCCG CCGCCCTGCT GTCCGCAGCC 
GCCGGACAGG CGATGGCGCA GAAGAAATAT GGCCCCGGCG CCAGCGACAC CGAAGTCAAG
ATCGGCAACA TCGTGCCTTA CTCGGGCCCG GCGTCGGCCT ATGGCAGCGT CGGCAAGGCG
CAAGAAGCCT ATTTCAAGAT GATCAACGAC AAGGGCGGCA TCAACGGCCG CAAGATCGTC
TACATCTCCA ACGACGACGC CTATTCGCCG CCGAAATCGG TCGAGCAGAC CCGTAAGCTG
GTCGAGAGCG ACGAGGTGCT GTTCATGTTC AGCCCGCTCG GCACGCCGTC CAACACCGCG
ATCCAGAAAT ATCTCAACGC CAAGAAAGTG CCGCATCTGT TCCTGGCGTC GGGCGCCACC
AAGTGGAACG ATCCGAAGCA CTTTCCGTGG ACGATGGGCT GGCTGCCGAG CTACCAGAGC
GAAGGCCGGA TCTACGCCAA GTATCTGATG AAGGAGAAGC CGGACGCCAA GATCGCCGTG
CTGTATCAGG GCGACGATTT CGGCAAGGAC TATCTCAAGG GCCTCAAGGA CGGCCTCGGC
GCCAAGGCTT CGCAGGTGGT GATCGAGGAC AGCTACGAGC TGACCGAGCC GACCGTCGAT
TCCCACATCG TCAAAATCAA GGCCGCCAAT CCCGACGTGC TGGTGATCTT CGCCACGCCG
AAATTCGCGG CGCAGACCAT CAAGAAGGTC GCCGAACTGG CGTGGAAGCC GATGATGATC
GTGCCGAACG TCTCGGCCTC GACCGGCAGC GTGATGAAGC CCGCCGGCTT CGAGAACGCC
CAGGGCATCG TCTCCGCCTC CTACGCCAAG GACGCCACCG ACAAGCAGTG GGAAAACGAC
CCCGGCATGA AGGCGTATTA CGAGTTCATG GAGAAGTATG CGCCGCAGGC CAGCCGCGCT
GACTCATCGT TCATGACCGG CTACAACATC GCCGAGACGG TCGCGGTGCT GATCAAGCAA
TGCGGCGACG ATCTGTCCCG CGAGAACGTC ATGAAGCAGG CGGCCAACCT CAAGGACGTC
CAGCTCGGCG GCCTGCTGCC GGGCATCAAG CTCAACACCA GCGCGACCGA CTTCGCGCCG
ATCGAACAGC TGCAGCTGAT GAAGTTCCAG GGCGAGAACT GGAAGCTGTT CGGCGACGTG
ATCGAGGGCG AAGTCGCCGC GCCGACCGGC GGCTAG
 
Protein sequence
MSALSRSIAT LAAAALLSAA AGQAMAQKKY GPGASDTEVK IGNIVPYSGP ASAYGSVGKA 
QEAYFKMIND KGGINGRKIV YISNDDAYSP PKSVEQTRKL VESDEVLFMF SPLGTPSNTA
IQKYLNAKKV PHLFLASGAT KWNDPKHFPW TMGWLPSYQS EGRIYAKYLM KEKPDAKIAV
LYQGDDFGKD YLKGLKDGLG AKASQVVIED SYELTEPTVD SHIVKIKAAN PDVLVIFATP
KFAAQTIKKV AELAWKPMMI VPNVSASTGS VMKPAGFENA QGIVSASYAK DATDKQWEND
PGMKAYYEFM EKYAPQASRA DSSFMTGYNI AETVAVLIKQ CGDDLSRENV MKQAANLKDV
QLGGLLPGIK LNTSATDFAP IEQLQLMKFQ GENWKLFGDV IEGEVAAPTG G