Gene RPD_0357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0357 
Symbol 
ID4020822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp421620 
End bp422846 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content59% 
IMG OID637960541 
Productextracellular ligand-binding receptor 
Protein accessionYP_567496 
Protein GI91974837 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000767142 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGTTCT ACGGCGTGAC ATCGTTGTCG GGCGCGGTGA GGCTTGCCTT TTTTGCTTCC 
GCGGTATGCG GCCCGGGCGC ATTCAATGGC GCGGCGGCTC AGGAGCCCGT GCGGATCGGT
GTGATCACCG ATATGACCGG GCCGTATTCT TCACTCTCCG GGCCGGGCGT CGTTGTCGGC
ATGAAAATGG CGGTCGATGA GTTTGGCGGA AAAGTCCTCG GGCAGCCGAT CGAGGTGCTC
AGCGCCGACA GCGGGCTGAA GGCCGATATC GCGCTCTCGC GCGCGCGCGA ATGGTACGAT
CGCCAGAATG TCCATATGAT CGTCGAATCG TCTGATTCCG GGTCGGCTGT CGCCCTTCAA
AAGCTCGGCG CCGACAAAAA GAAGATCACG ATGTTCCACT CGGGCACCAC GGCGCTCACG
AATCTCGAGT GCTCGCCTTA CGGGGTGCAT TATGCGTGGG ATACCTATTC CATGGCGAGC
GGAGCCGCCC GGGCAGCCGT CCAGGCGGGT GGGAATTCCT GGTACTTCAT CACCGCGGAC
TACGTCTTCG GGAAATCCCT CGAGGCCGAC GCATCCAAGA TCATACGCCA GCTCGGCGGC
GACATCATTG GCGGCGTTCG ACACCCGCTG AATGTGCCCG ACTTCGCGTC GTTTCTGCTG
TCCGCTCAAC AGTCCAAAGC CAAGGTCGTC GGACTGGCTA ATGCCGGGAG CGACACTCAG
AATGCCGTCA AGCAGGCCGC AGAGTTTGGG TTGGGCGGCG GACAGAAGGT CGTTCCGCTG
CTGATGTTCG ACACCGATGT GAAGGGGCTT GGACTAAAAG TCGCGCAAGG GATGGAATTC
GCGACGGCGT TCTATTGGGA CTACGACGAT AAATCGCGCG AATTCGCCAA CAAGTTCTTC
GCAATCCATA AGAGCATGCC GACGATGAAC CATGCAGGGT CCTATTCGGC AACCCTGCAG
TATCTGAAGG CTGTCCAGGC GACCGGCTCG CTGGATGCCG ACAAGGTGAT GAAGTACCTC
AAATCCGCAA AAATCGAAGA CGCTTTCGCC CGCAACGGCC GAATCCGCGT TGATGGACGG
ATGGTTCACG ACATCTATCA GGTGCGGGTC AAGACGCCGG AAGAATCCAC GGGCCCGTCA
GATATCCTGA AGGTCATTCT GACCATCAAG GGTGATGATG CCTTCATGCC CCTTGCGGAT
AGCACATGTC CGCTCGTCAA GAAGTAG
 
Protein sequence
MKFYGVTSLS GAVRLAFFAS AVCGPGAFNG AAAQEPVRIG VITDMTGPYS SLSGPGVVVG 
MKMAVDEFGG KVLGQPIEVL SADSGLKADI ALSRAREWYD RQNVHMIVES SDSGSAVALQ
KLGADKKKIT MFHSGTTALT NLECSPYGVH YAWDTYSMAS GAARAAVQAG GNSWYFITAD
YVFGKSLEAD ASKIIRQLGG DIIGGVRHPL NVPDFASFLL SAQQSKAKVV GLANAGSDTQ
NAVKQAAEFG LGGGQKVVPL LMFDTDVKGL GLKVAQGMEF ATAFYWDYDD KSREFANKFF
AIHKSMPTMN HAGSYSATLQ YLKAVQATGS LDADKVMKYL KSAKIEDAFA RNGRIRVDGR
MVHDIYQVRV KTPEESTGPS DILKVILTIK GDDAFMPLAD STCPLVKK