Gene RPB_1474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1474 
Symbol 
ID3908787 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1663203 
End bp1664624 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content62% 
IMG OID637883369 
ProductABC transporter substrate-binding protein 
Protein accessionYP_485095 
Protein GI86748599 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0903726 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTCGTA CTTCTGGACA TCGTCTGATT TTCGCCGCCT TCACCGCCGC GATGATGGCG 
AGCACCCCGG TCGCCGCCCA GACCACCGTC ACGGTCGGCA TCGGCACCCA GGACACCACC
ACCAACACCG CGACCACCGG CGTCGTCATT CGGCAGCTGA AGCTGCTGGA GAAGTATCTT
CCCAAGGACG GCAAATACGC GAACGTCAAA TTCGAGTTCG ATTGGCAGAA TTTCACCTCC
GGCCCACCCG TCACCAACGG CATGATGGCC AACAAGCTGC AATTCGGCGG CATGGGTGAC
TATCCGCTGG TGGTGAACGG CTTCACCTTC CAGAGCAACC CCGAGAGCAA GAGCCGCCTG
ATCGCGGTCG CGGCCTACAG CCTCGACGGC TCCGGCAACG GCCTGGTGGT TCACAAGGAC
TCCCCGTACT ATCAGCTGTC CGATCTCAAG GGCAAATTGG TGAGCGTGCC GTTCGGCTCC
GCCGCGCACG GCATGATCCT AAAGGCGATG CAGGATCGCG GCTGGCCCGC GGACTATTGG
CAACTGGTGA GCCAGAGCCC GGAAGTCGGC TCGACCAATC TCCAGGAGAA GAAGATCGAC
GCCCACGCCG ATTTCGTCCC GTTCGCCGAA CTACTGCCGT TCCGCGGTTT CGCCCGCAAG
ATCTTCGACG GCGTCGAGAC CAATCTGCCG ACCTGGCACG GCGTGGTGGT GCGTACCGAC
TTCGCCGAGA AATATCCCGA AGTCGTGGTC GCCTATGTCA AGGCGATCAT CGCGGCCAAT
GCCTGGCTGC GCGCCGATCC GAAGCTCGCC GCCGAAAAGA TCCAGGAATG GACCGGCATC
AACAAGGAAG TGGTCTACAT CTTCCTGGGA CCGGGCGGCA ACATGACCAC CGATCCGACG
ATCAAGCCGC AGCTGATCGA GGCCGCCGCG GTCGACGTCA AGGTGCTGCA GAATCTCGGC
CGCATGAAGG AATTCGATCC GAAGAGCTGG GTCGACGACA GCTACATCCG CAAGGCCTAT
GCCGAACTGA AGCTCGACTA CGACGCCGAG CTGAAGAGCA CCAAGAACTA CGAAATCAGC
GGCGAGGACA AATTCTGCAA GAAGCCGATC ACCGAGCCGC GCAAGGCCGG CGAGGTCTGG
GTCGACGGCG ACGGCATCGA GCCGTTCAGC AGCGCGGCCT GCACGCTCGC CGCTTATGCG
GACATCAAGG CCAAGGGCAA GAAGATCAAC ATGGCCTATG TGTTCGACTC CGCCCGGGGC
ATCAAGCTGT TCGCCGACCA GGCCTACTAC ACGGTCGGCG CCGACAAGGC GCAGTTGTCG
CCGTTCCTGC TCAAGAAGGA CGCCGAAGCG CATGCCGCCA AGATCAACGG CAAGGTGCTG
AATTTCGACG AAGCCCTCAA GTCAGCGGTC AGCGGAGGTT GA
 
Protein sequence
MVRTSGHRLI FAAFTAAMMA STPVAAQTTV TVGIGTQDTT TNTATTGVVI RQLKLLEKYL 
PKDGKYANVK FEFDWQNFTS GPPVTNGMMA NKLQFGGMGD YPLVVNGFTF QSNPESKSRL
IAVAAYSLDG SGNGLVVHKD SPYYQLSDLK GKLVSVPFGS AAHGMILKAM QDRGWPADYW
QLVSQSPEVG STNLQEKKID AHADFVPFAE LLPFRGFARK IFDGVETNLP TWHGVVVRTD
FAEKYPEVVV AYVKAIIAAN AWLRADPKLA AEKIQEWTGI NKEVVYIFLG PGGNMTTDPT
IKPQLIEAAA VDVKVLQNLG RMKEFDPKSW VDDSYIRKAY AELKLDYDAE LKSTKNYEIS
GEDKFCKKPI TEPRKAGEVW VDGDGIEPFS SAACTLAAYA DIKAKGKKIN MAYVFDSARG
IKLFADQAYY TVGADKAQLS PFLLKKDAEA HAAKINGKVL NFDEALKSAV SGG