Gene RPB_3575 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3575 
Symbol 
ID3911377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4099019 
End bp4100176 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content65% 
IMG OID637885477 
Productextracellular ligand-binding receptor 
Protein accessionYP_487181 
Protein GI86750685 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.807298 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.720427 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGGGT TCAAACTATC TGCTGCTGCG TTCGCCGTGG CGATCGCGCT GCCGGCGATG 
TCCGGCGCCG CGCTCGCCGA GACCAATGAA ATCACCGTGG GCATCACCGT CACCACGACG
GGCCCGGCCG CCGCGCTCGG CATTCCGGAG CGCAACGCGC TGGAATTCGT GGCCAAGGAA
ATCGGCGGTC ACCCGATCAA GATGATCGTG CTCGACGACG GCGGCGACCC GACCGCGGCG
ACCACCAACG CGCGGCGTTT CGTCACGGAG TCGAAGGCCG ACGTGATCAT GGGTTCGTCG
GTGACGCCGC CGACCGTGGC GGTCTCGAAC GTCGCCAACG AGGCGCAGGT GCCGCATATC
GCGCTGGCGC CGCTGCCGGT CACGCCGGAG CGGGCGAAGT GGTCCGTGGT GATGCCGCAG
CCGATCCCGA TCATGGGCAA GGTGCTCTAC GAGCACATGA AGAAGAACAA CATCAAGACC
GTCGGCTACA TCGGCTATTC CGACAGCTAC GGCGATCTGT GGTTCAACGA TCTCAAGAAG
CAGGGCGAGG CGATGGGCCT CAAGATCGTC GCCGAGGAAC GCTTCGCGCG CCCCGACACC
TCGGTCGCGG GTCAGGTGCT GAAGCTCGTT GCCGCCAATC CCGACGCCAT CCTGGTCGGG
GCGTCCGGCA CCGCCGCGGC GCTGCCGCAG ACCGCGCTGC GCGAGCGCGG CTACAACGGG
CTGATCTATC AGACCCACGG CGCCGCCTCG ATGGACTTCA TCCGCATCGC CGGCAAGTCC
GCCGAGGGCG TGCTGATGGC GTCCGGCCCG GTGATGGATC CGGAAGGCCA GAACGACAGC
GCGCTGACCA AGAAGCCCGG CCTCGAACTC AACACGGCCT ATGAAACCAA GTACGGCCCG
AACAGCCGCA GCCAGTTCGC CGGCCACTCC TTCGACGCCT TCAAGGTACT CGAGCGCGTG
ATTCCGGTGG CGCTGAAGAC CGCCAAGCCC GGCACGCAGG AATTCCGCGA AGCGATCCGT
AAGGCGCTGC TCACCGAAAA GGACATCGCG GCGAGCCAGG GCGTCTACAG CTTCACCGAG
ACGGATCGCT ATGGTCTCGA CGATCGTTCG CGCATCCTGC TGACGGTGAA GAACGGCAAA
TACGTCATCG TCAAGTAA
 
Protein sequence
MNGFKLSAAA FAVAIALPAM SGAALAETNE ITVGITVTTT GPAAALGIPE RNALEFVAKE 
IGGHPIKMIV LDDGGDPTAA TTNARRFVTE SKADVIMGSS VTPPTVAVSN VANEAQVPHI
ALAPLPVTPE RAKWSVVMPQ PIPIMGKVLY EHMKKNNIKT VGYIGYSDSY GDLWFNDLKK
QGEAMGLKIV AEERFARPDT SVAGQVLKLV AANPDAILVG ASGTAAALPQ TALRERGYNG
LIYQTHGAAS MDFIRIAGKS AEGVLMASGP VMDPEGQNDS ALTKKPGLEL NTAYETKYGP
NSRSQFAGHS FDAFKVLERV IPVALKTAKP GTQEFREAIR KALLTEKDIA ASQGVYSFTE
TDRYGLDDRS RILLTVKNGK YVIVK