Gene RPD_0052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0052 
Symbol 
ID4020506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp64609 
End bp65649 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content63% 
IMG OID637960228 
Productinner-membrane translocator 
Protein accessionYP_567193 
Protein GI91974534 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4177] ABC-type branched-chain amino acid transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.460999 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCAT TGATCTCGAA CCCCACCGCT CGTCGCACGC CGATCCGGAT GTTCGTCGGC 
CTGTTGGCGC TGTTCGCGGT GCTGCCCTGG CTGCTGACGG CGATCGGCCT CGGCGTCAAT
CTGGCCACCG AAGTGCTGAT CATCGCGCTG TTCGCGATGA GCTACAACAT TCTGCTGGGC
ACCACCGGCC TCGCATCTTT CGGCCACGCC GCATTCTTCG GCTCGGGCGC CTATGCGGTC
GGAATCCTGC AGCGCTATGG CCTGAATGGA ATCGTCATCA GCCTGGCCGC AGCGATCGCC
GCCGGGCTGG TCGCGTCACT GTTCGTCGGC CTGCTTGTCA GAAAGAAGCG CGGAATCTAT
TTCGGCCTGC TGACGCTGTC GTTCGGCCAG ATGTTCTACA TCGTGGCGCT GCGCTGGGAT
GAGCTGACCG GCGGCGAGAC CGGGCTGACG GGCCTGAAGC GGCCCGCGCC GTTCGGCCTC
GATCTCAGCA GCCATATCAA TTTCTACTAC TTCACGCTGG CGATCTTCAT GGTCGCGCTG
TGGCTGATCT GGCGGATCAC CAATTCGCCG TTCGGCAGTC TGCTGACGGC GATCAAGAGC
AACGAGGTCC GCACCCAGTA TCTCGGCTAC GACACCGCGC TCTACAAGCT GGCCGCGATC
GTCATCTCCG GATCGTTCTC CGGACTCGCC GGCGGCCTCT ATGCGTGGTT CCAGTACGCG
GCCTATCCGC AGAACCTGTT CTGGATCGAA TCCGGCAACA TCGTCATCCT GACGTTGCTC
GGCGGCGGCC TCTCCAGCTT CTTCGGCCCG ATCCTCGGCG CCGCGGTGTT CGTCGGCGCG
CAGGACCTGA TCAGCGGCTA CACCCAGCAC TGGATGTTCT TCTTCGGGCT GATCTTCATC
GTCGTGGTCA CGACGTTCCC CAACGGCCTG CCGGAAGCCT TCGCGAAATT CGTCGCTTCG
GCGCGGCGGA GGTTCGGCCG CACGGCCGGG GAGACCGTGA TCTCTGCGCA ATCCTCATCG
CGCTACGGAG CGGACCAATG A
 
Protein sequence
MNALISNPTA RRTPIRMFVG LLALFAVLPW LLTAIGLGVN LATEVLIIAL FAMSYNILLG 
TTGLASFGHA AFFGSGAYAV GILQRYGLNG IVISLAAAIA AGLVASLFVG LLVRKKRGIY
FGLLTLSFGQ MFYIVALRWD ELTGGETGLT GLKRPAPFGL DLSSHINFYY FTLAIFMVAL
WLIWRITNSP FGSLLTAIKS NEVRTQYLGY DTALYKLAAI VISGSFSGLA GGLYAWFQYA
AYPQNLFWIE SGNIVILTLL GGGLSSFFGP ILGAAVFVGA QDLISGYTQH WMFFFGLIFI
VVVTTFPNGL PEAFAKFVAS ARRRFGRTAG ETVISAQSSS RYGADQ