Gene RPC_4043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4043 
Symbol 
ID3969292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4490891 
End bp4492027 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content65% 
IMG OID637927147 
Producthypothetical protein 
Protein accessionYP_533888 
Protein GI90425518 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0767] ABC-type transport system involved in resistance to organic solvents, permease component 
TIGRFAM ID[TIGR00056] conserved hypothetical integral membrane protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGTTCAG AACCGCTGCT CGACACGATC AGCACAGGCG ATGGTTTGTT ACTGCGTCCG 
ACCGGATCAT GGACGGCGGT GCATGCCGCG GCGCTCGAGC ATTTGTTCGA CGTCGTCGCG
CCGCAGTTGC GGCAGGCCAA GGCGCTGACG ATCGACCTTG CCGAACTGCA TGAAATCGAC
ACCCTCGGTG CCTGGCTGTT GGAAAAGATC TCGCGCCGCG CTGCCCAGGC AGGGCATCCC
GCCAGCGTGG TCGGCGCCGC CGAGCGCTAT GCCGGGCTGA TCGACCAGGT GCGGCAAGTC
AACCGCCGGC AGCCGGCCAA TGCGGTGCGC GGAAGCCTGA TCCTGACGCG ACTCGAAGAC
ATCGGCCGGG CCACCTGGGG CGCGCGCGAA GACCTGTTCG CCTTCCTGCA GATGACGGGC
GCGCTCGCCA ACGCGTTGCT CGGCGTGCTG CGCCGGCCGC GCTCGTTGCG GCTGACGTCG
CTGGTCTATC AGATCTATCG GGTCGGTTGG CAGGCGATCC CGATCATCGG GCTGATCACC
TTCCTGATCG GCGCGATCAT CGCCCAGCAG GGCATCTTCC ATTTTCGCAA GTTCGGGGCG
GAGTCCTATG TCGTCGACAT GGTGGGCATC CTGGTGTTGC GCGAACTCGG CGTCTTGATC
GTGGCGATCA TGGTGGCCGG GCGATCCGGC AGCGCCTACA CCGCCGAACT CGGCTCGATG
AAGATGCGCG AGGAGATCGA CGCGCTGTCG ACCATGGATC TCGATCCGGT CGAGGTGTTG
ATCCTGCCGC GGGTGATCGC GCTGATTATC GCGCTGCCGA TCCTGGCTTT CCTGGGCGCG
ATGGCGGCGC TGTACGGCGG CGGGCTGGTC GCTTGGTTCT ATGGCGGCAT GAGTCCGACC
ATCTTCATCG CGCGACTGCA CGAGGCGGTC TCGATCACTC ATTTCGAGGT CGGCATCATC
AAGGCGCCGT TCATGGCGCT GGTGATCGGC ATCGTCGCCT GCAGCGAAGG CTTGCGGGTC
ATGGGCAGCG CGGAATCGCT CGGCCGGCAG ACCACCGCCT CGGTGGTGAA GTCGATCTTC
TGCGTGATCG TGCTCGACGG GCTGTTCGCC GTGTTCTTCG CCTCGATCGG AATGTAG
 
Protein sequence
MSSEPLLDTI STGDGLLLRP TGSWTAVHAA ALEHLFDVVA PQLRQAKALT IDLAELHEID 
TLGAWLLEKI SRRAAQAGHP ASVVGAAERY AGLIDQVRQV NRRQPANAVR GSLILTRLED
IGRATWGARE DLFAFLQMTG ALANALLGVL RRPRSLRLTS LVYQIYRVGW QAIPIIGLIT
FLIGAIIAQQ GIFHFRKFGA ESYVVDMVGI LVLRELGVLI VAIMVAGRSG SAYTAELGSM
KMREEIDALS TMDLDPVEVL ILPRVIALII ALPILAFLGA MAALYGGGLV AWFYGGMSPT
IFIARLHEAV SITHFEVGII KAPFMALVIG IVACSEGLRV MGSAESLGRQ TTASVVKSIF
CVIVLDGLFA VFFASIGM