Gene RPD_1353 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1353 
Symbol 
ID4021830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1524114 
End bp1525250 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content64% 
IMG OID637961546 
Producthypothetical protein 
Protein accessionYP_568492 
Protein GI91975833 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0767] ABC-type transport system involved in resistance to organic solvents, permease component 
TIGRFAM ID[TIGR00056] conserved hypothetical integral membrane protein 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTTCTG CGCCAGTTCT GACTGTAGTG ACCGACGGCG ACGTGCTGGA ACTGCATCCC 
GGCGGCGCGT GGATCGCCAG CCAATCGGCC GCGCTGGAAC GGCTGTTCGA GGGCGTCGCG
CCGCAGGTCG CCGCTGCAAA ATCACTCAAG ATCGACATGA CCGAGGTGAT CGAGATCGAC
ACCATCGGCG CTTGGCTGCT GGAGAAGGCG TCACGCGAGG CCGCGCAGGC TGGTCGCACC
GCGCATTTCG TCGGGGTCGG CGAACGCTAC GCCGGGTTGA TCGAAGAGGT CCGGCAAGTC
AACCGCCACA GACCGACGCC GAAACCCAAG GTCAATCCGA TCATCGCGCG ACTCGATCAG
GTCGGTCGCT CAGCCTGGAG CGCCACGCAG GACATCGCGG TATTCCTCGA CATGTTCGGT
GCGCTCGGCG TCGCGTTGCT CGGCGTGCTG CGGCGGCCGC GTTCGCTGCG GCTGACCTCG
TTGACCTACC AGATCTATCG CGTCGGCTGG CGGGCGATCC CGATCGTCGT GCTGATCACC
TTCTTGATCG GCGCGATCAT CGCGCAGCAG GGCATTTTCC ACTTCCGCAA ATTTGGTGCG
GAATCCTACG TGGTCGACAT GGTCGGCATC CTGGTGTTGC GCGAGATCGG CGTTCTGATC
GTCGCCATCA TGGTCGCCGG CCGCTCGGGC AGCGCCTACA CGGCCGAACT CGGCTCGATG
AAAATGCGCG AGGAGATCGA CGCGCTATCG ACCATGGGGC TCGACCCGGT CGAGGTGCTG
ATCCTGCCAC GCATCATCGC GCTGGTGATC GCGCTGCCGA TCCTGACCTT CATTGGATCG
ATGTCGGCGC TGTACGGCGG ATTGCTGACC GCGTGGTTCT ACGGCGGCAT GCAGCCCGCG
GTATACATCG CGCGGTTGCA CGAGGCGGTG TCGCTCAACA GTTTCGAGGT CGGGATCTGG
AAGGCGCCGT TCATGGCGCT GGTGATCGGC ATCGTCGCCT GCAGCGAGGG CCTGCGGGTC
AAGGGCAGCG CCGAGTCGCT CGGCCTGCAG ACCACCACTT CGGTGGTGAA GTCGATCTTT
CTGGTGATCG TGCTCGATGG CCTGTTCGCT GTATTCTTCG CCTCGATCGG GTTGTAG
 
Protein sequence
MISAPVLTVV TDGDVLELHP GGAWIASQSA ALERLFEGVA PQVAAAKSLK IDMTEVIEID 
TIGAWLLEKA SREAAQAGRT AHFVGVGERY AGLIEEVRQV NRHRPTPKPK VNPIIARLDQ
VGRSAWSATQ DIAVFLDMFG ALGVALLGVL RRPRSLRLTS LTYQIYRVGW RAIPIVVLIT
FLIGAIIAQQ GIFHFRKFGA ESYVVDMVGI LVLREIGVLI VAIMVAGRSG SAYTAELGSM
KMREEIDALS TMGLDPVEVL ILPRIIALVI ALPILTFIGS MSALYGGLLT AWFYGGMQPA
VYIARLHEAV SLNSFEVGIW KAPFMALVIG IVACSEGLRV KGSAESLGLQ TTTSVVKSIF
LVIVLDGLFA VFFASIGL